public abstract class AbstractDocumentSplitter extends AbstractImporterHandler implements IDocumentSplitter
Base class for splitters.
Subclasses inherit this IXMLConfigurable configuration:
<!-- multiple "restrictTo" tags allowed (only one needs to match) -->
<restrictTo>
  <fieldMatcher
      method="[basic|csv|wildcard|regex]"
      ignoreCase="[false|true]"
      ignoreDiacritic="[false|true]"
      partial="[false|true]">
    (field-matching expression)
  </fieldMatcher>
  <valueMatcher
      method="[basic|csv|wildcard|regex]"
      ignoreCase="[false|true]"
      ignoreDiacritic="[false|true]"
      partial="[false|true]">
    (value-matching expression)
  </valueMatcher>
</restrictTo>
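The "only one needs to match" rule can be illustrated with a minimal sketch. It uses stand-in classes, not the Norconex matcher classes: RegexRestriction and RestrictionDemo are hypothetical names, only method="regex" is modeled, and ignoreDiacritic is left out. The sketch also assumes that a handler with no restrictions applies to every document, and that a restriction matches when a value of a field selected by the field matcher satisfies the value matcher.

```java
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical demo of "any restriction may match"; not Norconex code.
final class RestrictionDemo {
    static boolean isApplicable(
            List<RegexRestriction> restrictions,
            Map<String, List<String>> metadata) {
        // Assumption: no restrictions means the handler applies to everything.
        if (restrictions.isEmpty()) {
            return true;
        }
        // Only one restriction needs to match.
        return restrictions.stream().anyMatch(r -> r.matches(metadata));
    }

    public static void main(String[] args) {
        Map<String, List<String>> meta =
                Map.of("document.contentType", List.of("text/csv"));
        List<RegexRestriction> restrictions = List.of(
                new RegexRestriction(
                        "document\\.contentType", "application/pdf", false, false),
                new RegexRestriction(
                        "document\\.contenttype", "CSV", true, true));
        System.out.println(isApplicable(restrictions, meta)); // prints "true"
    }
}

// Hypothetical stand-in for one <restrictTo> entry with method="regex".
final class RegexRestriction {
    private final Pattern field;
    private final Pattern value;
    private final boolean partial;

    RegexRestriction(
            String fieldRegex, String valueRegex,
            boolean ignoreCase, boolean partial) {
        int flags = ignoreCase ? Pattern.CASE_INSENSITIVE : 0;
        this.field = Pattern.compile(fieldRegex, flags);
        this.value = Pattern.compile(valueRegex, flags);
        this.partial = partial;
    }

    // partial="true" accepts a match anywhere in the text;
    // partial="false" requires the whole text to match.
    private boolean test(Pattern pattern, String text) {
        return partial
                ? pattern.matcher(text).find()
                : pattern.matcher(text).matches();
    }

    // True when some value of a matching field satisfies the value matcher.
    boolean matches(Map<String, List<String>> metadata) {
        for (Map.Entry<String, List<String>> entry : metadata.entrySet()) {
            if (!test(field, entry.getKey())) {
                continue;
            }
            for (String v : entry.getValue()) {
                if (test(value, v)) {
                    return true;
                }
            }
        }
        return false;
    }
}
```

In the demo, the first restriction fails (the content type is not application/pdf) while the second one matches thanks to ignoreCase and partial, so the document is considered applicable.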
Constructor and Description |
---|
AbstractDocumentSplitter() |
Modifier and Type | Method and Description |
---|---|
protected abstract List<Doc> | splitApplicableDocument(HandlerDoc doc, InputStream input, OutputStream output, ParseState parseState) |
List<Doc> | splitDocument(HandlerDoc doc, InputStream docInput, OutputStream docOutput, ParseState parseState) |
Methods inherited from class AbstractImporterHandler: addRestriction, addRestriction, addRestrictions, clearRestrictions, detectCharsetIfBlank, equals, getRestrictions, hashCode, isApplicable, loadFromXML, loadHandlerFromXML, removeRestriction, removeRestriction, saveHandlerToXML, saveToXML, toString
public final List<Doc> splitDocument(HandlerDoc doc, InputStream docInput, OutputStream docOutput, ParseState parseState) throws ImporterHandlerException
Specified by: splitDocument in interface IDocumentSplitter
Throws: ImporterHandlerException
protected abstract List<Doc> splitApplicableDocument(HandlerDoc doc, InputStream input, OutputStream output, ParseState parseState) throws ImporterHandlerException
Throws: ImporterHandlerException
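The pairing of a final splitDocument with an abstract splitApplicableDocument suggests a template-method arrangement: the base class decides whether the handler applies, and subclasses supply only the splitting logic. The following is a minimal sketch of that idea using simplified stand-in types, not the Norconex classes. SketchSplitter, BlankLineSplitter, and SplitterDemo are hypothetical names, documents are reduced to a reference string plus an input stream, and returning an empty list for non-applicable documents is an assumption rather than documented behavior.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical demo driver; not part of Norconex Importer.
final class SplitterDemo {
    public static void main(String[] args) throws IOException {
        SketchSplitter splitter = new BlankLineSplitter();
        byte[] content = "first\n\nsecond\n".getBytes(StandardCharsets.UTF_8);

        // Applicable reference: the subclass logic runs.
        System.out.println(splitter.splitDocument(
                "notes.txt", new ByteArrayInputStream(content))); // [first, second]

        // Non-applicable reference: the subclass is never called.
        System.out.println(splitter.splitDocument(
                "image.png", new ByteArrayInputStream(content))); // []
    }
}

// Stand-in for the base class (NOT the real AbstractDocumentSplitter).
abstract class SketchSplitter {
    // Simplified applicability check; the real class inherits isApplicable
    // from its parent and evaluates the configured restrictions.
    protected abstract boolean isApplicable(String reference);

    // Final entry point: subclasses cannot bypass the applicability check.
    public final List<String> splitDocument(String reference, InputStream input)
            throws IOException {
        if (!isApplicable(reference)) {
            // Assumption: a non-applicable document yields no children.
            return Collections.emptyList();
        }
        return splitApplicableDocument(reference, input);
    }

    // Subclasses implement only the actual splitting.
    protected abstract List<String> splitApplicableDocument(
            String reference, InputStream input) throws IOException;
}

// Example subclass: splits ".txt" content into blank-line-separated parts.
final class BlankLineSplitter extends SketchSplitter {
    @Override
    protected boolean isApplicable(String reference) {
        return reference.endsWith(".txt");
    }

    @Override
    protected List<String> splitApplicableDocument(
            String reference, InputStream input) throws IOException {
        String text = new String(input.readAllBytes(), StandardCharsets.UTF_8);
        List<String> children = new ArrayList<>();
        for (String part : text.split("\\R\\s*\\R")) {
            if (!part.isBlank()) {
                children.add(part.strip());
            }
        }
        return children;
    }
}
```

Keeping the entry point final means every subclass honors the restrictions in the same way, while each splitter stays focused on its own format.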
Copyright © 2009–2023 Norconex Inc. All rights reserved.