Package com.norconex.importer.handler
Interface IImporterHandler
-
- All Known Subinterfaces:
IDocumentFilter
,IDocumentSplitter
,IDocumentTagger
,IDocumentTransformer
- All Known Implementing Classes:
AbstractCharStreamFilter
,AbstractCharStreamTagger
,AbstractCharStreamTransformer
,AbstractDocumentFilter
,AbstractDocumentSplitter
,AbstractDocumentTagger
,AbstractDocumentTransformer
,AbstractStringFilter
,AbstractStringTagger
,AbstractStringTransformer
,CharacterCaseTagger
,CharsetTagger
,CharsetTransformer
,ConstantTagger
,CopyTagger
,CountMatchesTagger
,CsvSplitter
,CurrentDateTagger
,DateFormatTagger
,DateMetadataFilter
,DebugTagger
,DeleteTagger
,DocumentLengthTagger
,DOMContentFilter
,DOMDeleteTransformer
,DOMFilter
,DOMPreserveTransformer
,DOMSplitter
,DOMTagger
,EmptyFilter
,EmptyMetadataFilter
,ExternalTagger
,ExternalTransformer
,FieldReportTagger
,ForceSingleValueTagger
,HierarchyTagger
,ImageTransformer
,KeepOnlyTagger
,LanguageTagger
,MergeTagger
,NoContentTransformer
,NumericMetadataFilter
,PDFPageSplitter
,ReduceConsecutivesTransformer
,ReferenceFilter
,RegexContentFilter
,RegexMetadataFilter
,RegexReferenceFilter
,RegexTagger
,RejectFilter
,RenameTagger
,ReplaceTagger
,ReplaceTransformer
,ScriptFilter
,ScriptTagger
,ScriptTransformer
,SplitTagger
,StripAfterTransformer
,StripBeforeTransformer
,StripBetweenTransformer
,SubstringTransformer
,TextBetweenTagger
,TextFilter
,TextPatternTagger
,TextStatisticsTagger
,TitleGeneratorTagger
,TranslatorSplitter
,TruncateTagger
,URLExtractorTagger
,UUIDTagger
,XMLStreamSplitter
public interface IImporterHandler
Identifies a class as being an import handler. Handlers performs specific tasks on the importing content (other than parsing to extract raw content). They can be invoked before or after a document is parsed. There are four types of handlers currently supported:
IDocumentFilter
: accepts or reject an incoming document.IDocumentTagger
: modifies a document metadata.IDocumentTransformer
: modifies a document content.IDocumentSplitter
: splits a document into multiple ones.
- Author:
- Pascal Essiembre