Interface IDocumentTagger
- All Superinterfaces:
IImporterHandler
- All Known Implementing Classes:
AbstractCharStreamTagger, AbstractDocumentTagger, AbstractStringTagger, CharacterCaseTagger, CharsetTagger, ConstantTagger, CopyTagger, CountMatchesTagger, CurrentDateTagger, DateFormatTagger, DebugTagger, DeleteTagger, DocumentLengthTagger, DOMTagger, ExternalTagger, FieldReportTagger, ForceSingleValueTagger, HierarchyTagger, KeepOnlyTagger, LanguageTagger, MergeTagger, RegexTagger, RenameTagger, ReplaceTagger, ScriptTagger, SplitTagger, TextBetweenTagger, TextPatternTagger, TextStatisticsTagger, TitleGeneratorTagger, TruncateTagger, URLExtractorTagger, UUIDTagger
public interface IDocumentTagger extends IImporterHandler
Tags a document with extra metadata information, or manipulates existing metadata information.
- Author:
Pascal Essiembre
Method Summary
Modifier and Type: void
Method: tagDocument(HandlerDoc doc, InputStream input, ParseState parseState)
Description: Tags a document with extra metadata information.
Method Detail
tagDocument
void tagDocument(HandlerDoc doc, InputStream input, ParseState parseState) throws ImporterHandlerException
Tags a document with extra metadata information.
- Parameters:
doc - document
input - document content
parseState - whether the document has been parsed already or not (a parsed document should normally be text-based)
- Throws:
ImporterHandlerException - problem tagging the document
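As an illustration of the contract described above, the following is a minimal sketch of a custom tagger, not an implementation taken from the library. To keep it compilable on its own, it declares simplified stand-ins for HandlerDoc, ParseState, ImporterHandlerException, and IDocumentTagger itself. Those stand-ins are assumptions: the map-based metadata, the PRE/POST constant names, and the "example.*" field names are hypothetical, and the real library types should be consulted before adapting this. ByteCountTagger and TaggerDemo are likewise invented names.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// --- Hypothetical stand-ins so the sketch compiles without the library ---

// Assumed two-state flag: document not yet parsed (PRE) or already parsed (POST).
enum ParseState { PRE, POST }

// Simplified stand-in for the library's checked exception.
class ImporterHandlerException extends Exception {
    ImporterHandlerException(String message, Throwable cause) {
        super(message, cause);
    }
}

// Simplified stand-in: the real HandlerDoc has its own metadata type.
class HandlerDoc {
    private final Map<String, List<String>> metadata = new HashMap<>();

    Map<String, List<String>> getMetadata() {
        return metadata;
    }
}

// Mirrors the method signature documented on this page.
interface IDocumentTagger {
    void tagDocument(HandlerDoc doc, InputStream input, ParseState parseState)
            throws ImporterHandlerException;
}

// --- Example tagger (hypothetical) ---

// Adds the content size in bytes and the parse state as extra metadata fields.
class ByteCountTagger implements IDocumentTagger {
    @Override
    public void tagDocument(HandlerDoc doc, InputStream input, ParseState parseState)
            throws ImporterHandlerException {
        try {
            long count = 0;
            byte[] buffer = new byte[4096];
            int read;
            while ((read = input.read(buffer)) != -1) {
                count += read;
            }
            addValue(doc, "example.byteCount", Long.toString(count));
            addValue(doc, "example.parseState", parseState.name());
        } catch (IOException e) {
            // Surface read failures through the handler's declared exception.
            throw new ImporterHandlerException("Could not read document content.", e);
        }
    }

    private static void addValue(HandlerDoc doc, String field, String value) {
        doc.getMetadata().computeIfAbsent(field, k -> new ArrayList<>()).add(value);
    }
}

// Small helper showing one way to invoke the tagger on in-memory text.
final class TaggerDemo {
    static Map<String, List<String>> tagText(String text, ParseState state) {
        HandlerDoc doc = new HandlerDoc();
        InputStream input = new ByteArrayInputStream(text.getBytes(StandardCharsets.UTF_8));
        try {
            new ByteCountTagger().tagDocument(doc, input, state);
        } catch (ImporterHandlerException e) {
            throw new IllegalStateException(e);
        }
        return doc.getMetadata();
    }
}
```

Calling TaggerDemo.tagText("hello", ParseState.POST) would leave two fields in the stand-in metadata map: the byte count of the content and the parse state name. A tagger written against the real library would implement the library's IDocumentTagger and write to the metadata object exposed by the real HandlerDoc instead.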