Package com.norconex.importer.parser
-
Interface Summary Interface Description IDocumentParser Implementations are responsible for parsing a document to extract its text and metadata, as well as any embedded documents (when applicable).IDocumentParserFactory Factory providing document parsers for documents.IHintsAwareParser Indicates that a parser can be initialized with generic parser configuration settings and it will try to apply any such settings the best it can when possible to do so. -
Class Summary Class Description EmbeddedConfig Configuration settings affecting how embedded documents are handled by parsers.GenericDocumentParserFactory Generic document parser factory.OCRConfig OCR configuration details.ParseHints Configuration settings influencing how documents are parsed by various parsers. -
Enum Summary Enum Description ParseState Act as a flag indicating if a document has been parsed or not in a given process flow. -
Exception Summary Exception Description DocumentParserException Exception thrown upon encountering a non-recoverable issue parsing a document.