Interface | Description |
---|---|
IDocumentParser |
Implementations are responsible for parsing a document to
extract its text and metadata, as well as any embedded documents
(when applicable).
|
IDocumentParserFactory |
Factory providing document parsers for documents.
|
IHintsAwareParser |
Indicates that a parser can be initialized with generic parser configuration
settings and it will try to apply any such settings the best it can
when possible to do so.
|
Class | Description |
---|---|
EmbeddedConfig |
Configuration settings affecting how embedded documents are handled
by parsers.
|
GenericDocumentParserFactory |
Generic document parser factory.
|
OCRConfig |
OCR configuration details.
|
ParseHints |
Configuration settings influencing how documents are parsed by various
parsers.
|
Exception | Description |
---|---|
DocumentParserException |
Exception thrown upon encountering a non-recoverable issue parsing a
document.
|
Copyright © 2009–2021 Norconex Inc.. All rights reserved.