- AbstractCharStreamFilter - Class in com.norconex.importer.handler.filter
-
Base class for filters dealing with the body of text documents only.
- AbstractCharStreamFilter() - Constructor for class com.norconex.importer.handler.filter.AbstractCharStreamFilter
-
- AbstractCharStreamTagger - Class in com.norconex.importer.handler.tagger
-
Base class for taggers dealing with the body of text documents only.
- AbstractCharStreamTagger() - Constructor for class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
-
- AbstractCharStreamTransformer - Class in com.norconex.importer.handler.transformer
-
Base class for transformers dealing with text documents only.
- AbstractCharStreamTransformer() - Constructor for class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
-
- AbstractDocumentFilter - Class in com.norconex.importer.handler.filter
-
Base class for document filters.
- AbstractDocumentFilter() - Constructor for class com.norconex.importer.handler.filter.AbstractDocumentFilter
-
- AbstractDocumentSplitter - Class in com.norconex.importer.handler.splitter
-
Base class for splitters.
- AbstractDocumentSplitter() - Constructor for class com.norconex.importer.handler.splitter.AbstractDocumentSplitter
-
- AbstractDocumentTagger - Class in com.norconex.importer.handler.tagger
-
Base class for taggers.
- AbstractDocumentTagger() - Constructor for class com.norconex.importer.handler.tagger.AbstractDocumentTagger
-
- AbstractDocumentTransformer - Class in com.norconex.importer.handler.transformer
-
Base class for transformers.
- AbstractDocumentTransformer() - Constructor for class com.norconex.importer.handler.transformer.AbstractDocumentTransformer
-
- AbstractImporterHandler - Class in com.norconex.importer.handler
-
Base class for handlers applying only to certain type of documents
by providing a way to restrict applicable documents based on
a metadata field value, where the value matches a regular expression.
- AbstractImporterHandler(String) - Constructor for class com.norconex.importer.handler.AbstractImporterHandler
-
- AbstractOnMatchFilter - Class in com.norconex.importer.handler.filter
-
Convenience base class for implementing filters offering the include/exclude
"onmatch" option.
- AbstractOnMatchFilter() - Constructor for class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
- AbstractStringFilter - Class in com.norconex.importer.handler.filter
-
Base class to facilitate creating filters based on text content, loading
text into
StringBuilder
for memory processing.
- AbstractStringFilter() - Constructor for class com.norconex.importer.handler.filter.AbstractStringFilter
-
- AbstractStringTagger - Class in com.norconex.importer.handler.tagger
-
Base class to facilitate creating taggers based on text content, loading
text into
StringBuilder
for memory processing.
- AbstractStringTagger() - Constructor for class com.norconex.importer.handler.tagger.AbstractStringTagger
-
- AbstractStringTransformer - Class in com.norconex.importer.handler.transformer
-
Base class to facilitate creating transformers on text content, loading
text into a
StringBuilder
for memory processing.
- AbstractStringTransformer() - Constructor for class com.norconex.importer.handler.transformer.AbstractStringTransformer
-
- AbstractTikaParser - Class in com.norconex.importer.parser.impl
-
Base class wrapping Apache Tika parser for use by the importer.
- AbstractTikaParser(Parser) - Constructor for class com.norconex.importer.parser.impl.AbstractTikaParser
-
Creates a new Tika-based parser.
- AbstractTikaParser.MergeEmbeddedParser - Class in com.norconex.importer.parser.impl
-
- AbstractTikaParser.RecursiveParser - Interface in com.norconex.importer.parser.impl
-
- AbstractTikaParser.SplitEmbbededParser - Class in com.norconex.importer.parser.impl
-
- acceptDocument(String, InputStream, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
-
- acceptDocument(String, InputStream, ImporterMetadata, boolean) - Method in interface com.norconex.importer.handler.filter.IDocumentFilter
-
Whether to accepts a document.
- addCondition(DateMetadataFilter.Operator, Date) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- addCondition(NumericMetadataFilter.Operator, double) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
- addConditionFromNow(DateMetadataFilter.Operator, DateMetadataFilter.TimeUnit, int, boolean) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- addConditionFromToday(DateMetadataFilter.Operator, DateMetadataFilter.TimeUnit, int, boolean) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- addConstant(String, String) - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
-
- addCopyDetails(String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.CopyTagger
-
Adds copy instructions.
- addDOMExtractDetails(DOMTagger.DOMExtractDetails) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Adds DOM extraction details.
- addDOMExtractDetails(String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
- addDOMExtractDetails(String, String, boolean, String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
- addEnvironmentVariable(String, String) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Adds an environment variables to the list of previously
assigned variables (if any).
- addEnvironmentVariable(String, String) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Adds an environment variables to the list of previously
assigned variables (if any).
- addEnvironmentVariable(String, String) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Adds an environment variables to the list of previously
assigned variables (if any).
- addEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Adds the environment variables, keeping environment variables previously
assigned.
- addEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Adds the environment variables, keeping environment variables previously
assigned.
- addEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Adds the environment variables, keeping environment variables previously
assigned.
- addField(String) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
- addField(String) - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
- addFieldCase(String, String) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
- addFieldCase(String, String, String) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Adds field case changing instructions.
- addHierarcyDetails(String, String, String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
-
- addHierarcyDetails(HierarchyTagger.HierarchyDetails) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
-
Adds hierarchy instructions.
- addMatchDetails(CountMatchesTagger.MatchDetails) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Adds a match details.
- addMerge(MergeTagger.Merge) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger
-
- addMetadataExtractionPattern(String, String) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Adds a metadata extraction pattern that will extract the whole text
matched into the given field.
- addMetadataExtractionPattern(String, String, int) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Adds a metadata extraction pattern, which will extract the value from
the specified group index upon matching.
- addMetadataExtractionPattern(Pattern, boolean) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
- addMetadataExtractionPattern(Pattern, String) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
- addMetadataExtractionPattern(String, String) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Adds a metadata extraction pattern that will extract the whole text
matched into the given field.
- addMetadataExtractionPattern(String, String, int) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Adds a metadata extraction pattern, which will extract the value from
the specified group index upon matching.
- addMetadataExtractionPattern(Pattern, boolean) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
- addMetadataExtractionPattern(Pattern, String) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
- addMetadataExtractionPattern(String, String) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Adds a metadata extraction pattern that will extract the whole text
matched into the given field.
- addMetadataExtractionPattern(String, String, int) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Adds a metadata extraction pattern, which will extract the value from
the specified group index upon matching.
- addMetadataExtractionPatterns(RegexFieldExtractor...) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Adds a metadata extraction pattern that will extract matching field
names/values.
- addMetadataExtractionPatterns(Map<Pattern, String>) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
- addMetadataExtractionPatterns(RegexFieldExtractor...) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Adds a metadata extraction pattern that will extract matching field
names/values.
- addMetadataExtractionPatterns(Map<Pattern, String>) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
- addMetadataExtractionPatterns(RegexFieldExtractor...) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Adds a metadata extraction pattern that will extract matching field
names/values.
- addNestedResponse(ImporterResponse) - Method in class com.norconex.importer.response.ImporterResponse
-
- addPattern(String, String) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Adds a pattern that will extract the whole text matched into
given field.
- addPattern(String, String, int) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Adds a new pattern, which will extract the value from the specified
group index upon matching.
- addPattern(RegexFieldExtractor...) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Adds one or more pattern that will extract matching field names/values.
- addReductions(String...) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
-
- addRename(String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
-
- addRename(String, String, boolean, boolean) - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
-
- addReplacement(String, String, String) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
-
- addReplacement(String, String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
-
- addReplacement(String, String, String, String) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
-
- addReplacement(String, String, String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
-
- addReplacement(ReplaceTagger.Replacement) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
-
Adds a replacement.
- addReplacement(String, String) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
-
- addRestriction(String, String, boolean) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Adds a restriction this handler should be restricted to.
- addRestriction(PropertyMatcher...) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Adds one or more restrictions this handler should be restricted to.
- addRestrictions(List<PropertyMatcher>) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Adds restrictions this handler should be restricted to.
- addSingleValueField(String, String) - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
-
- addSplit(String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
-
- addSplit(String, String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
-
- addStripEndpoints(String, String) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
- addTextEndpoints(String, String, String) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
Adds a new pair of end points to match.
- addTikaMetadataToImporterMetadata(Metadata, ImporterMetadata) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
-
- API_GOOGLE - Static variable in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- API_LINGO24 - Static variable in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- API_MICROSOFT - Static variable in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- API_MOSES - Static variable in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- API_YANDEX - Static variable in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- APPLY_BOTH - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
- APPLY_FIELD - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
- APPLY_VALUE - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
- ARG_CHECKCFG - Static variable in class com.norconex.importer.ImporterLauncher
-
- ARG_VARIABLES - Static variable in class com.norconex.importer.ImporterLauncher
-
- DateFormatTagger - Class in com.norconex.importer.handler.tagger.impl
-
Formats a date from any given format to a format of choice, as per the
formatting options found on
SimpleDateFormat
with the exception
of the string "EPOCH" which represents the difference, measured in
milliseconds, between the date and midnight, January 1, 1970.
- DateFormatTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Constructor.
- DateMetadataFilter - Class in com.norconex.importer.handler.filter.impl
-
Accepts or rejects a document based on the date value(s) of a metadata
field, stored in a specified format.
- DateMetadataFilter() - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- DateMetadataFilter(String) - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- DateMetadataFilter(String, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- DateMetadataFilter.Condition - Class in com.norconex.importer.handler.filter.impl
-
- DateMetadataFilter.Operator - Enum in com.norconex.importer.handler.filter.impl
-
- DateMetadataFilter.TimeUnit - Enum in com.norconex.importer.handler.filter.impl
-
- DebugTagger - Class in com.norconex.importer.handler.tagger.impl
-
A utility tagger to help with troubleshooting of document importing.
- DebugTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.DebugTagger
-
- DEFAULT_ESCAPE_CHARACTER - Static variable in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
- DEFAULT_FIELD - Static variable in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
- DEFAULT_FIELD - Static variable in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
- DEFAULT_HEADING_MAX_LENGTH - Static variable in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- DEFAULT_HEADING_MIN_LENGTH - Static variable in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- DEFAULT_MAX_FILE_CACHE_SIZE - Static variable in class com.norconex.importer.ImporterConfig
-
- DEFAULT_MAX_FILE_POOL_CACHE_SIZE - Static variable in class com.norconex.importer.ImporterConfig
-
- DEFAULT_MAX_READ_SIZE - Static variable in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- DEFAULT_MAX_SAMPLES - Static variable in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- DEFAULT_ON_CONFLICT - Static variable in class com.norconex.importer.handler.tagger.impl.ConstantTagger
-
- DEFAULT_QUOTE_CHARACTER - Static variable in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
- DEFAULT_REFERENCE_PAGE_PREFIX - Static variable in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
-
- DEFAULT_SCRIPT_ENGINE - Static variable in class com.norconex.importer.handler.ScriptRunner
-
- DEFAULT_SEPARATOR_CHARACTER - Static variable in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
- DEFAULT_TARGET_CHARSET - Static variable in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
- DEFAULT_TARGET_CHARSET - Static variable in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
-
- DEFAULT_TEMP_DIR_PATH - Static variable in class com.norconex.importer.ImporterConfig
-
- DEFAULT_TITLE_MAX_LENGTH - Static variable in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- DEFAULT_TO_FIELD - Static variable in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- DeleteTagger - Class in com.norconex.importer.handler.tagger.impl
-
Delete the metadata fields provided.
- DeleteTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
- detect(File) - Method in class com.norconex.importer.doc.ContentTypeDetector
-
Detects the content type of the given file.
- detect(File, String) - Method in class com.norconex.importer.doc.ContentTypeDetector
-
Detects the content type of the given file.
- detect(InputStream) - Method in class com.norconex.importer.doc.ContentTypeDetector
-
Detects the content type from the given input stream.
- detect(InputStream, String) - Method in class com.norconex.importer.doc.ContentTypeDetector
-
Detects the content type from the given input stream.
- detectCharset(String) - Static method in class com.norconex.importer.util.CharsetUtil
-
Detects the character encoding of a string.
- detectCharset(String, String) - Static method in class com.norconex.importer.util.CharsetUtil
-
Detects the character encoding of a string.
- detectCharset(InputStream) - Static method in class com.norconex.importer.util.CharsetUtil
-
Detects the character encoding of an input stream.
- detectCharset(InputStream, String) - Static method in class com.norconex.importer.util.CharsetUtil
-
Detects the character encoding of an input stream.
- detectCharsetIfBlank(String, String, InputStream, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Convenience method for handlers that need to detect an input encoding
if the explicitly provided encoding is blank.
- DOC_CONTENT_ENCODING - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DOC_CONTENT_FAMILY - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DOC_CONTENT_TYPE - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DOC_EMBEDDED_PARENT_REFERENCE - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DOC_EMBEDDED_PARENT_ROOT_REFERENCE - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DOC_EMBEDDED_REFERENCE - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DOC_EMBEDDED_TYPE - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DOC_GENERATED_TITLE - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DOC_IMPORTED_DATE - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DOC_LANGUAGE - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DOC_PDF_PAGE_NO - Static variable in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
-
- DOC_PDF_TOTAL_PAGES - Static variable in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
-
- DOC_REFERENCE - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DOC_TRANSLATED_FROM - Static variable in class com.norconex.importer.doc.ImporterMetadata
-
- DocumentLengthTagger - Class in com.norconex.importer.handler.tagger.impl
-
Adds the document length (i.e., number of bytes) to
the specified field
.
- DocumentLengthTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
- DocumentParserException - Exception in com.norconex.importer.parser
-
Exception thrown upon encountering a non-recoverable issue parsing a
document.
- DocumentParserException() - Constructor for exception com.norconex.importer.parser.DocumentParserException
-
- DocumentParserException(String) - Constructor for exception com.norconex.importer.parser.DocumentParserException
-
- DocumentParserException(Throwable) - Constructor for exception com.norconex.importer.parser.DocumentParserException
-
- DocumentParserException(String, Throwable) - Constructor for exception com.norconex.importer.parser.DocumentParserException
-
- DOMContentFilter - Class in com.norconex.importer.handler.filter.impl
-
Uses a Document Object Model (DOM) representation of an HTML, XHTML, or
XML document content to perform filtering based on matching an
element/attribute or element/attribute value.
- DOMContentFilter() - Constructor for class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- DOMContentFilter(String) - Constructor for class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- DOMContentFilter(String, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- DOMContentFilter(String, OnMatch, boolean) - Constructor for class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- domContentTypes() - Static method in class com.norconex.importer.handler.CommonRestrictions
-
Default content-types defining a DOM document.
- DOMExtractDetails() - Constructor for class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- DOMExtractDetails(String, String, boolean) - Constructor for class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- DOMExtractDetails(String, String, boolean, String) - Constructor for class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- DOMSplitter - Class in com.norconex.importer.handler.splitter.impl
-
Splits HTML, XHTML, or XML document on a specific element.
- DOMSplitter() - Constructor for class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
- DOMTagger - Class in com.norconex.importer.handler.tagger.impl
-
Extract the value of one or more elements or attributes into
a target field, from and HTML, XHTML, or XML document.
- DOMTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Constructor.
- DOMTagger.DOMExtractDetails - Class in com.norconex.importer.handler.tagger.impl
-
DOM Extraction Details
- DOMUtil - Class in com.norconex.importer.util
-
Utility methods related to JSoup/DOM manipulation.
- GenericDocumentParserFactory - Class in com.norconex.importer.parser
-
Generic document parser factory.
- GenericDocumentParserFactory() - Constructor for class com.norconex.importer.parser.GenericDocumentParserFactory
-
Creates a new document parser factory of the given format.
- getApi() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- getApiKey() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- getApplyTo(String) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Gets what the case changing instructions apply to.
- getBegin() - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
-
- getCaseType(String) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
- getClientId() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- getClientSecret() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- getCommand() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets the command to execute.
- getCommand() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets the command to execute.
- getCommand() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets the command to execute.
- getConditions() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
- getConstants() - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
-
- getContent() - Method in class com.norconex.importer.doc.ImporterDocument
-
- getContentColumns() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
- getContentEncoding() - Method in class com.norconex.importer.doc.ImporterDocument
-
- getContentType() - Method in class com.norconex.importer.doc.ImporterDocument
-
- getContentTypes() - Method in class com.norconex.importer.parser.OCRConfig
-
Gets the regular expression matching content types to restrict OCR to.
- getDateString() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.Condition
-
- getDefaultValue() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- getDescription() - Method in class com.norconex.importer.response.ImporterStatus
-
- getDetectHeadingMaxLength() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- getDetectHeadingMinLength() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- getDocument() - Method in class com.norconex.importer.response.ImporterResponse
-
- getDOMExtractDetailsList() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Gets a list of DOM extraction details.
- getElementValue(Element, String) - Static method in class com.norconex.importer.util.DOMUtil
-
Gets an element value based on JSoup DOM.
- getEmbeddedConfig() - Method in class com.norconex.importer.parser.ParseHints
-
- getEmbeddedDocuments() - Method in class com.norconex.importer.parser.impl.AbstractTikaParser.MergeEmbeddedParser
-
- getEmbeddedDocuments() - Method in interface com.norconex.importer.parser.impl.AbstractTikaParser.RecursiveParser
-
- getEmbeddedDocuments() - Method in class com.norconex.importer.parser.impl.AbstractTikaParser.SplitEmbbededParser
-
- getEmbeddedParentReference() - Method in class com.norconex.importer.doc.ImporterMetadata
-
- getEmbeddedParentRootReference() - Method in class com.norconex.importer.doc.ImporterMetadata
-
- getEmbeddedReference() - Method in class com.norconex.importer.doc.ImporterMetadata
-
- getEmbeddedType() - Method in class com.norconex.importer.doc.ImporterMetadata
-
- getEnd() - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
-
- getEngineName() - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
-
- getEngineName() - Method in class com.norconex.importer.handler.ScriptRunner
-
- getEngineName() - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
-
- getEngineName() - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
-
- getEnvironmentVariables() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets environment variables.
- getEnvironmentVariables() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets environment variables.
- getEnvironmentVariables() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets environment variables.
- getEpochDate() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.Condition
-
- getEscapeCharacter() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Gets the escape character.
- getException() - Method in class com.norconex.importer.response.ImporterStatus
-
- getExtract() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Gets what should be extracted for the value.
- getExtract() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- getFallbackLanguage() - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
- getFallbackMaxLength() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- getField() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- getField() - Method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
-
- getField() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
- getField() - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
- getField() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
- getField() - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
- getField() - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
- getField() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
- getFieldGroup() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
- getFieldName() - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
-
- getFieldNames() - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
- getFields() - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
- getFields() - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
- getFields() - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
- getFieldsRegex() - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
- getFieldsRegex() - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
- getFieldsRegex() - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
- getFieldsToTranslate() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- getFile() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- getFormat() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- getFormat() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Gets optional source field holding the HTML content to apply DOM
extraction to.
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.Split
-
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- getFromFields() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- getFromFieldsRegex() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- getFromFormat() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- getFromFormats() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Gets the source date formats to match.
- getFromLocale() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Gets the locale used for parsing the source date.
- getFromSeparator() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- getFromValue() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
- getHierarchyDetails() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
-
- getIgnoredContentTypesRegex() - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
Gets the regular expression matching content types to ignore
(i.e.
- getImporterStatus() - Method in class com.norconex.importer.response.ImporterResponse
-
- getInput() - Method in class com.norconex.importer.handler.splitter.SplittableDocument
-
- getLanguage() - Method in class com.norconex.importer.doc.ImporterMetadata
-
- getLanguages() - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
- getLanguages() - Method in class com.norconex.importer.parser.OCRConfig
-
Gets languages to use by OCR.
- getLinesToSkip() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Gets how many lines to skip before starting to parse lines.
- getLocale() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Gets the locale used for formatting.
- getLogFields() - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
-
- getLogLevel() - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
-
- getMatchesDetails() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
- getMaxFileCacheSize() - Method in class com.norconex.importer.ImporterConfig
-
- getMaxFilePoolCacheSize() - Method in class com.norconex.importer.ImporterConfig
-
- getMaxLength() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- getMaxReadSize() - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
-
Gets the maximum number of characters to read for filtering
at once.
- getMaxReadSize() - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
-
Gets the maximum number of characters to read from content for tagging
at once.
- getMaxReadSize() - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
-
Gets the maximum number of characters to read and transform
at once.
- getMaxSamples() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- getMerges() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger
-
- getMetadata() - Method in class com.norconex.importer.doc.ImporterDocument
-
- getMetadata() - Method in class com.norconex.importer.handler.splitter.SplittableDocument
-
- getMetadataExtractionPatterns() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets metadata extraction patterns.
- getMetadataExtractionPatterns() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets metadata extraction patterns.
- getMetadataExtractionPatterns() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets metadata extraction patterns.
- getMetadataInputFormat() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets the format of the metadata input file sent to the external
application.
- getMetadataInputFormat() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets the format of the metadata input file sent to the external
application.
- getMetadataInputFormat() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets the format of the metadata input file sent to the external
application.
- getMetadataOutputFormat() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets the format of the metadata output file from the external
application.
- getMetadataOutputFormat() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets the format of the metadata output file from the external
application.
- getMetadataOutputFormat() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets the format of the metadata output file from the external
application.
- getNestedResponses() - Method in class com.norconex.importer.response.ImporterResponse
-
- getNoExtractContainerContentTypes() - Method in class com.norconex.importer.parser.EmbeddedConfig
-
- getNoExtractEmbeddedContentTypes() - Method in class com.norconex.importer.parser.EmbeddedConfig
-
- getNumber() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Condition
-
- getOCRConfig() - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
- getOCRConfig() - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
-
- getOcrConfig() - Method in class com.norconex.importer.parser.ParseHints
-
- getOnConflict() - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
-
Gets the conflict resolution strategy.
- getOnMatch() - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
-
- getOnMatch() - Method in class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
- getOnMatch() - Method in interface com.norconex.importer.handler.filter.IOnMatchFilter
-
Gets the the on match action (exclude or include).
- getOperator(String) - Static method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.Operator
-
- getOperator() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Condition
-
- getOperator(String) - Static method in enum com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Operator
-
- getParentResponse() - Method in class com.norconex.importer.response.ImporterResponse
-
- getParseErrorsSaveDir() - Method in class com.norconex.importer.ImporterConfig
-
Gets the directory where file generating parsing errors will be saved.
- getParseHints() - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
Gets parse hints.
- getParser() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Gets the parser to use when creating the DOM-tree.
- getParser() - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
Gets the parser to use when creating the DOM-tree.
- getParser() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Gets the parser to use when creating the DOM-tree.
- getParser(String, ContentType) - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
Gets a parser based on content type, regardless of document reference
(ignoring it).
- getParser(String, ContentType) - Method in interface com.norconex.importer.parser.IDocumentParserFactory
-
Gets a document parser, optionally based on its reference or content
type.
- getParserFactory() - Method in class com.norconex.importer.ImporterConfig
-
- getPath() - Method in class com.norconex.importer.parser.OCRConfig
-
Gets the Tesseract OCR engine executable file path.
- getPatterns() - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Gets the patterns used to extract matching field names/values.
- getPostParseHandlers() - Method in class com.norconex.importer.ImporterConfig
-
- getPreParseHandlers() - Method in class com.norconex.importer.ImporterConfig
-
- getQuoteCharacter() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Get the value's surrounding quotes character.
- getReader() - Method in class com.norconex.importer.handler.splitter.SplittableDocument
-
- getReductions() - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
-
- getReference() - Method in class com.norconex.importer.doc.ImporterDocument
-
- getReference() - Method in class com.norconex.importer.doc.ImporterMetadata
-
- getReference() - Method in class com.norconex.importer.handler.splitter.SplittableDocument
-
- getReference() - Method in class com.norconex.importer.response.ImporterResponse
-
- getReferenceColumn() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
- getReferencePagePrefix() - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
-
- getRegex() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- getRegex() - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
- getRegex() - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
- getRegex() - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
- getRegex() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
- getRejectionFilter() - Method in class com.norconex.importer.response.ImporterStatus
-
- getReplacements() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
-
- getReplacements() - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
-
- getResponseProcessors() - Method in class com.norconex.importer.ImporterConfig
-
- getRestrictions() - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Gets all restrictions
- getScript() - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
-
- getScript() - Method in class com.norconex.importer.handler.ScriptRunner
-
- getScript() - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
-
- getScript() - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
-
- getScriptPath() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- getSelector() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- getSelector() - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
- getSelector() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- getSeparator() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.Split
-
- getSeparatorCharacter() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Gets the value-separator character.
- getSingleValueFields() - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
-
- getSingleValueSeparator() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- getSmtPath() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- getSourceCharset() - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
- getSourceCharset() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
-
- getSourceLanguage() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- getSourceLanguageField() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- getSplitContentTypes() - Method in class com.norconex.importer.parser.EmbeddedConfig
-
- getSplits() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
-
- getStatus() - Method in class com.norconex.importer.response.ImporterStatus
-
- getStreamFactory() - Method in class com.norconex.importer.Importer
-
- getStripAfterRegex() - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
- getStripBeforeRegex() - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
- getStripEndpoints() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
- getSuffix() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- getTargetCharset() - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
- getTargetCharset() - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
-
- getTargetLanguages() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- getTempDir() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets directory where to store temporary files used for transformation.
- getTempDir() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets directory where to store temporary files used for transformation.
- getTempDir() - Method in class com.norconex.importer.ImporterConfig
-
- getTempDir() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets directory where to store temporary files used for transformation.
- getTimeUnit(String) - Static method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
-
- getTitleMaxLength() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.Split
-
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- getToField() - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
-
- getToFormat() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- getToLocale() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Gets the locale used for formatting the target date.
- getToSeparator() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- getToValue() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
- getTruncateSamplesAt() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- getUserKey() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- getValue() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
- getValueGroup() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
- IDocumentFilter - Interface in com.norconex.importer.handler.filter
-
Filters documents.
- IDocumentParser - Interface in com.norconex.importer.parser
-
Implementations are responsible for parsing a document to
extract its text and metadata, as well as any embedded documents
(when applicable).
- IDocumentParserFactory - Interface in com.norconex.importer.parser
-
Factory providing document parsers for documents.
- IDocumentSplitter - Interface in com.norconex.importer.handler.splitter
-
Responsible for splitting a single document into several ones.
- IDocumentTagger - Interface in com.norconex.importer.handler.tagger
-
Tags a document with extra metadata information, or manipulate existing
metadata information.
- IDocumentTransformer - Interface in com.norconex.importer.handler.transformer
-
Transformers allow to manipulate and modify a document metadata or content.
- IHintsAwareParser - Interface in com.norconex.importer.parser
-
Indicates that a parser can be initialized with generic parser configuration
settings and it will try to apply any such settings the best it can
when possible to do so.
- IImporterHandler - Interface in com.norconex.importer.handler
-
Identifies a class as being an import handler.
- IImporterResponseProcessor - Interface in com.norconex.importer.response
-
Processes an importer response to modify it or perform other actions
as required before it is returned.
- importDocument(InputStream, Properties, String) - Method in class com.norconex.importer.Importer
-
Imports a document according to the importer configuration.
- importDocument(File, Properties) - Method in class com.norconex.importer.Importer
-
Imports a document according to the importer configuration.
- importDocument(File, ContentType, String, Properties, String) - Method in class com.norconex.importer.Importer
-
Imports a document according to the importer configuration.
- importDocument(InputStream, ContentType, String, Properties, String) - Method in class com.norconex.importer.Importer
-
- Importer - Class in com.norconex.importer
-
Principal class responsible for importing documents.
- Importer() - Constructor for class com.norconex.importer.Importer
-
Creates a new importer with default configuration.
- Importer(ImporterConfig) - Constructor for class com.norconex.importer.Importer
-
Creates a new importer with the given configuration.
- ImporterConfig - Class in com.norconex.importer
-
Importer configuration.
- ImporterConfig() - Constructor for class com.norconex.importer.ImporterConfig
-
- ImporterConfigLoader - Class in com.norconex.importer
-
Importer configuration loader.
- ImporterDocument - Class in com.norconex.importer.doc
-
A document being imported.
- ImporterDocument(String, CachedInputStream) - Constructor for class com.norconex.importer.doc.ImporterDocument
-
- ImporterDocument(String, CachedInputStream, ImporterMetadata) - Constructor for class com.norconex.importer.doc.ImporterDocument
-
- ImporterException - Exception in com.norconex.importer
-
Exception thrown when an issue prevented the proper importation of a file.
- ImporterException() - Constructor for exception com.norconex.importer.ImporterException
-
- ImporterException(String) - Constructor for exception com.norconex.importer.ImporterException
-
- ImporterException(Throwable) - Constructor for exception com.norconex.importer.ImporterException
-
- ImporterException(String, Throwable) - Constructor for exception com.norconex.importer.ImporterException
-
- ImporterHandlerException - Exception in com.norconex.importer.handler
-
Exception thrown by several handler classes upon encountering
issues.
- ImporterHandlerException() - Constructor for exception com.norconex.importer.handler.ImporterHandlerException
-
- ImporterHandlerException(String) - Constructor for exception com.norconex.importer.handler.ImporterHandlerException
-
- ImporterHandlerException(Throwable) - Constructor for exception com.norconex.importer.handler.ImporterHandlerException
-
- ImporterHandlerException(String, Throwable) - Constructor for exception com.norconex.importer.handler.ImporterHandlerException
-
- ImporterLauncher - Class in com.norconex.importer
-
Command line launcher of the Importer application.
- ImporterMetadata - Class in com.norconex.importer.doc
-
Holds a document metadata with case-insensitive keys.
- ImporterMetadata() - Constructor for class com.norconex.importer.doc.ImporterMetadata
-
- ImporterMetadata(boolean) - Constructor for class com.norconex.importer.doc.ImporterMetadata
-
- ImporterMetadata(Map<String, List<String>>, boolean) - Constructor for class com.norconex.importer.doc.ImporterMetadata
-
- ImporterMetadata(Map<String, List<String>>) - Constructor for class com.norconex.importer.doc.ImporterMetadata
-
- ImporterResponse - Class in com.norconex.importer.response
-
- ImporterResponse(String, ImporterStatus) - Constructor for class com.norconex.importer.response.ImporterResponse
-
- ImporterResponse(ImporterDocument) - Constructor for class com.norconex.importer.response.ImporterResponse
-
- ImporterRuntimeException - Exception in com.norconex.importer
-
RuntimeException thrown when a an issue prevented the proper importation of a
file.
- ImporterRuntimeException() - Constructor for exception com.norconex.importer.ImporterRuntimeException
-
- ImporterRuntimeException(String) - Constructor for exception com.norconex.importer.ImporterRuntimeException
-
- ImporterRuntimeException(Throwable) - Constructor for exception com.norconex.importer.ImporterRuntimeException
-
- ImporterRuntimeException(String, Throwable) - Constructor for exception com.norconex.importer.ImporterRuntimeException
-
- ImporterStatus - Class in com.norconex.importer.response
-
- ImporterStatus() - Constructor for class com.norconex.importer.response.ImporterStatus
-
- ImporterStatus(ImporterStatus.Status, String) - Constructor for class com.norconex.importer.response.ImporterStatus
-
- ImporterStatus(ImporterException) - Constructor for class com.norconex.importer.response.ImporterStatus
-
- ImporterStatus(ImporterException, String) - Constructor for class com.norconex.importer.response.ImporterStatus
-
- ImporterStatus(IDocumentFilter) - Constructor for class com.norconex.importer.response.ImporterStatus
-
- ImporterStatus(IDocumentFilter, String) - Constructor for class com.norconex.importer.response.ImporterStatus
-
- ImporterStatus.Status - Enum in com.norconex.importer.response
-
- initDefaultParsers() - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
- initialize(ParseHints) - Method in interface com.norconex.importer.parser.IHintsAwareParser
-
Initialize this parser with the given parse hints.
- initialize(ParseHints) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
-
- IOnMatchFilter - Interface in com.norconex.importer.handler.filter
-
Tells the collector that a filter is of "OnMatch" type.
- isAppendHash() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- isApplicable(String, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Class to invoke by subclasses to find out if this handler should be
rejected or not based on the metadata restriction provided.
- isCaseSensitive() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- isCaseSensitive() - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
- isCaseSensitive() - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
- isCaseSensitive() - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
- isCaseSensitive() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Whether the matching should be case sensitive or not.
- isCaseSensitive() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Whether the replacement should be case sensitive or not.
- isCaseSensitive() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
- isCaseSensitive() - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
- isCaseSensitive() - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
-
- isCaseSensitive() - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
-
- isCaseSensitive() - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
- isCaseSensitive() - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
- isCaseSensitive() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
- isCaseSensitive() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
- isDeleteFromFields() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- isDetectHeading() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- isDocumentMatched(String, InputStream, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
-
- isDocumentMatched(String, InputStream, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
-
- isDocumentMatched(String, InputStream, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- isDocumentMatched(String, InputStream, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- isDocumentMatched(String, InputStream, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
- isDocumentMatched(String, InputStream, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
- isDocumentMatched(String, InputStream, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
- isDocumentMatched(String, InputStream, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
- isEmpty() - Method in class com.norconex.importer.parser.EmbeddedConfig
-
- isEmpty() - Method in class com.norconex.importer.parser.OCRConfig
-
- isError() - Method in class com.norconex.importer.response.ImporterStatus
-
- isIgnoreContent() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- isIgnoreNonTranslatedFields() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- isInclusive() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
- isInclusive() - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
- isInclusive() - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
- isInclusive() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
- isInputDisabled() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets whether to send the document content or not, regardless
whether ${INPUT} token is part of the command or not.
- isKeepBadDates() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- isKeepEmptySegments() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- isKeepProbabilities() - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
- isLogContent() - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
-
- isMatchBlanks() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Gets whether lements with blank values should be considered a
match and have an empty string returned as opposed to nothing at all.
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
- isRegex() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
- isRegex() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- isRegex() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
- isRegex() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.Split
-
- isRejected() - Method in class com.norconex.importer.response.ImporterStatus
-
- isReplaceAll() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
- isShortText() - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
- isSingleValue() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- isSplitEmbedded() - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
- isSplitEmbedded() - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
-
- isStringContentMatching(String, StringBuilder, ImporterMetadata, boolean, int) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
-
- isStringContentMatching(String, StringBuilder, ImporterMetadata, boolean, int) - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
- isStringContentMatching(String, StringBuilder, ImporterMetadata, boolean, int) - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
-
- isSuccess() - Method in class com.norconex.importer.response.ImporterResponse
-
- isSuccess() - Method in class com.norconex.importer.response.ImporterStatus
-
- isTextDocumentMatching(String, Reader, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
-
- isTextDocumentMatching(String, Reader, ImporterMetadata, boolean) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
-
- isUseFirstRowAsFields() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Whether to use the first row as field names for values.
- isWholeMatch() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
- isWithHeaders() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- isWithOccurences() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- saveCharStreamFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
-
Saves configuration settings specific to the implementing class.
- saveCharStreamFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
-
- saveCharStreamTaggerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
-
Saves configuration settings specific to the implementing class.
- saveCharStreamTaggerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
-
- saveCharStreamTaggerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
-
- saveCharStreamTransformerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
-
Saves configuration settings specific to the implementing class.
- saveCharStreamTransformerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
-
- saveCharStreamTransformerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
-
- saveFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
-
- saveFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
-
- saveFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- saveFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- saveFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
- saveFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
- saveFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
- saveFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Saves configuration settings specific to the implementing class.
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.CopyTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
- saveHandlerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
-
- saveStringFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
-
Saves configuration settings specific to the implementing class.
- saveStringFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
- saveStringFilterToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
-
- saveStringTaggerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
-
Saves configuration settings specific to the implementing class.
- saveStringTaggerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
- saveStringTaggerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
- saveStringTaggerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
-
- saveStringTaggerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
- saveStringTaggerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
- saveStringTaggerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- saveStringTransformerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
-
Saves configuration settings specific to the implementing class.
- saveStringTransformerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
-
- saveStringTransformerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
-
- saveStringTransformerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
-
- saveStringTransformerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
- saveStringTransformerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
- saveStringTransformerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
- saveToXML(Writer) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
- saveToXML(XMLStreamWriter) - Method in class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
Convenience method for subclasses to save the "onMatch" attribute
to an XML file when XMLConfiguration
is used.
- saveToXML(Writer) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
- saveToXML(Writer) - Method in class com.norconex.importer.ImporterConfig
-
- saveToXML(Writer) - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
- saveToXML(Writer) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
- ScriptFilter - Class in com.norconex.importer.handler.filter.impl
-
Filter incoming documents using a scripting language.
- ScriptFilter() - Constructor for class com.norconex.importer.handler.filter.impl.ScriptFilter
-
- ScriptRunner<T> - Class in com.norconex.importer.handler
-
Runs scripts written in a programming language supported by the provided
script engine.
- ScriptRunner() - Constructor for class com.norconex.importer.handler.ScriptRunner
-
- ScriptRunner(String) - Constructor for class com.norconex.importer.handler.ScriptRunner
-
- ScriptTagger - Class in com.norconex.importer.handler.tagger.impl
-
Tag incoming documents using a scripting language.
- ScriptTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.ScriptTagger
-
- ScriptTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Transform incoming documents using a scripting language.
- ScriptTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.ScriptTransformer
-
- setApi(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setApiKey(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setAppendHash(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- setBegin(long) - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
-
Sets the beginning index (inclusive).
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Sets whether to do a case sensitive match or not.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets whether to do a case sensitive replacement or not.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
Sets whether to ignore case when matching start and end text.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
-
Sets whether to ignore case when matching characters or string
to reduce.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
-
Sets whether to ignore case when matching characters or string
to reduce.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
Sets whether to ignore case when matching text.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
Sets whether to ignore case when matching text.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
Sets whether to ignore case when matching start and end text.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
- setClientId(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setClientSecret(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setCommand(String) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets the command to execute.
- setCommand(String) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets the command to execute.
- setCommand(String) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets the command to execute.
- setConditions(NumericMetadataFilter.Condition...) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
- setContent(CachedInputStream) - Method in class com.norconex.importer.doc.ImporterDocument
-
- setContentColumns(String...) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
- setContentEncoding(String) - Method in class com.norconex.importer.doc.ImporterDocument
-
- setContentType(ContentType) - Method in class com.norconex.importer.doc.ImporterDocument
-
- setContentTypes(String) - Method in class com.norconex.importer.parser.OCRConfig
-
Sets the regular expression matching content types to restrict OCR to.
- setDefaultValue(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- setDeleteFromFields(boolean) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- setDetectHeading(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- setDetectHeadingMaxLength(int) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- setDetectHeadingMinLength(int) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- setEmbeddedParentReference(String) - Method in class com.norconex.importer.doc.ImporterMetadata
-
- setEmbeddedParentRootReference(String) - Method in class com.norconex.importer.doc.ImporterMetadata
-
- setEmbeddedReference(String) - Method in class com.norconex.importer.doc.ImporterMetadata
-
- setEmbeddedType(String) - Method in class com.norconex.importer.doc.ImporterMetadata
-
- setEnd(long) - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
-
Sets the end index (exclusive).
- setEngineName(String) - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
-
- setEngineName(String) - Method in class com.norconex.importer.handler.ScriptRunner
-
- setEngineName(String) - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
-
- setEngineName(String) - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
-
- setEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets the environment variables.
- setEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets the environment variables.
- setEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets the environment variables.
- setEscapeCharacter(char) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Sets the escape character.
- setExtract(String) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Sets what should be extracted for the value.
- setExtract(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- setFallbackLanguage(String) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
Sets the fallback language when none are detected.
- setFallbackMaxLength(int) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- setField(String) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- setField(String) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
- setField(String) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
- setField(String) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
- setField(String) - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
- setField(String) - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
- setField(String) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
- setFieldGroup(int) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
- setFieldName(String) - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
-
- setFields(String...) - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
- setFieldsRegex(String) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
- setFieldsRegex(String) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
- setFieldsRegex(String) - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
- setFieldsToTranslate(String...) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setFile(File) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- setFormat(String) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
- setFormat(String) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Sets the field with the value we want to perform matches on.
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Sets optional source field holding the HTML content to apply DOM
extraction to.
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets the field with the value to replace.
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- setFromFields(String...) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- setFromFieldsRegex(String) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- setFromFormat(String) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- setFromFormats(String...) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Sets the source date formats to match.
- setFromLocale(Locale) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Sets the locale used for parsing the source date.
- setFromSeparator(String) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- setFromValue(String) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets the value to replace.
- setIgnoreContent(boolean) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setIgnoredContentTypesRegex(String) - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
sets the regular expression matching content types to ignore
(i.e.
- setIgnoreNonTranslatedFields(boolean) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setImporterStatus(ImporterStatus) - Method in class com.norconex.importer.response.ImporterResponse
-
- setInclusive(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
Sets whether start and end text pairs should be kept or
not.
- setInclusive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
Sets whether the match itself should be stripped or not.
- setInclusive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
Sets whether the match itself should be stripped or not.
- setInclusive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
Sets whether start and end text pairs should themselves be stripped or
not.
- setInputDisabled(boolean) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets whether to send the document content or not, regardless
whether ${INPUT} token is part of the command or not.
- setKeepBadDates(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- setKeepEmptySegments(boolean) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- setKeepProbabilities(boolean) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
Sets whether to keep the match probabilities for each languages
detected.
- setLanguage(String) - Method in class com.norconex.importer.doc.ImporterMetadata
-
- setLanguages(String...) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
Sets the language candidates for the language detection.
- setLanguages(String) - Method in class com.norconex.importer.parser.OCRConfig
-
Sets languages to use by OCR.
- setLinesToSkip(int) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Sets how many lines to skip before starting to parse lines.
- setLocale(Locale) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Sets the locale used for formatting.
- setLogContent(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
-
- setLogFields(String...) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
-
- setLogLevel(String) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
-
- setMatchBlanks(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Sets whether elements with blank values should be considered a
match and have an empty string returned as opposed to nothing at all.
- setMaxFileCacheSize(int) - Method in class com.norconex.importer.ImporterConfig
-
- setMaxFilePoolCacheSize(int) - Method in class com.norconex.importer.ImporterConfig
-
- setMaxLength(int) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- setMaxReadSize(int) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
-
Sets the maximum number of characters to read for filtering
at once.
- setMaxReadSize(int) - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
-
Sets the maximum number of characters to read from content for tagging
at once.
- setMaxReadSize(int) - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
-
Sets the maximum number of characters to read and transform
at once.
- setMaxSamples(int) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- setMetadataExtractionPatterns(RegexFieldExtractor...) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets metadata extraction patterns.
- setMetadataExtractionPatterns(Map<Pattern, String>) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
- setMetadataExtractionPatterns(RegexFieldExtractor...) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets metadata extraction patterns.
- setMetadataExtractionPatterns(Map<Pattern, String>) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
- setMetadataExtractionPatterns(RegexFieldExtractor...) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets metadata extraction patterns.
- setMetadataInputFormat(String) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets the format of the metadata input file sent to the external
application.
- setMetadataInputFormat(String) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets the format of the metadata input file sent to the external
application.
- setMetadataInputFormat(String) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets the format of the metadata input file sent to the external
application.
- setMetadataOutputFormat(String) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets the format of the metadata output file from the external
application.
- setMetadataOutputFormat(String) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets the format of the metadata output file from the external
application.
- setMetadataOutputFormat(String) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets the format of the metadata output file from the external
application.
- setNoExtractContainerContentTypes(String) - Method in class com.norconex.importer.parser.EmbeddedConfig
-
- setNoExtractEmbeddedContentTypes(String) - Method in class com.norconex.importer.parser.EmbeddedConfig
-
- setOCRConfig(OCRConfig) - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
- setOCRConfig(OCRConfig) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
-
- setOnConflict(ConstantTagger.OnConflict) - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
-
Sets the conflict resolution strategy.
- setOnMatch(OnMatch) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
-
- setOnMatch(OnMatch) - Method in class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
- setParseErrorsSaveDir(File) - Method in class com.norconex.importer.ImporterConfig
-
Sets the directory where file generating parsing errors will be saved.
- setParser(String) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Sets the parser to use when creating the DOM-tree.
- setParser(String) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
Sets the parser to use when creating the DOM-tree.
- setParser(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Sets the parser to use when creating the DOM-tree.
- setParserFactory(IDocumentParserFactory) - Method in class com.norconex.importer.ImporterConfig
-
- setPath(String) - Method in class com.norconex.importer.parser.OCRConfig
-
Sets the Tesseract OCR engine executable file path.
- setPattern(RegexFieldExtractor...) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Sets one or more patterns that will extract matching field names/values.
- setPostParseHandlers(IImporterHandler...) - Method in class com.norconex.importer.ImporterConfig
-
- setPreParseHandlers(IImporterHandler...) - Method in class com.norconex.importer.ImporterConfig
-
- setQuoteCharacter(char) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Sets the value's surrounding quotes character.
- setReductions(String...) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
-
- setReference(String) - Method in class com.norconex.importer.doc.ImporterDocument
-
- setReference(String) - Method in class com.norconex.importer.doc.ImporterMetadata
-
- setReferenceColumn(String) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
- setReferencePagePrefix(String) - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
-
- setRegex(String) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- setRegex(String) - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
- setRegex(String) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
- setRegex(String) - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
- setRegex(boolean) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Sets whether the value
to match is a regular expression.
- setRegex(boolean) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- setRegex(boolean) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets whether the fromValue
is a regular expression.
- setRegex(String) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
- setReplaceAll(boolean) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets whether to replace all occurrences of the "from" value
with the "to" value.
- setResponseProcessors(IImporterResponseProcessor...) - Method in class com.norconex.importer.ImporterConfig
-
- setScript(String) - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
-
- setScript(String) - Method in class com.norconex.importer.handler.ScriptRunner
-
- setScript(String) - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
-
- setScript(String) - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
-
- setScriptPath(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setSelector(String) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
- setSelector(String) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
- setSelector(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- setSeparatorCharacter(char) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Sets the value-separator character.
- setShortText(boolean) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
- setSingleValue(boolean) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- setSingleValueSeparator(String) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- setSmtPath(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setSourceCharset(String) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
- setSourceCharset(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
-
- setSourceLanguage(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setSourceLanguageField(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setSplitContentTypes(String) - Method in class com.norconex.importer.parser.EmbeddedConfig
-
- setSplitEmbedded(boolean) - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
- setSplitEmbedded(boolean) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
-
- setStripAfterRegex(String) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
- setStripBeforeRegex(String) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
- setSuffix(String) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- setTargetCharset(String) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
- setTargetCharset(String) - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
-
- setTargetLanguages(String...) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setTempDir(File) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets directory where to store temporary files used for transformation.
- setTempDir(File) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets directory where to store temporary files used for transformation.
- setTempDir(File) - Method in class com.norconex.importer.ImporterConfig
-
- setTempDir(File) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets directory where to store temporary files used for transformation.
- setTitleMaxLength(int) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Sets the field to store the match count.
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets the field to store the replaced value.
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
- setToField(String) - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
-
- setToFormat(String) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
- setToLocale(Locale) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Sets the locale used for formatting the source date.
- setToSeparator(String) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
- setToValue(String) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets the replacement value.
- setTruncateSamplesAt(int) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- setUseFirstRowAsFields(boolean) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Sets whether to use the first row as field names for values.
- setUserKey(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- setValue(String) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Sets the text or regular expression to match
- setValueGroup(int) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
- setWholeMatch(boolean) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets whether the specified "from" value should match the entire
field value or not (default is false
).
- setWithHeaders(boolean) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- setWithOccurences(boolean) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
-
- Split(String, String, String, boolean) - Constructor for class com.norconex.importer.handler.tagger.impl.SplitTagger.Split
-
- splitApplicableDocument(SplittableDocument, OutputStream, CachedStreamFactory, boolean) - Method in class com.norconex.importer.handler.splitter.AbstractDocumentSplitter
-
- splitApplicableDocument(SplittableDocument, OutputStream, CachedStreamFactory, boolean) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
- splitApplicableDocument(SplittableDocument, OutputStream, CachedStreamFactory, boolean) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
- splitApplicableDocument(SplittableDocument, OutputStream, CachedStreamFactory, boolean) - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
-
- splitApplicableDocument(SplittableDocument, OutputStream, CachedStreamFactory, boolean) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
- splitDocument(SplittableDocument, OutputStream, CachedStreamFactory, boolean) - Method in class com.norconex.importer.handler.splitter.AbstractDocumentSplitter
-
- splitDocument(SplittableDocument, OutputStream, CachedStreamFactory, boolean) - Method in interface com.norconex.importer.handler.splitter.IDocumentSplitter
-
- SplitEmbbededParser(String, Parser, ImporterMetadata, CachedStreamFactory) - Constructor for class com.norconex.importer.parser.impl.AbstractTikaParser.SplitEmbbededParser
-
- SplittableDocument - Class in com.norconex.importer.handler.splitter
-
- SplittableDocument(String, InputStream, ImporterMetadata) - Constructor for class com.norconex.importer.handler.splitter.SplittableDocument
-
- SplitTagger - Class in com.norconex.importer.handler.tagger.impl
-
Splits an existing metadata value into multiple values based on a given
value separator.
- SplitTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.SplitTagger
-
- SplitTagger.Split - Class in com.norconex.importer.handler.tagger.impl
-
- StripAfterTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Strips any content found after first match found for given pattern.
- StripAfterTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
- StripBeforeTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Strips any content found before first match found for given pattern.
- StripBeforeTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
- StripBetweenTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Strips any content found between a matching start and end strings.
- StripBetweenTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
- SubstringTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Keep a substring of the content matching a begin and end character
indexes.
- SubstringTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.SubstringTransformer
-