A B C D E F G H I J K L M N O P R S T U V X Y
All Classes All Packages
A
- AbstractCharStreamCondition - Class in com.norconex.importer.handler.condition
-
Base class for conditions dealing with the document content as text.
- AbstractCharStreamCondition() - Constructor for class com.norconex.importer.handler.condition.AbstractCharStreamCondition
- AbstractCharStreamFilter - Class in com.norconex.importer.handler.filter
-
Base class for filters dealing with the body of text documents only.
- AbstractCharStreamFilter() - Constructor for class com.norconex.importer.handler.filter.AbstractCharStreamFilter
- AbstractCharStreamTagger - Class in com.norconex.importer.handler.tagger
-
Base class for taggers dealing with the body of text documents only.
- AbstractCharStreamTagger() - Constructor for class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
- AbstractCharStreamTransformer - Class in com.norconex.importer.handler.transformer
-
Base class for transformers dealing with text documents only.
- AbstractCharStreamTransformer() - Constructor for class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
- AbstractDocumentFilter - Class in com.norconex.importer.handler.filter
-
Base class for document filters.
- AbstractDocumentFilter() - Constructor for class com.norconex.importer.handler.filter.AbstractDocumentFilter
- AbstractDocumentSplitter - Class in com.norconex.importer.handler.splitter
-
Base class for splitters.
- AbstractDocumentSplitter() - Constructor for class com.norconex.importer.handler.splitter.AbstractDocumentSplitter
- AbstractDocumentTagger - Class in com.norconex.importer.handler.tagger
-
Base class for taggers.
- AbstractDocumentTagger() - Constructor for class com.norconex.importer.handler.tagger.AbstractDocumentTagger
- AbstractDocumentTransformer - Class in com.norconex.importer.handler.transformer
-
Base class for transformers.
- AbstractDocumentTransformer() - Constructor for class com.norconex.importer.handler.transformer.AbstractDocumentTransformer
- AbstractImporterHandler - Class in com.norconex.importer.handler
-
Base class for handlers applying only to certain types of documents, providing a way to restrict applicable documents based on a metadata field value, where the value matches a regular expression.
- AbstractImporterHandler() - Constructor for class com.norconex.importer.handler.AbstractImporterHandler
- AbstractOnMatchFilter - Class in com.norconex.importer.handler.filter
-
Deprecated. Since 3.0.0, use composition with OnMatch instead.
- AbstractOnMatchFilter() - Constructor for class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
Deprecated.
- AbstractStringCondition - Class in com.norconex.importer.handler.condition
-
Base class to facilitate creating conditions based on text content, loading text into StringBuilder for memory processing.
- AbstractStringCondition() - Constructor for class com.norconex.importer.handler.condition.AbstractStringCondition
- AbstractStringFilter - Class in com.norconex.importer.handler.filter
-
Base class to facilitate creating filters based on text content, loading text into StringBuilder for memory processing.
- AbstractStringFilter() - Constructor for class com.norconex.importer.handler.filter.AbstractStringFilter
- AbstractStringTagger - Class in com.norconex.importer.handler.tagger
-
Base class to facilitate creating taggers based on text content, loading text into StringBuilder for memory processing.
- AbstractStringTagger() - Constructor for class com.norconex.importer.handler.tagger.AbstractStringTagger
- AbstractStringTransformer - Class in com.norconex.importer.handler.transformer
-
Base class to facilitate creating transformers on text content, loading text into a StringBuilder for memory processing.
- AbstractStringTransformer() - Constructor for class com.norconex.importer.handler.transformer.AbstractStringTransformer
- AbstractTikaParser - Class in com.norconex.importer.parser.impl
-
Base class wrapping Apache Tika parser for use by the importer.
- AbstractTikaParser(Parser) - Constructor for class com.norconex.importer.parser.impl.AbstractTikaParser
-
Creates a new Tika-based parser.
- AbstractTikaParser.MergeEmbeddedParser - Class in com.norconex.importer.parser.impl
- AbstractTikaParser.RecursiveParser - Interface in com.norconex.importer.parser.impl
- AbstractTikaParser.SplitEmbbededParser - Class in com.norconex.importer.parser.impl
- accept(HandlerContext) - Method in class com.norconex.importer.handler.HandlerConsumer
- acceptDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
- acceptDocument(HandlerDoc, InputStream, ParseState) - Method in interface com.norconex.importer.handler.filter.IDocumentFilter
-
Whether to accept a document.
- acceptDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.impl.RejectFilter
- ADD - com.norconex.importer.handler.tagger.impl.ConstantTagger.OnConflict
-
Deprecated.
- addCondition(DateMetadataFilter.Condition) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- addCondition(DateMetadataFilter.Operator, ZonedDateTime) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- addCondition(DateMetadataFilter.Operator, Date) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Deprecated.
- addCondition(DateMetadataFilter.Operator, Supplier<ZonedDateTime>) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- addCondition(NumericMetadataFilter.Operator, double) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- addConditions(List<DateMetadataFilter.Condition>) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Adds a list of conditions, appending them to the list of already defined conditions in this filter (if any).
- addConstant(String, String) - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
- addCopyDetails(TextMatcher, String, PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.CopyTagger
-
Adds copy instructions.
- addCopyDetails(String, String) - Method in class com.norconex.importer.handler.tagger.impl.CopyTagger
-
Adds copy instructions, adding to any existing values on the target field.
- addCopyDetails(String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.CopyTagger
-
Deprecated. Since 3.0.0, use CopyTagger.addCopyDetails(TextMatcher, String, PropertySetter).
- addDOMExtractDetails(DOMTagger.DOMExtractDetails) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Adds DOM extraction details.
- addDOMExtractDetails(DOMPreserveTransformer.DOMExtractDetails) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
-
Adds DOM extraction details.
- addEmbeddedParentReference(String) - Method in class com.norconex.importer.doc.DocInfo
- addEnvironmentVariable(String, String) - Method in class com.norconex.importer.handler.ExternalHandler
-
Adds an environment variable to the list of previously assigned variables (if any).
- addEnvironmentVariable(String, String) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Adds an environment variable to the list of previously assigned variables (if any).
- addEnvironmentVariable(String, String) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Adds an environment variable to the list of previously assigned variables (if any).
- addEnvironmentVariable(String, String) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Adds an environment variable to the list of previously assigned variables (if any).
- addEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.handler.ExternalHandler
-
Adds the environment variables, keeping environment variables previously assigned.
- addEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Adds the environment variables, keeping environment variables previously assigned.
- addEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Adds the environment variables, keeping environment variables previously assigned.
- addEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Adds the environment variables, keeping environment variables previously assigned.
- addField(String) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
Deprecated. Since 3.0.0, use DeleteTagger.setFieldMatcher(TextMatcher).
- addField(String) - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
Deprecated. Since 3.0.0, use KeepOnlyTagger.setFieldMatcher(TextMatcher).
- addFieldCase(String, String) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Deprecated.
- addFieldCase(String, String, String) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Deprecated.
- addHierarcyDetails(HierarchyTagger.HierarchyDetails) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
-
Adds hierarchy instructions.
- addHierarcyDetails(String, String, String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
-
Deprecated. Since 2.10.0, use HierarchyTagger.addHierarcyDetails(HierarchyDetails) instead.
- addMatchDetails(CountMatchesTagger.MatchDetails) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Deprecated.
- addMerge(MergeTagger.Merge) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger
- addMetadataExtractionPattern(String, String) - Method in class com.norconex.importer.handler.ExternalHandler
-
Adds a metadata extraction pattern that will extract the whole text matched into the given field.
- addMetadataExtractionPattern(String, String) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Adds a metadata extraction pattern that will extract the whole text matched into the given field.
- addMetadataExtractionPattern(String, String) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Adds a metadata extraction pattern that will extract the whole text matched into the given field.
- addMetadataExtractionPattern(String, String) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Adds a metadata extraction pattern that will extract the whole text matched into the given field.
- addMetadataExtractionPattern(String, String, int) - Method in class com.norconex.importer.handler.ExternalHandler
-
Adds a metadata extraction pattern, which will extract the value from the specified group index upon matching.
- addMetadataExtractionPattern(String, String, int) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Adds a metadata extraction pattern, which will extract the value from the specified group index upon matching.
- addMetadataExtractionPattern(String, String, int) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Adds a metadata extraction pattern, which will extract the value from the specified group index upon matching.
- addMetadataExtractionPattern(String, String, int) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Adds a metadata extraction pattern, which will extract the value from the specified group index upon matching.
- addMetadataExtractionPatterns(RegexFieldValueExtractor...) - Method in class com.norconex.importer.handler.ExternalHandler
-
Adds a metadata extraction pattern that will extract matching field names/values.
- addMetadataExtractionPatterns(RegexFieldValueExtractor...) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Adds a metadata extraction pattern that will extract matching field names/values.
- addMetadataExtractionPatterns(RegexFieldValueExtractor...) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Adds a metadata extraction pattern that will extract matching field names/values.
- addMetadataExtractionPatterns(RegexFieldValueExtractor...) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Adds a metadata extraction pattern that will extract matching field names/values.
- addNestedResponse(ImporterResponse) - Method in class com.norconex.importer.response.ImporterResponse
- addPattern(RegexFieldValueExtractor...) - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
-
Adds one or more patterns that will extract matching field names/values.
- addPattern(RegexFieldValueExtractor...) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated. Adds one or more patterns that will extract matching field names/values.
- addPattern(String, String) - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
-
Adds a pattern that will extract the whole text matched into the given field.
- addPattern(String, String) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated. Adds a pattern that will extract the whole text matched into the given field.
- addPattern(String, String, int) - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
-
Adds a new pattern, which will extract the value from the specified group index upon matching.
- addPattern(String, String, int) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated. Adds a new pattern, which will extract the value from the specified group index upon matching.
- addReductions(String...) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
- addRename(TextMatcher, String, PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
- addRename(String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
-
Deprecated.
- addRename(String, String, boolean, boolean) - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
-
Deprecated.
- addReplacement(ReplaceTagger.Replacement) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
-
Adds a replacement.
- addReplacement(ReplaceTransformer.Replacement) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
- addRestriction(PropertyMatcher...) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Adds one or more restrictions this handler should be restricted to.
- addRestriction(String, String, boolean) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Deprecated. Since 3.0.0, use AbstractImporterHandler.addRestriction(PropertyMatcher...).
- addRestrictions(List<PropertyMatcher>) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Adds restrictions this handler should be restricted to.
- addSelector(String) - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
- addSingleValueField(String, String) - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
-
Deprecated. Since 3.0.0, use ForceSingleValueTagger.setFieldMatcher(TextMatcher) and ForceSingleValueTagger.setAction(String).
- addSplit(String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
-
Deprecated.
- addSplit(String, String, String, boolean) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
-
Deprecated.
- addSplitDetails(SplitTagger.SplitDetails) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
- addStripBetweenDetails(StripBetweenTransformer.StripBetweenDetails) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
Adds strip between instructions.
- addStripEndpoints(String, String) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
Deprecated. Since 3.0.0, use StripBetweenTransformer.addStripBetweenDetails(StripBetweenDetails).
- addTextBetweenDetails(TextBetweenTagger.TextBetweenDetails) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
Adds text between instructions.
- addTextEndpoints(String, String, String) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
Deprecated. Since 3.0.0, use TextBetweenTagger.addTextBetweenDetails(TextBetweenDetails).
- addTikaMetadataToImporterMetadata(Metadata, Properties) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
- analyze(Reader, Properties, String) - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
- API_GOOGLE - Static variable in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- API_LINGO24 - Static variable in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- API_MICROSOFT - Static variable in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- API_MOSES - Static variable in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- API_YANDEX - Static variable in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- APPLY_BOTH - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- APPLY_FIELD - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- APPLY_VALUE - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- ARG_CHECKCFG - Static variable in class com.norconex.importer.ImporterLauncher
- ARG_IGNOREERRORS - Static variable in class com.norconex.importer.ImporterLauncher
- ARG_VARIABLES - Static variable in class com.norconex.importer.ImporterLauncher
- atLeastOneIncludeMatch() - Method in class com.norconex.importer.handler.HandlerContext.IncludeMatchResolver
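The addMetadataExtractionPattern and addPattern overloads listed above differ in what they capture: either the whole matched text, or the value of a specified group index. The sketch below illustrates only that distinction, using java.util.regex from the JDK. The class GroupIndexSketch and its extract helper are hypothetical names chosen for illustration; they are not part of the Norconex Importer API and do not claim to reflect how the library implements extraction.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class GroupIndexSketch {

    // Hypothetical helper (not a Norconex method): returns the text captured
    // by the given group of the first match, or null when nothing matches.
    // Group 0 is the whole match; group 1 is the first parenthesized group.
    static String extract(String text, String regex, int group) {
        Matcher m = Pattern.compile(regex).matcher(text);
        return m.find() ? m.group(group) : null;
    }

    public static void main(String[] args) {
        String line = "title=Annual Report";
        // Whole matched text (group 0)
        System.out.println(extract(line, "title=(.*)", 0)); // title=Annual Report
        // Only the captured value (group 1)
        System.out.println(extract(line, "title=(.*)", 1)); // Annual Report
    }
}
```

Which argument of the library methods carries the field name and which carries the pattern is not stated in this index, so consult the method documentation before relying on argument order.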
B
- BlankCondition - Class in com.norconex.importer.handler.condition.impl
-
A condition based on whether the document content (default) or any of the specified metadata fields are blank or nonexistent.
- BlankCondition() - Constructor for class com.norconex.importer.handler.condition.impl.BlankCondition
- BufferUtil - Class in com.norconex.importer.util
-
Buffer related utility methods.
- build() - Method in class com.norconex.importer.ImporterEvent.Builder
- Builder(String, Doc) - Constructor for class com.norconex.importer.ImporterEvent.Builder
C
- CASE_LOWER - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- CASE_SENTENCES - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- CASE_SENTENCES_FULLY - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- CASE_STRING - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- CASE_STRING_FULLY - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- CASE_SWAP - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- CASE_UPPER - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- CASE_WORDS - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- CASE_WORDS_FULLY - Static variable in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- CharacterCaseTagger - Class in com.norconex.importer.handler.tagger.impl
-
Changes the character case of matching fields and values according to one of the following methods:
- CharacterCaseTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- CharsetTagger - Class in com.norconex.importer.handler.tagger.impl
-
Converts one or more field values (if needed) from a source character encoding (charset) to a target one.
- CharsetTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.CharsetTagger
- CharsetTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Transforms a document's content (if needed) from a source character encoding (charset) to a target one.
- CharsetTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- CharsetUtil - Class in com.norconex.importer.util
-
Character set utility methods.
- clearRestrictions() - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Clears all restrictions.
- com.norconex.importer - package com.norconex.importer
- com.norconex.importer.doc - package com.norconex.importer.doc
- com.norconex.importer.handler - package com.norconex.importer.handler
- com.norconex.importer.handler.condition - package com.norconex.importer.handler.condition
- com.norconex.importer.handler.condition.impl - package com.norconex.importer.handler.condition.impl
- com.norconex.importer.handler.filter - package com.norconex.importer.handler.filter
- com.norconex.importer.handler.filter.impl - package com.norconex.importer.handler.filter.impl
- com.norconex.importer.handler.splitter - package com.norconex.importer.handler.splitter
- com.norconex.importer.handler.splitter.impl - package com.norconex.importer.handler.splitter.impl
- com.norconex.importer.handler.tagger - package com.norconex.importer.handler.tagger
- com.norconex.importer.handler.tagger.impl - package com.norconex.importer.handler.tagger.impl
- com.norconex.importer.handler.transformer - package com.norconex.importer.handler.transformer
- com.norconex.importer.handler.transformer.impl - package com.norconex.importer.handler.transformer.impl
- com.norconex.importer.parser - package com.norconex.importer.parser
- com.norconex.importer.parser.impl - package com.norconex.importer.parser.impl
- com.norconex.importer.parser.impl.xfdl - package com.norconex.importer.parser.impl.xfdl
- com.norconex.importer.response - package com.norconex.importer.response
- com.norconex.importer.util - package com.norconex.importer.util
- com.norconex.importer.util.regex - package com.norconex.importer.util.regex
- CommonMatchers - Class in com.norconex.importer.handler
-
Commonly used TextMatcher instances.
- CommonRestrictions - Class in com.norconex.importer.handler
-
Commonly encountered restrictions that can be applied to Properties instances.
- compileDotAll(String) - Static method in class com.norconex.importer.util.regex.RegexUtil
-
Deprecated. Compiles a case-insensitive "dotall" pattern (dots match all, including new lines).
- compileDotAll(String, boolean) - Static method in class com.norconex.importer.util.regex.RegexUtil
-
Deprecated. Compiles a case-insensitive "dotall" pattern (dots match all, including new lines).
- Condition(DateMetadataFilter.Operator, Supplier<ZonedDateTime>) - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter.Condition
- Condition(NumericMetadataFilter.Operator, double) - Constructor for class com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Condition
- ConstantTagger - Class in com.norconex.importer.handler.tagger.impl
-
Defines and adds constant values to documents.
- ConstantTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.ConstantTagger
- ConstantTagger.OnConflict - Enum in com.norconex.importer.handler.tagger.impl
-
Deprecated.
- CONTENT_ENCODING - Static variable in class com.norconex.importer.doc.DocMetadata
-
Document character encoding.
- CONTENT_FAMILY - Static variable in class com.norconex.importer.doc.DocMetadata
-
Document content family (general categorization of content types).
- CONTENT_TYPE - Static variable in class com.norconex.importer.doc.DocMetadata
-
Document content type (also called "media type", or "mime type").
- ContentTypeDetector - Class in com.norconex.importer.doc
-
Master class to detect all content types.
- convertCharset(InputStream, String, OutputStream, String) - Static method in class com.norconex.importer.util.CharsetUtil
-
Converts the character encoding of the supplied input.
- convertCharset(String, String, String) - Static method in class com.norconex.importer.util.CharsetUtil
-
Converts the character encoding of the supplied input value.
- copyFrom(DocInfo) - Method in class com.norconex.importer.doc.DocInfo
- CopyTagger - Class in com.norconex.importer.handler.tagger.impl
-
Copies metadata fields.
- CopyTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.CopyTagger
- copyTo(DocInfo) - Method in class com.norconex.importer.doc.DocInfo
- CountMatchesTagger - Class in com.norconex.importer.handler.tagger.impl
-
Counts the number of matches of a given string (or string pattern) and stores the resulting value in the field specified by "toField".
- CountMatchesTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
- CountMatchesTagger.MatchDetails - Class in com.norconex.importer.handler.tagger.impl
-
Deprecated.
- createBindings() - Method in class com.norconex.importer.handler.ScriptRunner
- createDateTime(ZoneId) - Method in class com.norconex.importer.handler.condition.impl.DateCondition.DynamicFixedDateTimeSupplier
- createDateTime(ZoneId) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.DynamicFixedDateTimeSupplier
- createRecursiveParser(String, String, Writer, Properties, CachedStreamFactory) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
- CsvSplitter - Class in com.norconex.importer.handler.splitter.impl
-
Splits files with comma-separated values (or values separated by any other character, such as tab) into one document per line.
- CsvSplitter() - Constructor for class com.norconex.importer.handler.splitter.impl.CsvSplitter
- CurrentDateTagger - Class in com.norconex.importer.handler.tagger.impl
-
Adds the current computer UTC date to the specified field.
- CurrentDateTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Constructor.
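The deprecated compileDotAll entries above describe a case-insensitive "dotall" pattern, where a dot also matches new lines. As a rough JDK-only illustration of what those two flags do, the sketch below combines Pattern.DOTALL and Pattern.CASE_INSENSITIVE. The class DotAllSketch and its compile helper are hypothetical, and the meaning given to the boolean argument (toggling case-insensitivity) is an assumption, since this index does not document the boolean parameter of compileDotAll(String, boolean).

```java
import java.util.regex.Pattern;

public class DotAllSketch {

    // Hypothetical helper (not the RegexUtil implementation). The boolean
    // is ASSUMED to toggle case-insensitive matching; DOTALL is always set
    // so that '.' also matches line terminators.
    static Pattern compile(String regex, boolean caseInsensitive) {
        int flags = Pattern.DOTALL;
        if (caseInsensitive) {
            flags |= Pattern.CASE_INSENSITIVE;
        }
        return Pattern.compile(regex, flags);
    }

    public static void main(String[] args) {
        String text = "BEGIN\nbody\nEND";
        // With DOTALL, ".*" spans the line breaks.
        System.out.println(compile("begin.*end", true).matcher(text).matches()); // true
        // Without DOTALL, '.' stops at the first new line.
        System.out.println(Pattern.compile("begin.*end", Pattern.CASE_INSENSITIVE)
                .matcher(text).matches()); // false
    }
}
```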
D
- DateCondition - Class in com.norconex.importer.handler.condition.impl
-
A condition based on the date value(s) of matching metadata fields given the supplied date format.
- DateCondition() - Constructor for class com.norconex.importer.handler.condition.impl.DateCondition
- DateCondition(TextMatcher) - Constructor for class com.norconex.importer.handler.condition.impl.DateCondition
- DateCondition(TextMatcher, DateCondition.ValueMatcher) - Constructor for class com.norconex.importer.handler.condition.impl.DateCondition
- DateCondition(TextMatcher, DateCondition.ValueMatcher, DateCondition.ValueMatcher) - Constructor for class com.norconex.importer.handler.condition.impl.DateCondition
- DateCondition.DynamicFixedDateTimeSupplier - Class in com.norconex.importer.handler.condition.impl
- DateCondition.DynamicFloatingDateTimeSupplier - Class in com.norconex.importer.handler.condition.impl
- DateCondition.StaticDateTimeSupplier - Class in com.norconex.importer.handler.condition.impl
- DateCondition.TimeUnit - Enum in com.norconex.importer.handler.condition.impl
- DateCondition.ValueMatcher - Class in com.norconex.importer.handler.condition.impl
- DateFormatTagger - Class in com.norconex.importer.handler.tagger.impl
-
Formats a date from any given format to a format of choice, as per the formatting options found on SimpleDateFormat, with the exception of the string "EPOCH", which represents the difference, measured in milliseconds, between the date and midnight, January 1, 1970.
- DateFormatTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Constructor.
- DateMetadataFilter - Class in com.norconex.importer.handler.filter.impl
-
Accepts or rejects a document based on whether field values correspond to a date matching supplied conditions and format.
- DateMetadataFilter() - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- DateMetadataFilter(TextMatcher) - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Constructor.
- DateMetadataFilter(TextMatcher, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- DateMetadataFilter(String) - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Deprecated. Since 3.0.0, use DateMetadataFilter(TextMatcher).
- DateMetadataFilter(String, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Deprecated. Since 3.0.0, use DateMetadataFilter(TextMatcher, OnMatch).
- DateMetadataFilter.Condition - Class in com.norconex.importer.handler.filter.impl
- DateMetadataFilter.DynamicFixedDateTimeSupplier - Class in com.norconex.importer.handler.filter.impl
- DateMetadataFilter.DynamicFloatingDateTimeSupplier - Class in com.norconex.importer.handler.filter.impl
- DateMetadataFilter.Operator - Enum in com.norconex.importer.handler.filter.impl
- DateMetadataFilter.StaticDateTimeSupplier - Class in com.norconex.importer.handler.filter.impl
- DateMetadataFilter.TimeUnit - Enum in com.norconex.importer.handler.filter.impl
- DAY - com.norconex.importer.handler.condition.impl.DateCondition.TimeUnit
- DAY - com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
- DebugTagger - Class in com.norconex.importer.handler.tagger.impl
-
A utility tagger to help with troubleshooting of document importing.
- DebugTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.DebugTagger
- DEFAULT_ESCAPE_CHARACTER - Static variable in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- DEFAULT_FIELD - Static variable in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
- DEFAULT_FIELD - Static variable in class com.norconex.importer.handler.tagger.impl.UUIDTagger
- DEFAULT_FILE - Static variable in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- DEFAULT_HEADING_MAX_LENGTH - Static variable in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- DEFAULT_HEADING_MIN_LENGTH - Static variable in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- DEFAULT_MAX_MEM_INSTANCE - Static variable in class com.norconex.importer.ImporterConfig
-
100 MB.
- DEFAULT_MAX_MEM_POOL - Static variable in class com.norconex.importer.ImporterConfig
-
1 GB.
- DEFAULT_MAX_READ_SIZE - Static variable in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- DEFAULT_MAX_SAMPLES - Static variable in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- DEFAULT_QUOTE_CHARACTER - Static variable in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- DEFAULT_REFERENCE_PAGE_PREFIX - Static variable in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- DEFAULT_SCRIPT_ENGINE - Static variable in class com.norconex.importer.handler.ScriptRunner
- DEFAULT_SEPARATOR_CHARACTER - Static variable in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- DEFAULT_TARGET_CHARSET - Static variable in class com.norconex.importer.handler.tagger.impl.CharsetTagger
- DEFAULT_TARGET_CHARSET - Static variable in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- DEFAULT_TARGET_FORMAT - Static variable in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- DEFAULT_TEMP_DIR_PATH - Static variable in class com.norconex.importer.ImporterConfig
- DEFAULT_TITLE_MAX_LENGTH - Static variable in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- DEFAULT_TO_FIELD - Static variable in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- DeleteTagger - Class in com.norconex.importer.handler.tagger.impl
-
Deletes the metadata fields provided.
- DeleteTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.DeleteTagger
- detect(File) - Static method in class com.norconex.importer.doc.ContentTypeDetector
-
Detects the content type of the given file.
- detect(File, String) - Static method in class com.norconex.importer.doc.ContentTypeDetector
-
Detects the content type of the given file.
- detect(InputStream) - Static method in class com.norconex.importer.doc.ContentTypeDetector
-
Detects the content type from the given input stream.
- detect(InputStream, String) - Static method in class com.norconex.importer.doc.ContentTypeDetector
-
Detects the content type from the given input stream.
- detectCharset(InputStream) - Static method in class com.norconex.importer.util.CharsetUtil
-
Detects the character encoding of an input stream.
- detectCharset(InputStream, String) - Static method in class com.norconex.importer.util.CharsetUtil
-
Detects the character encoding of an input stream.
- detectCharset(String) - Static method in class com.norconex.importer.util.CharsetUtil
-
Detects the character encoding of a string.
- detectCharset(String, String) - Static method in class com.norconex.importer.util.CharsetUtil
-
Detects the character encoding of a string.
- detectCharsetIfBlank(HandlerDoc, InputStream, String, ParseState) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Deprecated. Since 3.0.0, the charset is already detected, or use CharsetUtil.firstNonBlankOrUTF8(ParseState, String...).
- detectCharsetIfBlank(String, Doc) - Static method in class com.norconex.importer.util.CharsetUtil
-
Detects a document character encoding if the supplied charset is blank.
- detectCharsetIfBlank(String, InputStream) - Static method in class com.norconex.importer.util.CharsetUtil
-
Detects a document character encoding if the supplied charset is blank.
- detectsCharset(Doc) - Static method in class com.norconex.importer.util.CharsetUtil
-
Detects a document character encoding.
- dispose() - Method in class com.norconex.importer.doc.Doc
-
Disposes of any resources associated with this document (like disk or memory cache).
- Doc - Class in com.norconex.importer.doc
-
A document being imported.
- Doc(DocInfo, CachedInputStream) - Constructor for class com.norconex.importer.doc.Doc
-
Creates a blank importer document using the supplied input stream to handle content.
- Doc(DocInfo, CachedInputStream, Properties) - Constructor for class com.norconex.importer.doc.Doc
-
Creates a blank importer document using the supplied input stream to handle content.
- Doc(String, CachedInputStream) - Constructor for class com.norconex.importer.doc.Doc
- Doc(String, CachedInputStream, Properties) - Constructor for class com.norconex.importer.doc.Doc
- DOC_PDF_PAGE_NO - Static variable in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- DOC_PDF_TOTAL_PAGES - Static variable in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- DocInfo - Class in com.norconex.importer.doc
-
Important information about a document that has specific meaning and purpose for processing by the Importer and needs to be referenced in a constant way.
- DocInfo() - Constructor for class com.norconex.importer.doc.DocInfo
-
Constructor.
- DocInfo(DocInfo) - Constructor for class com.norconex.importer.doc.DocInfo
-
Copy constructor.
- DocInfo(String) - Constructor for class com.norconex.importer.doc.DocInfo
-
Constructor.
- DocMetadata - Class in com.norconex.importer.doc
-
Constants for common metadata field names typically associated with a document and often set on Doc.getMetadata().
- DocumentLengthTagger - Class in com.norconex.importer.handler.tagger.impl
-
Adds the document length (i.e., number of bytes) to the specified field.
- DocumentLengthTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
- DocumentParserException - Exception in com.norconex.importer.parser
-
Exception thrown upon encountering a non-recoverable issue parsing a document.
- DocumentParserException() - Constructor for exception com.norconex.importer.parser.DocumentParserException
- DocumentParserException(String) - Constructor for exception com.norconex.importer.parser.DocumentParserException
- DocumentParserException(String, Throwable) - Constructor for exception com.norconex.importer.parser.DocumentParserException
- DocumentParserException(Throwable) - Constructor for exception com.norconex.importer.parser.DocumentParserException
- DOM_CONTENT_TYPES - Static variable in class com.norconex.importer.handler.CommonMatchers
-
Content types representing a document object model.
- DOMCondition - Class in com.norconex.importer.handler.condition.impl
-
A condition using a Document Object Model (DOM) representation of an HTML, XHTML, or XML document content to match an element, attribute or value.
- DOMCondition() - Constructor for class com.norconex.importer.handler.condition.impl.DOMCondition
- DOMContentFilter - Class in com.norconex.importer.handler.filter.impl
-
Deprecated. Since 3.0.0, use DOMFilter.
- DOMContentFilter() - Constructor for class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated.
- DOMContentFilter(String) - Constructor for class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Since 3.0.0
- DOMContentFilter(String, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Since 3.0.0
- DOMContentFilter(String, OnMatch, boolean) - Constructor for class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Since 3.0.0
- domContentTypes() - Static method in class com.norconex.importer.handler.CommonMatchers
-
Matcher for common content-types defining a DOM document.
- domContentTypes(String) - Static method in class com.norconex.importer.handler.CommonRestrictions
-
Common content-types defining a DOM document.
- DOMDeleteTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Enables deletion of one or more elements matching a given selector from a document content.
- DOMDeleteTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
-
Constructor.
- DOMExtractDetails() - Constructor for class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- DOMExtractDetails() - Constructor for class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- DOMExtractDetails(String) - Constructor for class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- DOMExtractDetails(String, String) - Constructor for class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- DOMExtractDetails(String, String, boolean) - Constructor for class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Deprecated.
- DOMExtractDetails(String, String, boolean, String) - Constructor for class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Deprecated.
- DOMExtractDetails(String, String, PropertySetter) - Constructor for class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- DOMExtractDetails(String, String, PropertySetter, String) - Constructor for class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- DOMFilter - Class in com.norconex.importer.handler.filter.impl
-
Uses a Document Object Model (DOM) representation of an HTML, XHTML, or XML document content to perform filtering based on matching an element/attribute or element/attribute value.
- DOMFilter() - Constructor for class com.norconex.importer.handler.filter.impl.DOMFilter
- DOMPreserveTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Preserves only one or more elements matching a given selector from a document content.
- DOMPreserveTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
-
Constructor.
- DOMPreserveTransformer.DOMExtractDetails - Class in com.norconex.importer.handler.transformer.impl
-
DOM Extraction Details
- DOMSplitter - Class in com.norconex.importer.handler.splitter.impl
-
Splits HTML, XHTML, or XML document on elements matching a given selector.
- DOMSplitter() - Constructor for class com.norconex.importer.handler.splitter.impl.DOMSplitter
- DOMTagger - Class in com.norconex.importer.handler.tagger.impl
-
Extracts the value of one or more elements or attributes into a target field, or deletes matching elements.
- DOMTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Constructor.
- DOMTagger.DOMExtractDetails - Class in com.norconex.importer.handler.tagger.impl
-
DOM Extraction Details
- DOMUtil - Class in com.norconex.importer.util
-
Utility methods related to JSoup/DOM manipulation.
- DynamicFixedDateTimeSupplier(DateCondition.TimeUnit, int, boolean, ZoneId) - Constructor for class com.norconex.importer.handler.condition.impl.DateCondition.DynamicFixedDateTimeSupplier
- DynamicFixedDateTimeSupplier(DateMetadataFilter.TimeUnit, int, boolean, ZoneId) - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter.DynamicFixedDateTimeSupplier
- DynamicFloatingDateTimeSupplier(DateCondition.TimeUnit, int, boolean, ZoneId) - Constructor for class com.norconex.importer.handler.condition.impl.DateCondition.DynamicFloatingDateTimeSupplier
- DynamicFloatingDateTimeSupplier(DateMetadataFilter.TimeUnit, int, boolean, ZoneId) - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter.DynamicFloatingDateTimeSupplier
E
- EMBEDDED_INDEX - Static variable in class com.norconex.importer.doc.DocMetadata
-
Zero-based index of this embedded document in relation to siblings.
- EMBEDDED_PARENT_REFERENCES - Static variable in class com.norconex.importer.doc.DocMetadata
-
All references to parents of an embedded document (first is top-one).
- EMBEDDED_REFERENCE - Static variable in class com.norconex.importer.doc.DocMetadata
-
Relative reference to this document within its parent.
- EMBEDDED_TYPE - Static variable in class com.norconex.importer.doc.DocMetadata
-
Type of embedded file (from a zip, a word doc, etc.).
- EmbeddedConfig - Class in com.norconex.importer.parser
-
Configuration settings affecting how embedded documents are handled by parsers.
- EmbeddedConfig() - Constructor for class com.norconex.importer.parser.EmbeddedConfig
- EMPTY_ARRAY - Static variable in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- EMPTY_RESPONSES - Static variable in class com.norconex.importer.response.ImporterResponse
- EmptyFilter - Class in com.norconex.importer.handler.filter.impl
-
Accepts or rejects a document based on whether its content (default) or any of the specified metadata fields are empty or not.
- EmptyFilter() - Constructor for class com.norconex.importer.handler.filter.impl.EmptyFilter
- EmptyMetadataFilter - Class in com.norconex.importer.handler.filter.impl
-
Deprecated. Since 3.0.0, use EmptyFilter.
- EmptyMetadataFilter() - Constructor for class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
Deprecated.
- EmptyMetadataFilter(OnMatch, String...) - Constructor for class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
Deprecated.
- equals(Object) - Method in class com.norconex.importer.doc.Doc
- equals(Object) - Method in class com.norconex.importer.doc.DocInfo
- equals(Object) - Method in class com.norconex.importer.handler.AbstractImporterHandler
- equals(Object) - Method in class com.norconex.importer.handler.condition.AbstractCharStreamCondition
- equals(Object) - Method in class com.norconex.importer.handler.condition.AbstractStringCondition
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.BlankCondition
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.DateCondition.DynamicFixedDateTimeSupplier
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.DateCondition.DynamicFloatingDateTimeSupplier
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.DateCondition.StaticDateTimeSupplier
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.DateCondition.ValueMatcher
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.NumericCondition.ValueMatcher
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.ReferenceCondition
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.ScriptCondition
- equals(Object) - Method in class com.norconex.importer.handler.condition.impl.TextCondition
- equals(Object) - Method in class com.norconex.importer.handler.ExternalHandler
- equals(Object) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
- equals(Object) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
- equals(Object) - Method in class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
Deprecated.
- equals(Object) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.Condition
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.DynamicFixedDateTimeSupplier
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.DynamicFloatingDateTimeSupplier
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.StaticDateTimeSupplier
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated.
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.EmptyFilter
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
Deprecated.
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Condition
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.ReferenceFilter
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.RejectFilter
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
- equals(Object) - Method in class com.norconex.importer.handler.filter.impl.TextFilter
- equals(Object) - Method in class com.norconex.importer.handler.HandlerConsumer
- equals(Object) - Method in class com.norconex.importer.handler.HandlerDoc
- equals(Object) - Method in class com.norconex.importer.handler.HandlerPredicate
- equals(Object) - Method in class com.norconex.importer.handler.ScriptRunner
- equals(Object) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- equals(Object) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
- equals(Object) - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- equals(Object) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- equals(Object) - Method in class com.norconex.importer.handler.splitter.impl.XMLStreamSplitter
- equals(Object) - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.CopyTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated.
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger.RenameDetails
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated.
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
- equals(Object) - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
- equals(Object) - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer.Replacement
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer.StripBetweenDetails
- equals(Object) - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
- equals(Object) - Method in class com.norconex.importer.ImporterConfig
- equals(Object) - Method in class com.norconex.importer.ImporterEvent
- equals(Object) - Method in class com.norconex.importer.ImporterRequest
- equals(Object) - Method in class com.norconex.importer.parser.EmbeddedConfig
- equals(Object) - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
- equals(Object) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
- equals(Object) - Method in class com.norconex.importer.parser.impl.ExternalParser
- equals(Object) - Method in class com.norconex.importer.parser.impl.xfdl.XFDLParser
- equals(Object) - Method in class com.norconex.importer.parser.OCRConfig
- equals(Object) - Method in class com.norconex.importer.parser.ParseHints
- equals(Object) - Method in class com.norconex.importer.response.ImporterResponse
- equals(Object) - Method in class com.norconex.importer.response.ImporterStatus
- equals(Object) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- EQUALS - com.norconex.importer.handler.filter.impl.DateMetadataFilter.Operator
- EQUALS - com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Operator
- ERROR - com.norconex.importer.response.ImporterStatus.Status
- eval(Bindings) - Method in class com.norconex.importer.handler.ScriptRunner
- evaluate(double, double) - Method in enum com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Operator
- evaluate(ZonedDateTime, ZonedDateTime) - Method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.Operator
- EXCLUDE - com.norconex.importer.handler.filter.OnMatch
- excludeIfNull(OnMatch) - Static method in enum com.norconex.importer.handler.filter.OnMatch
- ExternalHandler - Class in com.norconex.importer.handler
-
Class executing an external application to extract data from and/or manipulate a document.
- ExternalHandler() - Constructor for class com.norconex.importer.handler.ExternalHandler
- ExternalParser - Class in com.norconex.importer.parser.impl
-
Parses and extracts text from a file using an external application to do so.
- ExternalParser() - Constructor for class com.norconex.importer.parser.impl.ExternalParser
- ExternalTagger - Class in com.norconex.importer.handler.tagger.impl
-
Extracts metadata from a document using an external application to do so.
- ExternalTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.ExternalTagger
- ExternalTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Transforms a document using an external application to do so.
- ExternalTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.ExternalTransformer
- extractFields(Properties, CharSequence) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- extractFields(Properties, CharSequence, RegexFieldExtractor...) - Static method in class com.norconex.importer.util.regex.RegexUtil
-
Deprecated.
- extractFields(CharSequence) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- extractFields(CharSequence, RegexFieldExtractor...) - Static method in class com.norconex.importer.util.regex.RegexUtil
-
Deprecated.
F
- FallbackParser - Class in com.norconex.importer.parser.impl
-
Parser using auto-detection of document content-type to figure out which specific parser to invoke to best parse a document.
- FallbackParser() - Constructor for class com.norconex.importer.parser.impl.FallbackParser
-
Creates a new parser.
- FieldReportTagger - Class in com.norconex.importer.handler.tagger.impl
-
A utility tagger that reports in a CSV file the fields discovered in a crawl session, captured at the point of your choice in the importing process.
- FieldReportTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- firstNonBlankOrUTF8(ParseState, String...) - Static method in class com.norconex.importer.util.CharsetUtil
-
Returns the first non-blank character encoding, or returns UTF-8 if they are all blank or in post-parse state.
- firstNonBlankOrUTF8(String...) - Static method in class com.norconex.importer.util.CharsetUtil
-
Returns the first non-blank character encoding, or returns UTF-8 if they are all blank.
- flushBuffer(StringBuilder, Writer, boolean) - Static method in class com.norconex.importer.util.BufferUtil
-
Flushes the buffer to output stream.
- ForceSingleValueTagger - Class in com.norconex.importer.handler.tagger.impl
-
Forces a metadata field to be single-value.
- ForceSingleValueTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
- formatDateString(String, String, String) - Static method in class com.norconex.importer.util.FormatUtil
-
Formats a string representation of a date, into another string date format.
- formatDateString(String, String, String, String) - Static method in class com.norconex.importer.util.FormatUtil
-
Formats a string representation of a date, into another string date format.
- formatDateString(String, String, Locale, String, Locale, String) - Static method in class com.norconex.importer.util.FormatUtil
-
Formats a string representation of a date, into another string date format.
- FormatUtil - Class in com.norconex.importer.util
-
Utility methods related to formatting.
- fromHandlers(IImporterHandler...) - Static method in class com.norconex.importer.handler.HandlerConsumer
- fromHandlers(List<IImporterHandler>) - Static method in class com.norconex.importer.handler.HandlerConsumer
G
- GENERATED_TITLE - Static variable in class com.norconex.importer.doc.DocMetadata
-
Generated title.
- GenericDocumentParserFactory - Class in com.norconex.importer.parser
-
Generic document parser factory.
- GenericDocumentParserFactory() - Constructor for class com.norconex.importer.parser.GenericDocumentParserFactory
-
Creates a new document parser factory of the given format.
- get() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.DynamicFixedDateTimeSupplier
- get() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.DynamicFloatingDateTimeSupplier
- get() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.StaticDateTimeSupplier
- get() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.DynamicFixedDateTimeSupplier
- get() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.DynamicFloatingDateTimeSupplier
- get() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.StaticDateTimeSupplier
- get() - Static method in class com.norconex.importer.Importer
- getAction() - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
-
Gets action.
- getApi() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- getApiKey() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- getApplyTo() - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Gets whether to apply the case transformation to fields, values, or both.
- getApplyTo(String) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Deprecated. Since 3.0.0, use CharacterCaseTagger.getApplyTo().
- getBegin() - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
- getCaseType() - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Gets the type of character case transformation.
- getCaseType(String) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Deprecated. Since 3.0.0, use CharacterCaseTagger.getCaseType().
- getChildDocs() - Method in class com.norconex.importer.handler.HandlerContext
- getClientId() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- getClientSecret() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- getCommand() - Method in class com.norconex.importer.handler.ExternalHandler
-
Gets the command to execute.
- getCommand() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets the command to execute.
- getCommand() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets the command to execute.
- getCommand() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets the command to execute.
- getCondition() - Method in class com.norconex.importer.handler.HandlerPredicate
- getConditions() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Gets the list of date filter conditions for this filter.
- getConditions() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- getConstants() - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
- getContent() - Method in class com.norconex.importer.doc.Doc
-
Deprecated. Since 3.0.0, use Doc.getInputStream().
- getContentColumns() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- getContentEncoding() - Method in class com.norconex.importer.doc.DocInfo
- getContentEncoding() - Method in class com.norconex.importer.ImporterRequest
- getContentType() - Method in class com.norconex.importer.doc.DocInfo
- getContentType() - Method in class com.norconex.importer.ImporterRequest
- getContentTypeMatcher() - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
-
Gets this condition content-type matcher.
- getContentTypes() - Method in class com.norconex.importer.parser.OCRConfig
-
Gets the regular expression matching content types to restrict OCR to.
- getCountMatcher() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Gets the count matcher.
- getCropRectangle() - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- getDateTime() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.ValueMatcher
- getDateTime() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.Condition
- getDateTimeSupplier() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.ValueMatcher
- getDateTimeSupplier() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.Condition
- getDefaultValue() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- getDefaultValue() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- getDescription() - Method in class com.norconex.importer.response.ImporterStatus
- getDetectHeadingMaxLength() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- getDetectHeadingMinLength() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- getDoc() - Method in class com.norconex.importer.handler.HandlerContext
- getDocInfo() - Method in class com.norconex.importer.doc.Doc
- getDocInfo() - Method in class com.norconex.importer.handler.HandlerDoc
- getDocument() - Method in class com.norconex.importer.response.ImporterResponse
- getDocZoneId() - Method in class com.norconex.importer.handler.condition.impl.DateCondition
-
Gets the time zone ID that documents are considered to be in.
- getDocZoneId() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Gets the time zone ID that documents are considered to be in.
- getDOMExtractDetailsList() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Gets a list of DOM extraction details.
- getDOMExtractDetailsList() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
-
Gets a list of DOM extraction details.
- getElementValue(Element, String) - Static method in class com.norconex.importer.util.DOMUtil
-
Gets an element value based on JSoup DOM.
- getEmbeddedConfig() - Method in class com.norconex.importer.parser.ParseHints
- getEmbeddedDocuments() - Method in class com.norconex.importer.parser.impl.AbstractTikaParser.MergeEmbeddedParser
- getEmbeddedDocuments() - Method in interface com.norconex.importer.parser.impl.AbstractTikaParser.RecursiveParser
- getEmbeddedDocuments() - Method in class com.norconex.importer.parser.impl.AbstractTikaParser.SplitEmbbededParser
- getEmbeddedParentReferences() - Method in class com.norconex.importer.doc.DocInfo
- getEnd() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Deprecated. Since 3.0.0, use TextBetweenTagger.TextBetweenDetails.getEndMatcher().
- getEnd() - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
- getEndMatcher() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Gets the end delimiter matcher for text to extract.
- getEndMatcher() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer.StripBetweenDetails
-
Gets the end delimiter matcher for text to strip.
- getEngineName() - Method in class com.norconex.importer.handler.condition.impl.ScriptCondition
- getEngineName() - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
- getEngineName() - Method in class com.norconex.importer.handler.ScriptRunner
- getEngineName() - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
- getEngineName() - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
- getEnvironmentVariables() - Method in class com.norconex.importer.handler.ExternalHandler
-
Gets environment variables.
- getEnvironmentVariables() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets environment variables.
- getEnvironmentVariables() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets environment variables.
- getEnvironmentVariables() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets environment variables.
- getEscapeCharacter() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Gets the escape character.
- getEventManager() - Method in class com.norconex.importer.handler.HandlerContext
- getEventManager() - Method in class com.norconex.importer.Importer
-
Gets the event manager.
- getException() - Method in class com.norconex.importer.response.ImporterStatus
- getExtract() - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
-
Gets what should be extracted for the value.
- getExtract() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Gets what should be extracted for the value.
- getExtract() - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
-
Gets what should be extracted for the value.
- getExtract() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- getExtract() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- getFallbackLanguage() - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
- getField() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Deprecated. Since 3.0.0, use DateMetadataFilter.getFieldMatcher().
- getField() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
Deprecated. Since 3.0.0, use NumericMetadataFilter.getFieldMatcher().
- getField() - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- getField() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Deprecated. Since 3.0.0, use CurrentDateTagger.getToField().
- getField() - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
Deprecated. Since 3.0.0, use DocumentLengthTagger.getToField().
- getField() - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
Deprecated. Since 3.0.0, use UUIDTagger.getToField().
- getField() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- getFieldGroup() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- getFieldMatcher() - Method in class com.norconex.importer.handler.condition.impl.BlankCondition
- getFieldMatcher() - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- getFieldMatcher() - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
-
Gets this condition's field matcher.
- getFieldMatcher() - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
-
Gets the text matcher of field names.
- getFieldMatcher() - Method in class com.norconex.importer.handler.condition.impl.TextCondition
-
Gets the text matcher of field names.
- getFieldMatcher() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- getFieldMatcher() - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
-
Gets a copy of this filter's field matcher.
- getFieldMatcher() - Method in class com.norconex.importer.handler.filter.impl.EmptyFilter
- getFieldMatcher() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- getFieldMatcher() - Method in class com.norconex.importer.handler.filter.impl.TextFilter
-
Gets the text matcher of field names.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Gets field matcher.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
Gets field matcher.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Gets the field matcher.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
Gets field matcher for fields to delete.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
-
Gets field matcher.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
Gets field matcher.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
Gets field matcher.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
-
Gets source field matcher for fields on which to extract fields/values.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Gets field matcher.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Gets field matcher for fields to split.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Gets field matcher for fields on which to extract values.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
-
Gets field matcher for fields to split.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
Gets field matcher for fields to truncate.
- getFieldMatcher() - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
-
Gets field matcher for fields containing text.
- getFieldName() - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
-
Deprecated. Since 3.0.0, use TextStatisticsTagger.getFieldMatcher().
- getFieldNames() - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Deprecated. Since 3.0.0, use CharacterCaseTagger.getFieldMatcher().
- getFields() - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
Deprecated.
- getFields() - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
Deprecated. Since 3.0.0, use DeleteTagger.getFieldMatcher().
- getFields() - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
Deprecated. Since 3.0.0, use KeepOnlyTagger.getFieldMatcher().
- getFieldsRegex() - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
Deprecated. Since 3.0.0, use CharsetTagger.getFieldMatcher().
- getFieldsRegex() - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
Deprecated. Since 3.0.0, use DeleteTagger.getFieldMatcher().
- getFieldsRegex() - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
Deprecated. Since 3.0.0, use KeepOnlyTagger.getFieldMatcher().
- getFieldsToTranslate() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- getFile() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- getFile() - Method in class com.norconex.importer.ImporterRequest
- getFormat() - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- getFormat() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- getFormat() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated.
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Gets optional source field holding the HTML content to apply DOM extraction to.
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Deprecated. Since 3.0.0, use SplitTagger.SplitDetails.getFieldMatcher() instead.
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- getFromField() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
Deprecated. Since 3.0.0, use TruncateTagger.getFieldMatcher() instead.
- getFromFields() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
Deprecated. Since 3.0.0, use MergeTagger.Merge.getFieldMatcher().
- getFromFieldsRegex() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
Deprecated. Since 3.0.0, use MergeTagger.Merge.getFieldMatcher().
- getFromFormats() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Gets the source date formats to match.
- getFromLocale() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Gets the locale used for parsing the source date.
- getFromSeparator() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- getFromValue() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- getHandler() - Method in class com.norconex.importer.handler.HandlerConsumer
- getHierarchyDetails() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
- getIgnoredContentTypesRegex() - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
Gets the regular expression matching content types to ignore (i.e. do not perform parsing on them).
- getImporterConfig() - Method in class com.norconex.importer.Importer
-
Gets the importer configuration.
- getImporterStatus() - Method in class com.norconex.importer.response.ImporterResponse
- getIncludeResolver() - Method in class com.norconex.importer.handler.HandlerContext
- getInputStream() - Method in class com.norconex.importer.doc.Doc
- getInputStream() - Method in class com.norconex.importer.ImporterRequest
- getLanguages() - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
- getLanguages() - Method in class com.norconex.importer.parser.OCRConfig
-
Gets the languages to be used by OCR.
- getLinesToSkip() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Gets how many lines to skip before starting to parse lines.
- getLocale() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Gets the locale used for formatting.
- getLogFields() - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- getLogLevel() - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- getMatchesDetails() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Deprecated. Since 3.0.0, use CountMatchesTagger.getToField(), CountMatchesTagger.getFieldMatcher(), and CountMatchesTagger.getCountMatcher().
- getMaxFileCacheSize() - Method in class com.norconex.importer.ImporterConfig
-
Deprecated. Since 3.0.0, use ImporterConfig.getMaxMemoryInstance().
- getMaxFilePoolCacheSize() - Method in class com.norconex.importer.ImporterConfig
-
Deprecated. Since 3.0.0, use ImporterConfig.getMaxMemoryPool().
- getMaxLength() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- getMaxMemoryInstance() - Method in class com.norconex.importer.ImporterConfig
-
Gets the maximum number of bytes used for memory caching of a single document being processed.
- getMaxMemoryPool() - Method in class com.norconex.importer.ImporterConfig
-
Gets the maximum number of bytes used for memory caching of data for all documents concurrently being processed.
- getMaxReadSize() - Method in class com.norconex.importer.handler.condition.AbstractStringCondition
-
Gets the maximum number of characters to read for filtering at once.
- getMaxReadSize() - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
-
Gets the maximum number of characters to read for filtering at once.
- getMaxReadSize() - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
-
Gets the maximum number of characters to read from content for tagging at once.
- getMaxReadSize() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Gets the maximum number of characters to read from content for tagging at once.
- getMaxReadSize() - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
-
Gets the maximum number of characters to read from content for tagging at once.
- getMaxReadSize() - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
-
Gets the maximum number of characters to read and transform at once.
- getMaxSamples() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- getMerges() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger
- getMetadata() - Method in class com.norconex.importer.doc.Doc
- getMetadata() - Method in class com.norconex.importer.handler.HandlerDoc
- getMetadata() - Method in class com.norconex.importer.ImporterRequest
- getMetadataExtractionPatterns() - Method in class com.norconex.importer.handler.ExternalHandler
-
Gets metadata extraction patterns.
- getMetadataExtractionPatterns() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets metadata extraction patterns.
- getMetadataExtractionPatterns() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets metadata extraction patterns.
- getMetadataExtractionPatterns() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets metadata extraction patterns.
- getMetadataInputFormat() - Method in class com.norconex.importer.handler.ExternalHandler
-
Gets the format of the metadata input file sent to the external application.
- getMetadataInputFormat() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets the format of the metadata input file sent to the external application.
- getMetadataInputFormat() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets the format of the metadata input file sent to the external application.
- getMetadataInputFormat() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets the format of the metadata input file sent to the external application.
- getMetadataOutputFormat() - Method in class com.norconex.importer.handler.ExternalHandler
-
Gets the format of the metadata output file from the external application.
- getMetadataOutputFormat() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets the format of the metadata output file from the external application.
- getMetadataOutputFormat() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets the format of the metadata output file from the external application.
- getMetadataOutputFormat() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets the format of the metadata output file from the external application.
- getName() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Deprecated. Since 3.0.0, use TextBetweenTagger.TextBetweenDetails.getToField() instead.
- getNestedResponses() - Method in class com.norconex.importer.response.ImporterResponse
- getNoExtractContainerContentTypes() - Method in class com.norconex.importer.parser.EmbeddedConfig
- getNoExtractEmbeddedContentTypes() - Method in class com.norconex.importer.parser.EmbeddedConfig
- getNumber() - Method in class com.norconex.importer.handler.condition.impl.NumericCondition.ValueMatcher
- getNumber() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Condition
- getOcrConfig() - Method in class com.norconex.importer.parser.ParseHints
- getOCRConfig() - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
-
Deprecated.
- getOnConflict() - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
-
Deprecated. Since 3.0.0, use ConstantTagger.getOnSet() instead.
- getOnMatch() - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
- getOnMatch() - Method in class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
Deprecated.
- getOnMatch() - Method in interface com.norconex.importer.handler.filter.IOnMatchFilter
-
Gets the on-match action (exclude or include).
- getOnSet() - Method in class com.norconex.importer.handler.ExternalHandler
-
Gets the property setter to use when a metadata value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets the property setter to use when a metadata value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
Gets the property setter to use when a value is set.
- getOnSet() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets the property setter to use when a metadata value is set.
- getOnSet() - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
- getOnSet() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets the property setter to use when a metadata value is set.
- getOperator() - Method in class com.norconex.importer.handler.condition.impl.NumericCondition.ValueMatcher
- getOperator() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Condition
- getOperator(String) - Static method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.Operator
- getOperator(String) - Static method in enum com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Operator
- getParentResponse() - Method in class com.norconex.importer.response.ImporterResponse
- getParseErrorsSaveDir() - Method in class com.norconex.importer.ImporterConfig
-
Gets the directory where files generating parsing errors will be saved.
- getParseHints() - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
Gets parse hints.
- getParser() - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
-
Gets the parser to use when creating the DOM-tree.
- getParser() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Gets the parser to use when creating the DOM-tree.
- getParser() - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
-
Gets the parser to use when creating the DOM-tree.
- getParser() - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
Gets the parser to use when creating the DOM-tree.
- getParser() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Gets the parser to use when creating the DOM-tree.
- getParser() - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
-
Gets the parser to use when creating the DOM-tree.
- getParser() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
-
Gets the parser to use when creating the DOM-tree.
- getParser(String, ContentType) - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
Gets a parser based on content type, regardless of document reference (ignoring it).
- getParser(String, ContentType) - Method in interface com.norconex.importer.parser.IDocumentParserFactory
-
Gets a document parser, optionally based on its reference or content type.
- getParserFactory() - Method in class com.norconex.importer.ImporterConfig
- getParseState() - Method in class com.norconex.importer.handler.HandlerContext
- getParseState() - Method in class com.norconex.importer.ImporterEvent
- getPath() - Method in class com.norconex.importer.handler.splitter.impl.XMLStreamSplitter
- getPath() - Method in class com.norconex.importer.parser.OCRConfig
-
Gets the Tesseract OCR engine executable file path.
- getPatterns() - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
-
Gets the patterns used to extract matching field names/values.
- getPatterns() - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated. Gets the patterns used to extract matching field names/values.
- getPostParseConsumer() - Method in class com.norconex.importer.ImporterConfig
-
Gets the Consumer to be executed on documents after their parsing has occurred.
- getPostParseHandlers() - Method in class com.norconex.importer.ImporterConfig
-
Deprecated. Since 3.0.0, use ImporterConfig.getPostParseConsumer() instead.
- getPrefix() - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
-
Gets the prefix to print before the actual log message.
- getPreParseConsumer() - Method in class com.norconex.importer.ImporterConfig
-
Gets the Consumer to be executed on documents before their parsing has occurred.
- getPreParseHandlers() - Method in class com.norconex.importer.ImporterConfig
-
Deprecated. Since 3.0.0, use ImporterConfig.getPreParseConsumer() instead.
- getQuoteCharacter() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Gets the quote character surrounding values.
- getReductions() - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
- getReference() - Method in class com.norconex.importer.doc.Doc
-
Gets the document reference.
- getReference() - Method in class com.norconex.importer.doc.DocInfo
- getReference() - Method in class com.norconex.importer.handler.HandlerDoc
- getReference() - Method in class com.norconex.importer.ImporterRequest
- getReference() - Method in class com.norconex.importer.response.ImporterResponse
- getReferenceColumn() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- getReferencePagePrefix() - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- getRegex() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Since 3.0.0, use DOMContentFilter.getValueMatcher().
- getRegex() - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- getRegex() - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- getRegex() - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- getRegex() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- getRejectedBy() - Method in class com.norconex.importer.handler.HandlerContext
- getRejectionFilter() - Method in class com.norconex.importer.response.ImporterStatus
- getReplacements() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
- getReplacements() - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
- getResponseProcessors() - Method in class com.norconex.importer.ImporterConfig
- getRestrictions() - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Gets all restrictions.
- getRotateDegrees() - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- getScaleDimension() - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- getScaleFactor() - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- getScript() - Method in class com.norconex.importer.handler.condition.impl.ScriptCondition
- getScript() - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
- getScript() - Method in class com.norconex.importer.handler.ScriptRunner
- getScript() - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
- getScript() - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
- getScriptPath() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- getSelector() - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
- getSelector() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated.
- getSelector() - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
- getSelector() - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
- getSelector() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- getSelector() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- getSelectors() - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
- getSeparator() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
- getSeparatorCharacter() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Gets the value-separator character.
- getSingleValueFields() - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
-
Deprecated. Since 3.0.0, use ForceSingleValueTagger.getFieldMatcher().
- getSingleValueSeparator() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- getSmtPath() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- getSource() - Method in class com.norconex.importer.ImporterEvent
- getSourceCharset() - Method in class com.norconex.importer.handler.condition.AbstractCharStreamCondition
-
Gets the presumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
- getSourceCharset() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- getSourceCharset() - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
-
Gets the assumed source character encoding.
- getSourceCharset() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
-
Gets the assumed source character encoding.
- getSourceLanguage() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- getSourceLanguageField() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- getSplitContentTypes() - Method in class com.norconex.importer.parser.EmbeddedConfig
- getSplitDetailsList() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
- getSplits() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
-
Deprecated.
- getStart() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Deprecated. Since 3.0.0, use TextBetweenTagger.TextBetweenDetails.getStartMatcher().
- getStartMatcher() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Gets the start delimiter matcher for text to extract.
- getStartMatcher() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer.StripBetweenDetails
-
Gets the start delimiter matcher for text to strip.
- getStatus() - Method in class com.norconex.importer.response.ImporterStatus
- getStreamFactory() - Method in class com.norconex.importer.doc.Doc
- getStreamFactory() - Method in class com.norconex.importer.handler.HandlerDoc
- getStripAfterMatcher() - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
Gets the matcher for the text from which to strip content.
- getStripAfterRegex() - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
Deprecated. Since 3.0.0, use StripAfterTransformer.getStripAfterMatcher().
- getStripBeforeMatcher() - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
Gets the matcher for the text up to which to strip content.
- getStripBeforeRegex() - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
Deprecated. Since 3.0.0, use StripBeforeTransformer.getStripBeforeMatcher().
- getStripBetweenDetailsList() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
Gets text between instructions.
- getStripEndpoints() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
Deprecated. Since 3.0.0, use StripBetweenTransformer.getStripBetweenDetailsList().
- getSubject() - Method in class com.norconex.importer.ImporterEvent
- getSuffix() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- getTargetCharset() - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
- getTargetCharset() - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- getTargetFormat() - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- getTargetLanguages() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- getTempDir() - Method in class com.norconex.importer.handler.ExternalHandler
-
Gets directory where to store temporary files sent to the external handler as file paths.
- getTempDir() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets directory where to store temporary files used for transformation.
- getTempDir() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Gets directory where to store temporary files used for transformation.
- getTempDir() - Method in class com.norconex.importer.ImporterConfig
-
Gets the temporary directory where files can be deleted safely by the OS or any other processes when the Importer is not running.
- getTempDir() - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Gets directory where to store temporary files used for transformation.
- getTextBetweenDetailsList() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
Gets text between instructions.
- getTimeUnit(String) - Static method in enum com.norconex.importer.handler.condition.impl.DateCondition.TimeUnit
- getTimeUnit(String) - Static method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
- getTitleMaxLength() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Gets the target field.
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated.
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Gets the target field.
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
Gets the target field.
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
- getToField() - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
Gets the target field.
- getToField() - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
- getToFormat() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- getToLocale() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Gets the locale used for formatting the target date.
- getToSeparator() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- getToValue() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
- getToValue() - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer.Replacement
- getTruncateSamplesAt() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- getUserKey() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- getValue() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated.
- getValueGroup() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- getValueMatcher() - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- getValueMatcher() - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
-
Gets this condition value matcher.
- getValueMatcher() - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
- getValueMatcher() - Method in class com.norconex.importer.handler.condition.impl.ReferenceCondition
-
Gets the text matcher for field values.
- getValueMatcher() - Method in class com.norconex.importer.handler.condition.impl.TextCondition
-
Gets the text matcher for content or field values.
- getValueMatcher() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Gets this filter text matcher (copy).
- getValueMatcher() - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
-
Gets this filter value matcher (copy).
- getValueMatcher() - Method in class com.norconex.importer.handler.filter.impl.ReferenceFilter
-
Gets the text matcher for field values.
- getValueMatcher() - Method in class com.norconex.importer.handler.filter.impl.TextFilter
-
Gets the text matcher for field values.
- getValueMatcher() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Gets value matcher.
- getValueMatcher() - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer.Replacement
-
Gets value matcher.
- getValueMatcherRangeEnd() - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- getValueMatcherRangeEnd() - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
- GREATER_EQUAL - com.norconex.importer.handler.filter.impl.DateMetadataFilter.Operator
- GREATER_EQUAL - com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Operator
- GREATER_THAN - com.norconex.importer.handler.filter.impl.DateMetadataFilter.Operator
- GREATER_THAN - com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Operator
H
- handleDocument(HandlerDoc, InputStream, OutputStream) - Method in class com.norconex.importer.handler.ExternalHandler
-
Invokes the external application on a document.
- HandlerConsumer - Class in com.norconex.importer.handler
-
Consumer wrapping an IImporterHandler instance for use in an XMLFlow.
- HandlerConsumer() - Constructor for class com.norconex.importer.handler.HandlerConsumer
- HandlerConsumer(IImporterHandler) - Constructor for class com.norconex.importer.handler.HandlerConsumer
- HandlerContext - Class in com.norconex.importer.handler
- HandlerContext(Doc, EventManager, ParseState) - Constructor for class com.norconex.importer.handler.HandlerContext
- HandlerContext.IncludeMatchResolver - Class in com.norconex.importer.handler
- HandlerDoc - Class in com.norconex.importer.handler
-
Lighter version of Doc which leaves content out to let each handler dictate how content should be referenced.
- HandlerDoc(Doc) - Constructor for class com.norconex.importer.handler.HandlerDoc
- HandlerPredicate - Class in com.norconex.importer.handler
-
Predicate wrapping an IImporterCondition instance for use in an XMLFlow.
- HandlerPredicate() - Constructor for class com.norconex.importer.handler.HandlerPredicate
- HandlerPredicate(IImporterCondition) - Constructor for class com.norconex.importer.handler.HandlerPredicate
- hashCode() - Method in class com.norconex.importer.doc.Doc
- hashCode() - Method in class com.norconex.importer.doc.DocInfo
- hashCode() - Method in class com.norconex.importer.handler.AbstractImporterHandler
- hashCode() - Method in class com.norconex.importer.handler.condition.AbstractCharStreamCondition
- hashCode() - Method in class com.norconex.importer.handler.condition.AbstractStringCondition
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.BlankCondition
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.DynamicFixedDateTimeSupplier
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.DynamicFloatingDateTimeSupplier
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.StaticDateTimeSupplier
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.ValueMatcher
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.NumericCondition.ValueMatcher
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.ReferenceCondition
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.ScriptCondition
- hashCode() - Method in class com.norconex.importer.handler.condition.impl.TextCondition
- hashCode() - Method in class com.norconex.importer.handler.ExternalHandler
- hashCode() - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
- hashCode() - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
- hashCode() - Method in class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
Deprecated.
- hashCode() - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.Condition
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.DynamicFixedDateTimeSupplier
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.DynamicFloatingDateTimeSupplier
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.StaticDateTimeSupplier
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated.
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.EmptyFilter
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
Deprecated.
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Condition
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.ReferenceFilter
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.RejectFilter
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
- hashCode() - Method in class com.norconex.importer.handler.filter.impl.TextFilter
- hashCode() - Method in class com.norconex.importer.handler.HandlerConsumer
- hashCode() - Method in class com.norconex.importer.handler.HandlerDoc
- hashCode() - Method in class com.norconex.importer.handler.HandlerPredicate
- hashCode() - Method in class com.norconex.importer.handler.ScriptRunner
- hashCode() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- hashCode() - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
- hashCode() - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- hashCode() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- hashCode() - Method in class com.norconex.importer.handler.splitter.impl.XMLStreamSplitter
- hashCode() - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.CopyTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated.
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger.RenameDetails
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated.
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
- hashCode() - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
- hashCode() - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer.Replacement
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer.StripBetweenDetails
- hashCode() - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
- hashCode() - Method in class com.norconex.importer.ImporterConfig
- hashCode() - Method in class com.norconex.importer.ImporterEvent
- hashCode() - Method in class com.norconex.importer.ImporterRequest
- hashCode() - Method in class com.norconex.importer.parser.EmbeddedConfig
- hashCode() - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
- hashCode() - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
- hashCode() - Method in class com.norconex.importer.parser.impl.ExternalParser
- hashCode() - Method in class com.norconex.importer.parser.impl.xfdl.XFDLParser
- hashCode() - Method in class com.norconex.importer.parser.OCRConfig
- hashCode() - Method in class com.norconex.importer.parser.ParseHints
- hashCode() - Method in class com.norconex.importer.response.ImporterResponse
- hashCode() - Method in class com.norconex.importer.response.ImporterStatus
- hashCode() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- hasIncludes() - Method in class com.norconex.importer.handler.HandlerContext.IncludeMatchResolver
- HierarchyDetails() - Constructor for class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- HierarchyDetails(String, String, String, String) - Constructor for class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- HierarchyTagger - Class in com.norconex.importer.handler.tagger.impl
-
Given a separator, splits a field string into multiple segments representing each node of a hierarchical branch.
- HierarchyTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.HierarchyTagger
- HierarchyTagger.HierarchyDetails - Class in com.norconex.importer.handler.tagger.impl
- HOUR - com.norconex.importer.handler.condition.impl.DateCondition.TimeUnit
- HOUR - com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
- HTML_CONTENT_TYPES - Static variable in class com.norconex.importer.handler.CommonMatchers
-
HTML/XHTML content types.
- htmlContentTypes() - Static method in class com.norconex.importer.handler.CommonMatchers
-
Matcher for common content-types defining an HTML or XHTML document.
- htmlContentTypes(String) - Static method in class com.norconex.importer.handler.CommonRestrictions
-
Default content-types defining an HTML or XHTML document.
I
- IDocumentFilter - Interface in com.norconex.importer.handler.filter
-
Filters documents.
- IDocumentParser - Interface in com.norconex.importer.parser
-
Implementations are responsible for parsing a document to extract its text and metadata, as well as any embedded documents (when applicable).
- IDocumentParserFactory - Interface in com.norconex.importer.parser
-
Factory providing document parsers for documents.
- IDocumentSplitter - Interface in com.norconex.importer.handler.splitter
-
Responsible for splitting a single document into several ones.
- IDocumentTagger - Interface in com.norconex.importer.handler.tagger
-
Tags a document with extra metadata information, or manipulates existing metadata information.
- IDocumentTransformer - Interface in com.norconex.importer.handler.transformer
-
Transformers allow manipulating and modifying a document's metadata or content.
- IHintsAwareParser - Interface in com.norconex.importer.parser
-
Indicates that a parser can be initialized with generic parser configuration settings, which it will try to apply as best it can whenever possible.
- IImporterCondition - Interface in com.norconex.importer.handler.condition
-
A condition usually used in XML flow creation when configuring importer handlers.
- IImporterHandler - Interface in com.norconex.importer.handler
-
Identifies a class as being an import handler.
- IImporterResponseProcessor - Interface in com.norconex.importer.response
-
Processes an importer response to modify it or perform other actions as required before it is returned.
- IMAGE_IO_CONTENT_TYPES - Static variable in class com.norconex.importer.handler.CommonMatchers
-
Content types for natively supported Java ImageIO images.
- imageIOStandardContentTypes() - Static method in class com.norconex.importer.handler.CommonMatchers
-
Content types of standard image formats supported by all Java ImageIO implementations: JPEG, PNG, GIF, BMP, WBMP.
- imageIOStandardContentTypes(String) - Static method in class com.norconex.importer.handler.CommonRestrictions
-
Content types of standard image formats supported by all Java ImageIO implementations: JPEG, PNG, GIF, BMP, WBMP.
- ImageTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Transforms an image using common image operations.
- ImageTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.ImageTransformer
- importDocument(Doc) - Method in class com.norconex.importer.Importer
-
Imports a document according to the importer configuration.
- importDocument(ImporterRequest) - Method in class com.norconex.importer.Importer
-
Imports a document according to the importer configuration.
- importDocument(File, ContentType, String, Properties, String) - Method in class com.norconex.importer.Importer
-
Deprecated. Since 3.0.0, use Importer.importDocument(ImporterRequest).
- importDocument(File, Properties) - Method in class com.norconex.importer.Importer
-
Deprecated. Since 3.0.0, use Importer.importDocument(ImporterRequest).
- importDocument(InputStream, ContentType, String, Properties, String) - Method in class com.norconex.importer.Importer
-
Deprecated. Since 3.0.0, use Importer.importDocument(ImporterRequest).
- importDocument(InputStream, Properties, String) - Method in class com.norconex.importer.Importer
-
Deprecated. Since 3.0.0, use Importer.importDocument(ImporterRequest).
- IMPORTED_DATE - Static variable in class com.norconex.importer.doc.DocMetadata
-
Date processed by the Importer.
- Importer - Class in com.norconex.importer
-
Principal class responsible for importing documents.
- Importer() - Constructor for class com.norconex.importer.Importer
-
Creates a new importer with default configuration.
- Importer(ImporterConfig) - Constructor for class com.norconex.importer.Importer
-
Creates a new importer with the given configuration.
- Importer(ImporterConfig, EventManager) - Constructor for class com.norconex.importer.Importer
-
Creates a new importer with the given configuration.
- IMPORTER_HANDLER_BEGIN - Static variable in class com.norconex.importer.ImporterEvent
- IMPORTER_HANDLER_CONDITION_FALSE - Static variable in class com.norconex.importer.ImporterEvent
- IMPORTER_HANDLER_CONDITION_TRUE - Static variable in class com.norconex.importer.ImporterEvent
- IMPORTER_HANDLER_END - Static variable in class com.norconex.importer.ImporterEvent
- IMPORTER_HANDLER_ERROR - Static variable in class com.norconex.importer.ImporterEvent
- IMPORTER_PARSER_BEGIN - Static variable in class com.norconex.importer.ImporterEvent
- IMPORTER_PARSER_END - Static variable in class com.norconex.importer.ImporterEvent
- IMPORTER_PARSER_ERROR - Static variable in class com.norconex.importer.ImporterEvent
- ImporterConfig - Class in com.norconex.importer
-
Importer configuration.
- ImporterConfig() - Constructor for class com.norconex.importer.ImporterConfig
- ImporterEvent - Class in com.norconex.importer
-
An Importer event.
- ImporterEvent(ImporterEvent.Builder) - Constructor for class com.norconex.importer.ImporterEvent
-
New event.
- ImporterEvent.Builder - Class in com.norconex.importer
- ImporterException - Exception in com.norconex.importer
-
Exception thrown when an issue prevented the proper importation of a file.
- ImporterException() - Constructor for exception com.norconex.importer.ImporterException
- ImporterException(String) - Constructor for exception com.norconex.importer.ImporterException
- ImporterException(String, Throwable) - Constructor for exception com.norconex.importer.ImporterException
- ImporterException(Throwable) - Constructor for exception com.norconex.importer.ImporterException
- ImporterHandlerException - Exception in com.norconex.importer.handler
-
Exception thrown by several handler classes upon encountering issues.
- ImporterHandlerException() - Constructor for exception com.norconex.importer.handler.ImporterHandlerException
- ImporterHandlerException(String) - Constructor for exception com.norconex.importer.handler.ImporterHandlerException
- ImporterHandlerException(String, Throwable) - Constructor for exception com.norconex.importer.handler.ImporterHandlerException
- ImporterHandlerException(Throwable) - Constructor for exception com.norconex.importer.handler.ImporterHandlerException
- ImporterLauncher - Class in com.norconex.importer
-
Command line launcher of the Importer application.
- ImporterRequest - Class in com.norconex.importer
-
An Importer request, unique for each document to be imported.
- ImporterRequest(InputStream) - Constructor for class com.norconex.importer.ImporterRequest
- ImporterRequest(Path) - Constructor for class com.norconex.importer.ImporterRequest
- ImporterResponse - Class in com.norconex.importer.response
- ImporterResponse(Doc) - Constructor for class com.norconex.importer.response.ImporterResponse
- ImporterResponse(String, ImporterStatus) - Constructor for class com.norconex.importer.response.ImporterResponse
- ImporterRuntimeException - Exception in com.norconex.importer
-
RuntimeException thrown when an issue prevented the proper importation of a file.
- ImporterRuntimeException() - Constructor for exception com.norconex.importer.ImporterRuntimeException
- ImporterRuntimeException(String) - Constructor for exception com.norconex.importer.ImporterRuntimeException
- ImporterRuntimeException(String, Throwable) - Constructor for exception com.norconex.importer.ImporterRuntimeException
- ImporterRuntimeException(Throwable) - Constructor for exception com.norconex.importer.ImporterRuntimeException
- ImporterStatus - Class in com.norconex.importer.response
- ImporterStatus() - Constructor for class com.norconex.importer.response.ImporterStatus
- ImporterStatus(IDocumentFilter) - Constructor for class com.norconex.importer.response.ImporterStatus
- ImporterStatus(IDocumentFilter, String) - Constructor for class com.norconex.importer.response.ImporterStatus
- ImporterStatus(ImporterException) - Constructor for class com.norconex.importer.response.ImporterStatus
- ImporterStatus(ImporterException, String) - Constructor for class com.norconex.importer.response.ImporterStatus
- ImporterStatus(ImporterStatus.Status, String) - Constructor for class com.norconex.importer.response.ImporterStatus
- ImporterStatus.Status - Enum in com.norconex.importer.response
- INCLUDE - com.norconex.importer.handler.filter.OnMatch
- includeIfNull(OnMatch) - Static method in enum com.norconex.importer.handler.filter.OnMatch
- IncludeMatchResolver() - Constructor for class com.norconex.importer.handler.HandlerContext.IncludeMatchResolver
- initDefaultParsers() - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
- initialize(ParseHints) - Method in interface com.norconex.importer.parser.IHintsAwareParser
-
Initialize this parser with the given parse hints.
- initialize(ParseHints) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
- INSTANCE - Static variable in class com.norconex.importer.handler.filter.impl.RejectFilter
- IOnMatchFilter - Interface in com.norconex.importer.handler.filter
-
Tells the collector that a filter is of "OnMatch" type.
- isAppendHash() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- isApplicable(HandlerDoc, ParseState) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Method to be invoked by subclasses to find out whether this handler should be rejected based on the metadata restriction provided.
- isCaseSensitive() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Since 3.0.0, use DOMContentFilter.getValueMatcher()
- isCaseSensitive() - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- isCaseSensitive() - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- isCaseSensitive() - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- isCaseSensitive() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated. Whether the matching should be case sensitive or not.
- isCaseSensitive() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- isCaseSensitive() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
Deprecated. Since 3.0.0, use TextBetweenTagger.TextBetweenDetails.isCaseSensitive()
- isCaseSensitive() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Deprecated.
- isCaseSensitive() - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
-
Deprecated. Since 3.0.0, use ReduceConsecutivesTransformer.isIgnoreCase().
- isCaseSensitive() - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
-
Deprecated. Since 3.0.0.
- isCaseSensitive() - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
Deprecated. Since 3.0.0, use StripAfterTransformer.getStripAfterMatcher().
- isCaseSensitive() - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
Deprecated. Since 3.0.0, use StripBeforeTransformer.getStripBeforeMatcher().
- isCaseSensitive() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
Deprecated. Since 3.0.0, use StripBetweenTransformer.isCaseSensitive()
- isCaseSensitive() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- isDelete() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Gets whether to delete DOM attributes/elements matching the specified selector.
- isDeleteFromFields() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- isDetectHeading() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- isDiscardUnchanged() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Gets whether to discard values that did not change as a result of the replacement attempt.
- isDocumentMatched(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
- isDocumentMatched(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
- isDocumentMatched(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- isDocumentMatched(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated.
- isDocumentMatched(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
- isDocumentMatched(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.impl.EmptyFilter
- isDocumentMatched(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
Deprecated.
- isDocumentMatched(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- isDocumentMatched(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.impl.ReferenceFilter
- isDocumentMatched(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- isDocumentMatched(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- isEmpty() - Method in class com.norconex.importer.parser.EmbeddedConfig
- isEmpty() - Method in class com.norconex.importer.parser.OCRConfig
- isError() - Method in class com.norconex.importer.response.ImporterStatus
- isIgnoreCase() - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
-
Gets whether to ignore case.
- isIgnoreContent() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- isIgnoreNonTranslatedFields() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- isInclusive() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
Deprecated. Since 3.0.0, use TextBetweenTagger.TextBetweenDetails.isInclusive()
- isInclusive() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
- isInclusive() - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
- isInclusive() - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
- isInclusive() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
Deprecated. Since 3.0.0, use StripBetweenTransformer.StripBetweenDetails.isInclusive()
- isInclusive() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer.StripBetweenDetails
- isInputDisabled() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Gets whether to send the document content, regardless of whether the ${INPUT} token is part of the command.
- isKeepBadDates() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- isKeepEmptySegments() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- isKeepProbabilities() - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
- isLogContent() - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- isMatchAnyBlank() - Method in class com.norconex.importer.handler.condition.impl.BlankCondition
- isMatchBlanks() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Gets whether elements with blank values should be considered a match and have an empty string returned as opposed to nothing at all.
- isMatchBlanks() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
-
Gets whether elements with blank values should be considered a match and have an empty string returned as opposed to nothing at all.
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Deprecated. Since 3.0.0, use CurrentDateTagger.getOnSet().
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
Deprecated. Since 3.0.0, use DocumentLengthTagger.getOnSet().
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Deprecated. Since 3.0.0, use DOMTagger.DOMExtractDetails.getOnSet().
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
Deprecated. Since 3.0.0, use HierarchyTagger.HierarchyDetails.getOnSet().
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
Deprecated. Since 3.0.0, use TitleGeneratorTagger.getOnSet().
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
Deprecated. Since 3.0.0, use TruncateTagger.getOnSet().
- isOverwrite() - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
Deprecated. Since 3.0.0, use UUIDTagger.getOnSet().
- isParsed() - Method in class com.norconex.importer.ImporterEvent
- isPost() - Method in enum com.norconex.importer.parser.ParseState
- isPost(ParseState) - Static method in enum com.norconex.importer.parser.ParseState
- isPre() - Method in enum com.norconex.importer.parser.ParseState
- isPre(ParseState) - Static method in enum com.norconex.importer.parser.ParseState
- isRegex() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated.
- isRegex() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- isRegex() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- isRegex() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Deprecated. Since 3.0.0, use SplitTagger.SplitDetails.isSeparatorRegex() instead.
- isRejected() - Method in class com.norconex.importer.handler.HandlerContext
- isRejected() - Method in class com.norconex.importer.response.ImporterStatus
- isReplaceAll() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- isScaleStretch() - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- isSeparatorRegex() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Gets whether the separator value is a regular expression.
- isSingleValue() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- isSplitEmbedded() - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
-
Deprecated.
- isStringContentMatching(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
- isStringContentMatching(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- isStringContentMatching(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
- isStringContentMatching(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.filter.impl.TextFilter
- isSuccess() - Method in class com.norconex.importer.response.ImporterResponse
- isSuccess() - Method in class com.norconex.importer.response.ImporterStatus
- isTextDocumentMatching(HandlerDoc, Reader, ParseState) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
- isTextDocumentMatching(HandlerDoc, Reader, ParseState) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
- isUseFirstRowAsFields() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Whether to use the first row as field names for values.
- isWholeMatch() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- isWithHeaders() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- isWithOccurences() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
J
- JAVASCRIPT_ENGINE - Static variable in class com.norconex.importer.handler.ScriptRunner
K
- KeepOnlyTagger - Class in com.norconex.importer.handler.tagger.impl
-
Keep only the metadata fields provided, delete all other ones.
- KeepOnlyTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
L
- LANGUAGE - Static variable in class com.norconex.importer.doc.DocMetadata
-
Document language.
- LanguageTagger - Class in com.norconex.importer.handler.tagger.impl
-
Detects a document language based on Apache Tika language detection capability.
- LanguageTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.LanguageTagger
- launch(String[]) - Static method in class com.norconex.importer.ImporterLauncher
- loadCharStreamConditionFromXML(XML) - Method in class com.norconex.importer.handler.condition.AbstractCharStreamCondition
-
Loads configuration settings specific to the implementing class.
- loadCharStreamConditionFromXML(XML) - Method in class com.norconex.importer.handler.condition.AbstractStringCondition
- loadCharStreamConditionFromXML(XML) - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
- loadCharStreamFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
-
Loads configuration settings specific to the implementing class.
- loadCharStreamFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
- loadCharStreamTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
-
Loads configuration settings specific to the implementing class.
- loadCharStreamTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
- loadCharStreamTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
- loadCharStreamTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
- loadCharStreamTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
- loadCharStreamTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
- loadCharStreamTransformerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
-
Loads configuration settings specific to the implementing class.
- loadCharStreamTransformerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
- loadCharStreamTransformerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
- loadFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
- loadFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
- loadFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- loadFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated.
- loadFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
- loadFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.EmptyFilter
- loadFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
Deprecated.
- loadFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- loadFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.ReferenceFilter
- loadFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- loadFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- loadFromXML(XML) - Method in class com.norconex.importer.handler.AbstractImporterHandler
- loadFromXML(XML) - Method in class com.norconex.importer.handler.condition.AbstractCharStreamCondition
- loadFromXML(XML) - Method in class com.norconex.importer.handler.condition.impl.BlankCondition
- loadFromXML(XML) - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- loadFromXML(XML) - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
- loadFromXML(XML) - Method in class com.norconex.importer.handler.condition.impl.ReferenceCondition
- loadFromXML(XML) - Method in class com.norconex.importer.handler.HandlerConsumer
- loadFromXML(XML) - Method in class com.norconex.importer.handler.HandlerPredicate
- loadFromXML(XML) - Method in class com.norconex.importer.ImporterConfig
- loadFromXML(XML) - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
- loadFromXML(XML) - Method in class com.norconex.importer.parser.impl.ExternalParser
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Loads configuration settings specific to the implementing class.
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.ExternalHandler
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.splitter.impl.XMLStreamSplitter
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.CopyTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- loadHandlerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
- loadOnMatchFromXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
Deprecated. Convenience method for subclasses to load the "onMatch" attribute from an XML file when XML is used.
- loadStringConditionFromXML(XML) - Method in class com.norconex.importer.handler.condition.AbstractStringCondition
-
Loads configuration settings specific to the implementing class.
- loadStringConditionFromXML(XML) - Method in class com.norconex.importer.handler.condition.impl.ScriptCondition
- loadStringConditionFromXML(XML) - Method in class com.norconex.importer.handler.condition.impl.TextCondition
- loadStringFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
-
Loads configuration settings specific to the implementing class.
- loadStringFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- loadStringFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
- loadStringFilterFromXML(XML) - Method in class com.norconex.importer.handler.filter.impl.TextFilter
- loadStringTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
-
Loads configuration settings specific to the implementing class.
- loadStringTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
- loadStringTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
- loadStringTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
- loadStringTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
- loadStringTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated.
- loadStringTaggerFromXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- loadStringTransformerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
-
Loads configuration settings specific to the implementing class.
- loadStringTransformerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
- loadStringTransformerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
- loadStringTransformerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
- loadStringTransformerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
- loadStringTransformerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
- loadStringTransformerFromXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
- LOWER_EQUAL - com.norconex.importer.handler.filter.impl.DateMetadataFilter.Operator
- LOWER_EQUAL - com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Operator
- LOWER_THAN - com.norconex.importer.handler.filter.impl.DateMetadataFilter.Operator
- LOWER_THAN - com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Operator
- LUA_ENGINE - Static variable in class com.norconex.importer.handler.ScriptRunner
M
- main(String[]) - Static method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- main(String[]) - Static method in class com.norconex.importer.Importer
-
Invokes the importer from the command line.
- MatchDetails() - Constructor for class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated.
- MatchDetails(String, String, String) - Constructor for class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated.
- MAX_CONTENT_FROM_END_TO_CUT - Static variable in class com.norconex.importer.util.BufferUtil
- Merge() - Constructor for class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- MergeEmbeddedParser(Parser, Writer, Properties) - Constructor for class com.norconex.importer.parser.impl.AbstractTikaParser.MergeEmbeddedParser
- MergeTagger - Class in com.norconex.importer.handler.tagger.impl
-
Merge multiple metadata fields into a single one.
- MergeTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.MergeTagger
- MergeTagger.Merge - Class in com.norconex.importer.handler.tagger.impl
- META_FORMAT_JSON - Static variable in class com.norconex.importer.handler.ExternalHandler
- META_FORMAT_PROPERTIES - Static variable in class com.norconex.importer.handler.ExternalHandler
- META_FORMAT_XML - Static variable in class com.norconex.importer.handler.ExternalHandler
- MINUTE - com.norconex.importer.handler.condition.impl.DateCondition.TimeUnit
- MINUTE - com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
- modifyParseContext(ParseContext) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
-
Override to apply your own settings on the Tika ParseContext.
- MONTH - com.norconex.importer.handler.condition.impl.DateCondition.TimeUnit
- MONTH - com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
N
- NoContentTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Get rid of the content stream and optionally store it as text into a metadata field instead.
- NoContentTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.NoContentTransformer
- NOOP - com.norconex.importer.handler.tagger.impl.ConstantTagger.OnConflict
-
Deprecated.
- NumericCondition - Class in com.norconex.importer.handler.condition.impl
-
A condition based on the numeric value(s) of matching metadata fields, supporting decimals.
- NumericCondition() - Constructor for class com.norconex.importer.handler.condition.impl.NumericCondition
- NumericCondition(TextMatcher) - Constructor for class com.norconex.importer.handler.condition.impl.NumericCondition
- NumericCondition(TextMatcher, NumericCondition.ValueMatcher) - Constructor for class com.norconex.importer.handler.condition.impl.NumericCondition
- NumericCondition(TextMatcher, NumericCondition.ValueMatcher, NumericCondition.ValueMatcher) - Constructor for class com.norconex.importer.handler.condition.impl.NumericCondition
- NumericCondition.ValueMatcher - Class in com.norconex.importer.handler.condition.impl
- NumericMetadataFilter - Class in com.norconex.importer.handler.filter.impl
-
Accepts or rejects a document based on the numeric value(s) of matching metadata fields, supporting decimals.
- NumericMetadataFilter() - Constructor for class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- NumericMetadataFilter(TextMatcher) - Constructor for class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
Constructor.
- NumericMetadataFilter(TextMatcher, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- NumericMetadataFilter(String) - Constructor for class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
Deprecated. Since 3.0.0, use NumericMetadataFilter(TextMatcher)
- NumericMetadataFilter(String, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
Deprecated. Since 3.0.0, use NumericMetadataFilter(TextMatcher, OnMatch)
- NumericMetadataFilter.Condition - Class in com.norconex.importer.handler.filter.impl
- NumericMetadataFilter.Operator - Enum in com.norconex.importer.handler.filter.impl
O
- OCRConfig - Class in com.norconex.importer.parser
-
OCR configuration details.
- OCRConfig() - Constructor for class com.norconex.importer.parser.OCRConfig
-
Constructor.
- OnMatch - Enum in com.norconex.importer.handler.filter
-
Constants indicating the action to perform upon matching a condition.
P
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser.MergeEmbeddedParser
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser.SplitEmbbededParser
- parseDocument(Doc, Writer) - Method in interface com.norconex.importer.parser.IDocumentParser
-
Parses a document.
- parseDocument(Doc, Writer) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
- parseDocument(Doc, Writer) - Method in class com.norconex.importer.parser.impl.ExternalParser
- parseDocument(Doc, Writer) - Method in class com.norconex.importer.parser.impl.xfdl.XFDLParser
- ParseHints - Class in com.norconex.importer.parser
-
Configuration settings influencing how documents are parsed by various parsers.
- ParseHints() - Constructor for class com.norconex.importer.parser.ParseHints
- PARSER_HTML - Static variable in class com.norconex.importer.util.DOMUtil
- PARSER_XML - Static variable in class com.norconex.importer.util.DOMUtil
- parseState(ParseState) - Method in class com.norconex.importer.ImporterEvent.Builder
- ParseState - Enum in com.norconex.importer.parser
-
Acts as a flag indicating whether a document has been parsed in a given process flow.
- parseZonedDateTimeString(String, String, Locale, String, ZoneId) - Static method in class com.norconex.importer.util.FormatUtil
-
Formats a string representation of a date into another string date format.
- passes() - Method in class com.norconex.importer.handler.HandlerContext.IncludeMatchResolver
- PDFPageSplitter - Class in com.norconex.importer.handler.splitter.impl
-
Splits PDF pages so that each page is treated as an individual document.
- PDFPageSplitter() - Constructor for class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- POST - com.norconex.importer.parser.ParseState
- PRE - com.norconex.importer.parser.ParseState
- processImporterResponse(ImporterResponse) - Method in interface com.norconex.importer.response.IImporterResponseProcessor
-
Processes the importer response.
R
- ReduceConsecutivesTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Reduces specified consecutive characters or strings to only one instance (document content only).
- ReduceConsecutivesTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
- REFERENCE - Static variable in class com.norconex.importer.doc.DocMetadata
-
Document unique reference (also called "id", "primary key", etc.).
- ReferenceCondition - Class in com.norconex.importer.handler.condition.impl
-
A condition based on a text pattern matching a document reference (e.g.
- ReferenceCondition() - Constructor for class com.norconex.importer.handler.condition.impl.ReferenceCondition
- ReferenceCondition(TextMatcher) - Constructor for class com.norconex.importer.handler.condition.impl.ReferenceCondition
- ReferenceFilter - Class in com.norconex.importer.handler.filter.impl
-
Accepts or rejects a document based on its reference (e.g.
- ReferenceFilter() - Constructor for class com.norconex.importer.handler.filter.impl.ReferenceFilter
- ReferenceFilter(TextMatcher) - Constructor for class com.norconex.importer.handler.filter.impl.ReferenceFilter
- RegexContentFilter - Class in com.norconex.importer.handler.filter.impl
-
Deprecated. Since 3.0.0, use TextFilter instead.
- RegexContentFilter() - Constructor for class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- RegexContentFilter(String) - Constructor for class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- RegexContentFilter(String, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- RegexContentFilter(String, OnMatch, boolean) - Constructor for class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- RegexFieldExtractor - Class in com.norconex.importer.util.regex
-
Deprecated. Since 3.0.0, use RegexFieldValueExtractor from Norconex Commons Lang
- RegexFieldExtractor() - Constructor for class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- RegexFieldExtractor(String) - Constructor for class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- RegexFieldExtractor(String, int, int) - Constructor for class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- RegexFieldExtractor(String, String) - Constructor for class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- RegexFieldExtractor(String, String, int) - Constructor for class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- RegexMetadataFilter - Class in com.norconex.importer.handler.filter.impl
-
Deprecated. Since 3.0.0, use TextFilter instead.
- RegexMetadataFilter() - Constructor for class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- RegexMetadataFilter(String, String) - Constructor for class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- RegexMetadataFilter(String, String, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- RegexMetadataFilter(String, String, OnMatch, boolean) - Constructor for class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- RegexReferenceFilter - Class in com.norconex.importer.handler.filter.impl
-
Deprecated. Since 3.0.0, use ReferenceFilter instead.
- RegexReferenceFilter() - Constructor for class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- RegexReferenceFilter(String) - Constructor for class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- RegexReferenceFilter(String, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- RegexReferenceFilter(String, OnMatch, boolean) - Constructor for class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- RegexTagger - Class in com.norconex.importer.handler.tagger.impl
-
Extracts field names and their values using regular expressions.
- RegexTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.RegexTagger
- RegexUtil - Class in com.norconex.importer.util.regex
-
Deprecated. Since 3.0.0, use RegexFieldValueExtractor from Norconex Commons Lang
- registerParser(ContentType, IDocumentParser) - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
Registers a parser to use for the given content type.
- REJECTED - com.norconex.importer.response.ImporterStatus.Status
- RejectFilter - Class in com.norconex.importer.handler.filter.impl
-
Rejects a document.
- RejectFilter() - Constructor for class com.norconex.importer.handler.filter.impl.RejectFilter
- removeAllConditions() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Removes all conditions from this filter.
- removeCondition(DateMetadataFilter.Condition) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Removes a condition, if it is part of the already defined conditions.
- removeConstant(String) - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
- removeDOMExtractDetails(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Removes the DOM extraction details matching the given selector.
- removeDOMExtractDetails(String) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
-
Removes the DOM extraction details matching the given selector.
- removeDOMExtractDetailsList() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
-
Removes all DOM extraction details.
- removeField(String) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
Deprecated. Since 3.0.0, use DeleteTagger.setFieldMatcher(TextMatcher)
- removeField(String) - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
Deprecated. Since 3.0.0, use KeepOnlyTagger.setFieldMatcher(TextMatcher)
- removeMatchDetails(CountMatchesTagger.MatchDetails) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Deprecated. Since 3.0.0, this method does nothing.
- removeNestedResponse(String) - Method in class com.norconex.importer.response.ImporterResponse
- removeReplacement(String) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
-
Deprecated.
- removeRestriction(PropertyMatcher) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Removes a restriction.
- removeRestriction(String) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Removes all restrictions on a given field.
- removeSingleValueField(String) - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
-
Deprecated. Since 3.0.0, use ForceSingleValueTagger.setFieldMatcher(TextMatcher).
- removeSplit(String) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
-
Deprecated.
- removeSplitDetails(String) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
- RenameDetails(TextMatcher, String, PropertySetter) - Constructor for class com.norconex.importer.handler.tagger.impl.RenameTagger.RenameDetails
- RenameDetails(String, String, boolean) - Constructor for class com.norconex.importer.handler.tagger.impl.RenameTagger.RenameDetails
-
Deprecated.
- RenameDetails(String, String, boolean, boolean) - Constructor for class com.norconex.importer.handler.tagger.impl.RenameTagger.RenameDetails
-
Deprecated.
- RenameTagger - Class in com.norconex.importer.handler.tagger.impl
-
Renames metadata fields to different names.
- RenameTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.RenameTagger
- RenameTagger.RenameDetails - Class in com.norconex.importer.handler.tagger.impl
- REPLACE - com.norconex.importer.handler.tagger.impl.ConstantTagger.OnConflict
-
Deprecated.
- Replacement() - Constructor for class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
- Replacement() - Constructor for class com.norconex.importer.handler.transformer.impl.ReplaceTransformer.Replacement
- Replacement(TextMatcher, TextMatcher, String, String) - Constructor for class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
- Replacement(TextMatcher, TextMatcher, String, String) - Constructor for class com.norconex.importer.handler.transformer.impl.ReplaceTransformer.Replacement
- Replacement(String, String, String, String) - Constructor for class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- ReplaceTagger - Class in com.norconex.importer.handler.tagger.impl
-
Replaces an existing metadata value with another one.
- ReplaceTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.ReplaceTagger
- ReplaceTagger.Replacement - Class in com.norconex.importer.handler.tagger.impl
- ReplaceTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Replaces every occurrence of the given replacements (document content only).
- ReplaceTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
- ReplaceTransformer.Replacement - Class in com.norconex.importer.handler.transformer.impl
S
- saveCharStreamConditionToXML(XML) - Method in class com.norconex.importer.handler.condition.AbstractCharStreamCondition
-
Saves configuration settings specific to the implementing class.
- saveCharStreamConditionToXML(XML) - Method in class com.norconex.importer.handler.condition.AbstractStringCondition
- saveCharStreamConditionToXML(XML) - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
- saveCharStreamFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
-
Saves configuration settings specific to the implementing class.
- saveCharStreamFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
- saveCharStreamTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
-
Saves configuration settings specific to the implementing class.
- saveCharStreamTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
- saveCharStreamTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
- saveCharStreamTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
- saveCharStreamTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
- saveCharStreamTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
- saveCharStreamTransformerToXML(XML) - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
-
Saves configuration settings specific to the implementing class.
- saveCharStreamTransformerToXML(XML) - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
- saveCharStreamTransformerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
- saveFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
- saveFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
- saveFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- saveFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated.
- saveFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
- saveFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.EmptyFilter
- saveFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
Deprecated.
- saveFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- saveFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.ReferenceFilter
- saveFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- saveFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.AbstractImporterHandler
-
Saves configuration settings specific to the implementing class.
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.ExternalHandler
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.splitter.impl.XMLStreamSplitter
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.CopyTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- saveHandlerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
- saveOnMatchToXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
Deprecated. Convenience method for subclasses to save the "onMatch" attribute to an XML file when XML is used.
- saveStringConditionToXML(XML) - Method in class com.norconex.importer.handler.condition.AbstractStringCondition
-
Saves configuration settings specific to the implementing class.
- saveStringConditionToXML(XML) - Method in class com.norconex.importer.handler.condition.impl.ScriptCondition
- saveStringConditionToXML(XML) - Method in class com.norconex.importer.handler.condition.impl.TextCondition
- saveStringFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
-
Saves configuration settings specific to the implementing class.
- saveStringFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- saveStringFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
- saveStringFilterToXML(XML) - Method in class com.norconex.importer.handler.filter.impl.TextFilter
- saveStringTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
-
Saves configuration settings specific to the implementing class.
- saveStringTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
- saveStringTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
- saveStringTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
- saveStringTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
- saveStringTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated.
- saveStringTaggerToXML(XML) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- saveStringTransformerToXML(XML) - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
-
Saves configuration settings specific to the implementing class.
- saveStringTransformerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
- saveStringTransformerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
- saveStringTransformerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
- saveStringTransformerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
- saveStringTransformerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
- saveStringTransformerToXML(XML) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
- saveToXML(XML) - Method in class com.norconex.importer.handler.AbstractImporterHandler
- saveToXML(XML) - Method in class com.norconex.importer.handler.condition.AbstractCharStreamCondition
- saveToXML(XML) - Method in class com.norconex.importer.handler.condition.impl.BlankCondition
- saveToXML(XML) - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- saveToXML(XML) - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
- saveToXML(XML) - Method in class com.norconex.importer.handler.condition.impl.ReferenceCondition
- saveToXML(XML) - Method in class com.norconex.importer.handler.HandlerConsumer
- saveToXML(XML) - Method in class com.norconex.importer.handler.HandlerPredicate
- saveToXML(XML) - Method in class com.norconex.importer.ImporterConfig
- saveToXML(XML) - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
- saveToXML(XML) - Method in class com.norconex.importer.parser.impl.ExternalParser
- ScriptCondition - Class in com.norconex.importer.handler.condition.impl
-
A condition formulated using a scripting language.
- ScriptCondition() - Constructor for class com.norconex.importer.handler.condition.impl.ScriptCondition
- ScriptFilter - Class in com.norconex.importer.handler.filter.impl
-
Filters incoming documents using a scripting language.
- ScriptFilter() - Constructor for class com.norconex.importer.handler.filter.impl.ScriptFilter
- ScriptRunner<T> - Class in com.norconex.importer.handler
-
Runs scripts written in a programming language supported by the provided script engine.
- ScriptRunner() - Constructor for class com.norconex.importer.handler.ScriptRunner
- ScriptRunner(String) - Constructor for class com.norconex.importer.handler.ScriptRunner
- ScriptTagger - Class in com.norconex.importer.handler.tagger.impl
-
Tags incoming documents using a scripting language.
- ScriptTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.ScriptTagger
- ScriptTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Transforms incoming documents using a scripting language.
- ScriptTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.ScriptTransformer
- SECOND - com.norconex.importer.handler.condition.impl.DateCondition.TimeUnit
- SECOND - com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
- setAction(String) - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
-
Sets the action.
- setApi(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setApiKey(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setAppendHash(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- setApplyTo(String) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Sets whether to apply the case transformation to fields, values, or both.
- setAtLeastOneIncludeMatch(boolean) - Method in class com.norconex.importer.handler.HandlerContext.IncludeMatchResolver
- setBegin(long) - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
-
Sets the beginning index (inclusive).
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Since 3.0.0, use DOMContentFilter.getValueMatcher()
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated. Sets whether to do a case-sensitive match or not.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
Deprecated. Since 3.0.0, use TextBetweenTagger.TextBetweenDetails.setCaseSensitive(boolean)
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Deprecated.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
-
Deprecated. Since 3.0.0, use ReduceConsecutivesTransformer.setIgnoreCase(boolean).
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
-
Deprecated. Since 3.0.0.
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
Deprecated. Since 3.0.0, use StripAfterTransformer.setStripAfterMatcher(TextMatcher).
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
Deprecated. Since 3.0.0, use StripBeforeTransformer.setStripBeforeMatcher(TextMatcher).
- setCaseSensitive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
Deprecated. Since 3.0.0, use StripBetweenTransformer.setCaseSensitive(boolean)
- setCaseSensitive(boolean) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- setCaseType(String) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Sets the type of character case transformation.
- setClientId(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setClientSecret(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setCommand(String) - Method in class com.norconex.importer.handler.ExternalHandler
-
Sets the command to execute.
- setCommand(String) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets the command to execute.
- setCommand(String) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets the command to execute.
- setCommand(String) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets the command to execute.
- setConditions(NumericMetadataFilter.Condition...) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- setConditions(List<DateMetadataFilter.Condition>) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Sets a list of conditions, overwriting any existing ones in this filter.
- setContent(CachedInputStream) - Method in class com.norconex.importer.doc.Doc
-
Deprecated. Since 3.0.0, use Doc.setInputStream(InputStream)
- setContentColumns(String...) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- setContentColumns(List<String>) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Sets content columns.
- setContentEncoding(String) - Method in class com.norconex.importer.doc.DocInfo
- setContentEncoding(String) - Method in class com.norconex.importer.ImporterRequest
- setContentType(ContentType) - Method in class com.norconex.importer.doc.DocInfo
- setContentType(ContentType) - Method in class com.norconex.importer.ImporterRequest
- setContentTypeMatcher(TextMatcher) - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
-
Sets this condition content-type matcher.
- setContentTypes(String) - Method in class com.norconex.importer.parser.OCRConfig
-
Sets the regular expression matching content types to restrict OCR to.
- setCountMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Sets the count matcher.
- setCropRectangle(Rectangle) - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- setDefaultValue(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- setDefaultValue(String) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- setDelete(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Sets whether to delete DOM attributes/elements matching the specified selector.
- setDeleteFromFields(boolean) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- setDetectHeading(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- setDetectHeadingMaxLength(int) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- setDetectHeadingMinLength(int) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- setDiscardUnchanged(boolean) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets whether to discard values that did not change as a result of the replacement attempt.
- setDocZoneId(ZoneId) - Method in class com.norconex.importer.handler.condition.impl.DateCondition
-
Sets the time zone ID that documents are considered to be in.
- setDocZoneId(ZoneId) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Sets the time zone ID that documents are considered to be in.
- setEmbeddedParentReferences(List<String>) - Method in class com.norconex.importer.doc.DocInfo
- setEnd(long) - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
-
Sets the end index (exclusive).
- setEndMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Sets the end delimiter matcher for text to extract.
- setEndMatcher(TextMatcher) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer.StripBetweenDetails
-
Sets the end delimiter matcher for text to strip.
- setEngineName(String) - Method in class com.norconex.importer.handler.condition.impl.ScriptCondition
- setEngineName(String) - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
- setEngineName(String) - Method in class com.norconex.importer.handler.ScriptRunner
- setEngineName(String) - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
- setEngineName(String) - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
- setEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.handler.ExternalHandler
-
Sets the environment variables.
- setEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets the environment variables.
- setEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets the environment variables.
- setEnvironmentVariables(Map<String, String>) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets the environment variables.
- setEscapeCharacter(char) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Sets the escape character.
- setExtract(String) - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
-
Sets what should be extracted for the value.
- setExtract(String) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Sets what should be extracted for the value.
- setExtract(String) - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
-
Sets what should be extracted for the value.
- setExtract(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- setExtract(String) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- setFallbackLanguage(String) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
Sets the fallback language to use when none is detected.
- setField(String) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
-
Deprecated. Since 3.0.0, use DateMetadataFilter.setFieldMatcher(TextMatcher)
- setField(String) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
-
Deprecated. Since 3.0.0, use NumericMetadataFilter.setFieldMatcher(TextMatcher)
- setField(String) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- setField(String) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Deprecated. Since 3.0.0, use CurrentDateTagger.setToField(String)
- setField(String) - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
Deprecated. Since 3.0.0, use DocumentLengthTagger.setToField(String)
- setField(String) - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
Deprecated. Since 3.0.0, use UUIDTagger.setToField(String)
- setField(String) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- setFieldGroup(int) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.condition.impl.BlankCondition
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.condition.impl.DateCondition
-
Sets the text matcher of field names.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
-
Sets this condition field matcher.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
-
Sets the text matcher of field names.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.condition.impl.TextCondition
-
Sets the text matcher of field names.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
-
Sets this filter field matcher (copy).
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.filter.impl.EmptyFilter
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.filter.impl.TextFilter
-
Sets the text matcher of field names.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
-
Sets field matcher.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
Sets the field matcher (copy).
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Sets the field matcher.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
Sets the field matcher for fields to delete.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
-
Sets field matcher.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
Sets field matcher.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
Sets field matcher.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
-
Sets source field matcher for fields on which to extract fields/values.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets field matcher.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Sets the field matcher for fields to split.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Sets field matcher for fields on which to extract values.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
-
Sets the field matcher for fields to split.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
Sets the field matcher for fields to truncate.
- setFieldMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
-
Sets the field matcher for fields containing text.
- setFieldName(String) - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
-
Deprecated. Since 3.0.0, use TextStatisticsTagger.setFieldMatcher(TextMatcher).
- setFields(String...) - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
Deprecated.
- setFields(List<String>) - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
Deprecated.
- setFields(List<String>) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
Deprecated. Since 3.0.0, use DeleteTagger.setFieldMatcher(TextMatcher).
- setFieldsRegex(String) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
-
Deprecated. Since 3.0.0, use CharsetTagger.setFieldMatcher(TextMatcher).
- setFieldsRegex(String) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
-
Deprecated. Since 3.0.0, use DeleteTagger.setFieldMatcher(TextMatcher).
- setFieldsRegex(String) - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
-
Deprecated. Since 3.0.0, use KeepOnlyTagger.setFieldMatcher(TextMatcher).
- setFieldsToTranslate(String...) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setFieldsToTranslate(List<String>) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setFile(Path) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- setFormat(String) - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- setFormat(String) - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- setFormat(String) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated. Sets the field with the value we want to perform matches on.
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Sets optional source field holding the HTML content to apply DOM extraction to.
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Deprecated. Since 3.0.0, use SplitTagger.SplitDetails.setFieldMatcher(TextMatcher) instead.
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- setFromField(String) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
Deprecated. Since 3.0.0, use TruncateTagger.setFieldMatcher(TextMatcher) instead.
- setFromFields(List<String>) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
Deprecated. Since 3.0.0, use MergeTagger.Merge.setFieldMatcher(TextMatcher).
- setFromFieldsRegex(String) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
-
Deprecated. Since 3.0.0, use MergeTagger.Merge.setFieldMatcher(TextMatcher).
- setFromFormats(String...) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Sets the source date formats to match.
- setFromFormats(List<String>) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Sets the source date formats to match.
- setFromLocale(Locale) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Sets the locale used for parsing the source date.
- setFromSeparator(String) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- setFromValue(String) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- setHandler(IImporterHandler) - Method in class com.norconex.importer.handler.HandlerConsumer
- setHasIncludes(boolean) - Method in class com.norconex.importer.handler.HandlerContext.IncludeMatchResolver
- setIgnoreCase(boolean) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
-
Sets whether to ignore case.
- setIgnoreContent(boolean) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setIgnoredContentTypesRegex(String) - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
-
Sets the regular expression matching content types to ignore (i.e., do not perform parsing on them).
- setIgnoreNonTranslatedFields(boolean) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setImporterStatus(ImporterStatus) - Method in class com.norconex.importer.response.ImporterResponse
- setInclusive(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
-
Deprecated. Since 3.0.0, use TextBetweenTagger.TextBetweenDetails.setInclusive(boolean).
- setInclusive(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
- setInclusive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
Sets whether the match itself should be stripped or not.
- setInclusive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
Sets whether the match itself should be stripped or not.
- setInclusive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
-
Deprecated. Since 3.0.0, use StripBetweenTransformer.StripBetweenDetails.setInclusive(boolean).
- setInclusive(boolean) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer.StripBetweenDetails
- setInputDisabled(boolean) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets whether to send the document content or not, regardless of whether the ${INPUT} token is part of the command.
- setInputStream(InputStream) - Method in class com.norconex.importer.doc.Doc
- setKeepBadDates(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- setKeepEmptySegments(boolean) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- setKeepProbabilities(boolean) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
Sets whether to keep the match probabilities for each language detected.
- setLanguages(String) - Method in class com.norconex.importer.parser.OCRConfig
-
Sets the languages to be used by OCR.
- setLanguages(List<String>) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
-
Sets the language candidates for the language detection.
- setLinesToSkip(int) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Sets how many lines to skip before starting to parse lines.
- setLocale(Locale) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Sets the locale used for formatting.
- setLogContent(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- setLogFields(List<String>) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- setLogLevel(String) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- setMatchAnyBlank(boolean) - Method in class com.norconex.importer.handler.condition.impl.BlankCondition
- setMatchBlanks(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Sets whether elements with blank values should be considered a match and have an empty string returned as opposed to nothing at all.
- setMatchBlanks(boolean) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
-
Sets whether elements with blank values should be considered a match and have an empty string returned as opposed to nothing at all.
- setMaxFileCacheSize(long) - Method in class com.norconex.importer.ImporterConfig
-
Deprecated. Since 3.0.0, use ImporterConfig.setMaxMemoryInstance(long).
- setMaxFilePoolCacheSize(long) - Method in class com.norconex.importer.ImporterConfig
-
Deprecated. Since 3.0.0, use ImporterConfig.setMaxMemoryPool(long).
- setMaxLength(int) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- setMaxMemoryInstance(long) - Method in class com.norconex.importer.ImporterConfig
-
Sets the maximum number of bytes used for memory caching of a single document being processed.
- setMaxMemoryPool(long) - Method in class com.norconex.importer.ImporterConfig
-
Sets the maximum number of bytes used for memory caching of data for all documents concurrently being processed.
- setMaxReadSize(int) - Method in class com.norconex.importer.handler.condition.AbstractStringCondition
-
Sets the maximum number of characters to read for filtering at once.
- setMaxReadSize(int) - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
-
Sets the maximum number of characters to read for filtering at once.
- setMaxReadSize(int) - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
-
Sets the maximum number of characters to read from content for tagging at once.
- setMaxReadSize(int) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Sets the maximum number of characters to read from content for tagging at once.
- setMaxReadSize(int) - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
-
Sets the maximum number of characters to read from content for tagging at once.
- setMaxReadSize(int) - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
-
Sets the maximum number of characters to read and transform at once.
- setMaxSamples(int) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- setMetadata(Properties) - Method in class com.norconex.importer.ImporterRequest
- setMetadataExtractionPatterns(RegexFieldValueExtractor...) - Method in class com.norconex.importer.handler.ExternalHandler
-
Sets metadata extraction patterns.
- setMetadataExtractionPatterns(RegexFieldValueExtractor...) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets metadata extraction patterns.
- setMetadataExtractionPatterns(RegexFieldValueExtractor...) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets metadata extraction patterns.
- setMetadataExtractionPatterns(RegexFieldValueExtractor...) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets metadata extraction patterns.
- setMetadataInputFormat(String) - Method in class com.norconex.importer.handler.ExternalHandler
-
Sets the format of the metadata input file sent to the external application.
- setMetadataInputFormat(String) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets the format of the metadata input file sent to the external application.
- setMetadataInputFormat(String) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets the format of the metadata input file sent to the external application.
- setMetadataInputFormat(String) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets the format of the metadata input file sent to the external application.
- setMetadataOutputFormat(String) - Method in class com.norconex.importer.handler.ExternalHandler
-
Sets the format of the metadata output file from the external application.
- setMetadataOutputFormat(String) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets the format of the metadata output file from the external application.
- setMetadataOutputFormat(String) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets the format of the metadata output file from the external application.
- setMetadataOutputFormat(String) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets the format of the metadata output file from the external application.
- setNoExtractContainerContentTypes(String) - Method in class com.norconex.importer.parser.EmbeddedConfig
- setNoExtractEmbeddedContentTypes(String) - Method in class com.norconex.importer.parser.EmbeddedConfig
- setOCRConfig(OCRConfig) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
-
Deprecated.
- setOnConflict(ConstantTagger.OnConflict) - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
-
Deprecated. Since 3.0.0, use ConstantTagger.setOnSet(PropertySetter) instead.
- setOnMatch(OnMatch) - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
- setOnMatch(OnMatch) - Method in class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
Deprecated.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.ExternalHandler
-
Sets the property setter to use when a metadata value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets the property setter to use when a metadata value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
Sets the property setter to use when a value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets the property setter to use when a metadata value is set.
- setOnSet(PropertySetter) - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
- setOnSet(PropertySetter) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets the property setter to use when a metadata value is set.
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Deprecated. Since 3.0.0, use CurrentDateTagger.setOnSet(PropertySetter).
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
Deprecated. Since 3.0.0, use DocumentLengthTagger.setOnSet(PropertySetter).
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
-
Deprecated. Since 3.0.0, use DOMTagger.DOMExtractDetails.setOnSet(PropertySetter).
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
-
Deprecated. Since 3.0.0, use HierarchyTagger.HierarchyDetails.setOnSet(PropertySetter).
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
-
Deprecated. Since 3.0.0, use TitleGeneratorTagger.setOnSet(PropertySetter).
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
Deprecated. Since 3.0.0, use TruncateTagger.setOnSet(PropertySetter).
- setOverwrite(boolean) - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
Deprecated. Since 3.0.0, use UUIDTagger.setOnSet(PropertySetter).
- setParseErrorsSaveDir(Path) - Method in class com.norconex.importer.ImporterConfig
-
Sets the directory where files generating parsing errors will be saved.
- setParser(String) - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
-
Sets the parser to use when creating the DOM-tree.
- setParser(String) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Sets the parser to use when creating the DOM-tree.
- setParser(String) - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
-
Sets the parser to use when creating the DOM-tree.
- setParser(String) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
Sets the parser to use when creating the DOM-tree.
- setParser(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Sets the parser to use when creating the DOM-tree.
- setParser(String) - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
-
Sets the parser to use when creating the DOM-tree.
- setParser(String) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
-
Sets the parser to use when creating the DOM-tree.
- setParserFactory(IDocumentParserFactory) - Method in class com.norconex.importer.ImporterConfig
- setPath(String) - Method in class com.norconex.importer.handler.splitter.impl.XMLStreamSplitter
- setPath(String) - Method in class com.norconex.importer.parser.OCRConfig
-
Sets the Tesseract OCR engine executable file path.
- setPattern(RegexFieldValueExtractor...) - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
-
Sets one or more patterns that will extract matching field names/values.
- setPattern(RegexFieldValueExtractor...) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated. Sets one or more patterns that will extract matching field names/values.
- setPostParseConsumer(Consumer<HandlerContext>) - Method in class com.norconex.importer.ImporterConfig
-
Sets the Consumer to be executed on documents after their parsing has occurred.
- setPostParseHandlers(List<IImporterHandler>) - Method in class com.norconex.importer.ImporterConfig
-
Deprecated. Since 3.0.0, use ImporterConfig.setPostParseConsumer(Consumer) instead.
- setPrefix(String) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
-
Sets the prefix to print before the actual log message.
- setPreParseConsumer(Consumer<HandlerContext>) - Method in class com.norconex.importer.ImporterConfig
-
Sets the Consumer to be executed on documents before their parsing has occurred.
- setPreParseHandlers(List<IImporterHandler>) - Method in class com.norconex.importer.ImporterConfig
-
Deprecated. Since 3.0.0, use ImporterConfig.setPreParseConsumer(Consumer) instead.
- setQuoteCharacter(char) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Sets the value's surrounding quotes character.
- setReductions(String...) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
- setReference(String) - Method in class com.norconex.importer.doc.DocInfo
- setReference(String) - Method in class com.norconex.importer.ImporterRequest
- setReferenceColumn(String) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- setReferencePagePrefix(String) - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- setRegex(boolean) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated. Sets whether the value to match is a regular expression.
- setRegex(boolean) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- setRegex(boolean) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- setRegex(boolean) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Deprecated. Since 3.0.0, use SplitTagger.SplitDetails.setSeparatorRegex(boolean) instead.
- setRegex(String) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Since 3.0.0, use DOMContentFilter.getValueMatcher().
- setRegex(String) - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- setRegex(String) - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- setRegex(String) - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- setRegex(String) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- setRejectedBy(IDocumentFilter) - Method in class com.norconex.importer.handler.HandlerContext
- setReplaceAll(boolean) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- setReplacements(List<ReplaceTransformer.Replacement>) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
- setResponseProcessors(List<IImporterResponseProcessor>) - Method in class com.norconex.importer.ImporterConfig
- setRotateDegrees(Double) - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- setScaleDimension(Dimension) - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- setScaleFactor(Double) - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- setScaleStretch(boolean) - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- setScript(String) - Method in class com.norconex.importer.handler.condition.impl.ScriptCondition
- setScript(String) - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
- setScript(String) - Method in class com.norconex.importer.handler.ScriptRunner
- setScript(String) - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
- setScript(String) - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
- setScriptPath(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setSelector(String) - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
- setSelector(String) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated.
- setSelector(String) - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
- setSelector(String) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
- setSelector(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- setSelector(String) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- setSelectors(List<String>) - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
- setSeparator(String) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
- setSeparatorCharacter(char) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Sets the value-separator character.
- setSeparatorRegex(boolean) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Sets whether the separator value is a regular expression.
- setSingleValue(boolean) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- setSingleValueSeparator(String) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- setSmtPath(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setSourceCharset(String) - Method in class com.norconex.importer.handler.condition.AbstractCharStreamCondition
-
Sets the presumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
- setSourceCharset(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- setSourceCharset(String) - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
-
Sets the assumed source character encoding.
- setSourceCharset(String) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
-
Sets the assumed source character encoding.
- setSourceLanguage(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setSourceLanguageField(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setSplitContentTypes(String) - Method in class com.norconex.importer.parser.EmbeddedConfig
- setSplitEmbedded(boolean) - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
-
Deprecated.
- setStartMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Sets the start delimiter matcher for text to extract.
- setStartMatcher(TextMatcher) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer.StripBetweenDetails
-
Sets the start delimiter matcher for text to strip.
- setStripAfterMatcher(TextMatcher) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
Sets the matcher for the text from which to strip content.
- setStripAfterRegex(String) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
-
Deprecated. Since 3.0.0, use StripAfterTransformer.setStripAfterMatcher(TextMatcher).
- setStripBeforeMatcher(TextMatcher) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
Sets the matcher for the text up to which to strip content.
- setStripBeforeRegex(String) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
-
Deprecated. Since 3.0.0, use StripBeforeTransformer.setStripBeforeMatcher(TextMatcher).
- setSuffix(String) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- setTargetCharset(String) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
- setTargetCharset(String) - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- setTargetFormat(String) - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- setTargetLanguages(String...) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setTargetLanguages(List<String>) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setTempDir(Path) - Method in class com.norconex.importer.handler.ExternalHandler
-
Sets the directory in which to store temporary files sent to the external handler as file paths.
- setTempDir(Path) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
-
Sets the directory in which to store temporary files used for transformation.
- setTempDir(Path) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
-
Sets the directory in which to store temporary files used for transformation.
- setTempDir(Path) - Method in class com.norconex.importer.ImporterConfig
-
Sets the temporary directory where files can be deleted safely by the OS or any other processes when the Importer is not running.
- setTempDir(Path) - Method in class com.norconex.importer.parser.impl.ExternalParser
-
Sets the directory in which to store temporary files used for transformation.
- setTitleMaxLength(int) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated. Sets the field to store the match count.
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
-
Sets the target field.
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
-
Sets the target field.
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
-
Sets the target field.
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets the field to store the replaced value.
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Sets the target field for extracted text.
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
- setToField(String) - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
Sets the target field.
- setToField(String) - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
- setToFormat(String) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- setToLocale(Locale) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
-
Sets the locale used for formatting the source date.
- setToSeparator(String) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- setToValue(String) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets the replacement value.
- setToValue(String) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer.Replacement
- setTruncateSamplesAt(int) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- setUseFirstRowAsFields(boolean) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
-
Sets whether to use the first row as field names for values.
- setUserKey(String) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- setValue(String) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated. Sets the text or regular expression to match.
- setValueGroup(int) - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- setValueMatcher(TextMatcher) - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
-
Sets this condition's value matcher.
- setValueMatcher(TextMatcher) - Method in class com.norconex.importer.handler.condition.impl.ReferenceCondition
-
Sets the text matcher for field values.
- setValueMatcher(TextMatcher) - Method in class com.norconex.importer.handler.condition.impl.TextCondition
-
Sets the text matcher for content or field values.
- setValueMatcher(TextMatcher) - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated. Sets this filter's text matcher (copy).
- setValueMatcher(TextMatcher) - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
-
Sets this filter's value matcher (copy).
- setValueMatcher(TextMatcher) - Method in class com.norconex.importer.handler.filter.impl.ReferenceFilter
-
Sets the text matcher for field values.
- setValueMatcher(TextMatcher) - Method in class com.norconex.importer.handler.filter.impl.TextFilter
-
Sets the text matcher for field values.
- setValueMatcher(TextMatcher) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Sets the value matcher.
- setValueMatcher(TextMatcher) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer.Replacement
-
Sets the value matcher.
- setValueMatcher(DateCondition.ValueMatcher) - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- setValueMatcher(NumericCondition.ValueMatcher) - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
- setValueMatcherRangeEnd(DateCondition.ValueMatcher) - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- setValueMatcherRangeEnd(NumericCondition.ValueMatcher) - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
- setWholeMatch(boolean) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
-
Deprecated.
- setWithHeaders(boolean) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- setWithOccurences(boolean) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- splitApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.splitter.AbstractDocumentSplitter
- splitApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- splitApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
- splitApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- splitApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- splitApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.splitter.impl.XMLStreamSplitter
- SplitDetails() - Constructor for class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
- SplitDetails(TextMatcher, String, String) - Constructor for class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Constructor.
- SplitDetails(TextMatcher, String, String, boolean) - Constructor for class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Constructor.
- SplitDetails(String, String, boolean) - Constructor for class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Deprecated. Since 3.0.0.
- SplitDetails(String, String, String, boolean) - Constructor for class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
-
Deprecated. Since 3.0.0.
- splitDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.splitter.AbstractDocumentSplitter
- splitDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in interface com.norconex.importer.handler.splitter.IDocumentSplitter
- SplitEmbbededParser(String, Parser, Properties, CachedStreamFactory) - Constructor for class com.norconex.importer.parser.impl.AbstractTikaParser.SplitEmbbededParser
- SplitTagger - Class in com.norconex.importer.handler.tagger.impl
-
Splits an existing metadata value into multiple values based on a given value separator (the separator gets discarded).
- SplitTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.SplitTagger
- SplitTagger.SplitDetails - Class in com.norconex.importer.handler.tagger.impl
- StaticDateTimeSupplier(ZonedDateTime) - Constructor for class com.norconex.importer.handler.condition.impl.DateCondition.StaticDateTimeSupplier
- StaticDateTimeSupplier(ZonedDateTime) - Constructor for class com.norconex.importer.handler.filter.impl.DateMetadataFilter.StaticDateTimeSupplier
- StripAfterTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Strips any content found after the first match of the given pattern.
- StripAfterTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
- StripBeforeTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Strips any content found before the first match of the given pattern.
- StripBeforeTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
- StripBetweenDetails() - Constructor for class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer.StripBetweenDetails
-
Constructor.
- StripBetweenDetails(TextMatcher, TextMatcher) - Constructor for class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer.StripBetweenDetails
-
Constructor.
- StripBetweenTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Strips any content found between matching start and end strings.
- StripBetweenTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
- StripBetweenTransformer.StripBetweenDetails - Class in com.norconex.importer.handler.transformer.impl
- subject(Object) - Method in class com.norconex.importer.ImporterEvent.Builder
- SubstringTransformer - Class in com.norconex.importer.handler.transformer.impl
-
Keeps a substring of the content, delimited by begin and end character indexes.
- SubstringTransformer() - Constructor for class com.norconex.importer.handler.transformer.impl.SubstringTransformer
- SUCCESS - com.norconex.importer.response.ImporterStatus.Status
T
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.AbstractDocumentTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.CopyTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- tagApplicableDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
- tagDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.tagger.AbstractDocumentTagger
- tagDocument(HandlerDoc, InputStream, ParseState) - Method in interface com.norconex.importer.handler.tagger.IDocumentTagger
-
Tags a document with extra metadata information.
- tagStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
- tagStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
- tagStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
- tagStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
- tagStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
- tagStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated.
- tagStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- tagTextDocument(HandlerDoc, Reader, ParseState) - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
- tagTextDocument(HandlerDoc, Reader, ParseState) - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
- tagTextDocument(HandlerDoc, Reader, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
- tagTextDocument(HandlerDoc, Reader, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
- tagTextDocument(HandlerDoc, Reader, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
- tagTextDocument(HandlerDoc, Reader, ParseState) - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
- test(HandlerContext) - Method in class com.norconex.importer.handler.HandlerPredicate
- testDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.condition.AbstractCharStreamCondition
- testDocument(HandlerDoc, InputStream, ParseState) - Method in interface com.norconex.importer.handler.condition.IImporterCondition
-
Tests a given document.
- testDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.condition.impl.BlankCondition
- testDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- testDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
- testDocument(HandlerDoc, InputStream, ParseState) - Method in class com.norconex.importer.handler.condition.impl.ReferenceCondition
- testDocument(HandlerDoc, Reader, ParseState) - Method in class com.norconex.importer.handler.condition.AbstractCharStreamCondition
- testDocument(HandlerDoc, Reader, ParseState) - Method in class com.norconex.importer.handler.condition.AbstractStringCondition
- testDocument(HandlerDoc, Reader, ParseState) - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
- testDocument(HandlerDoc, String, ParseState, int) - Method in class com.norconex.importer.handler.condition.AbstractStringCondition
- testDocument(HandlerDoc, String, ParseState, int) - Method in class com.norconex.importer.handler.condition.impl.ScriptCondition
- testDocument(HandlerDoc, String, ParseState, int) - Method in class com.norconex.importer.handler.condition.impl.TextCondition
- TextBetweenDetails() - Constructor for class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Constructor.
- TextBetweenDetails(String, TextMatcher, TextMatcher, TextMatcher) - Constructor for class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Constructor.
- TextBetweenDetails(String, String, String) - Constructor for class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
-
Deprecated. Since 3.0.0.
- TextBetweenTagger - Class in com.norconex.importer.handler.tagger.impl
-
Extracts values found between matching start and end strings and adds them to a document metadata field.
- TextBetweenTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
- TextBetweenTagger.TextBetweenDetails - Class in com.norconex.importer.handler.tagger.impl
- TextCondition - Class in com.norconex.importer.handler.condition.impl
-
A condition based on a text pattern matching the document content (default) or specific field(s).
- TextCondition() - Constructor for class com.norconex.importer.handler.condition.impl.TextCondition
- TextCondition(TextMatcher) - Constructor for class com.norconex.importer.handler.condition.impl.TextCondition
- TextCondition(TextMatcher, TextMatcher) - Constructor for class com.norconex.importer.handler.condition.impl.TextCondition
- TextFilter - Class in com.norconex.importer.handler.filter.impl
-
Filters a document based on a text pattern matching the document content (default) or the specified fields.
- TextFilter() - Constructor for class com.norconex.importer.handler.filter.impl.TextFilter
- TextFilter(TextMatcher) - Constructor for class com.norconex.importer.handler.filter.impl.TextFilter
- TextFilter(TextMatcher, TextMatcher) - Constructor for class com.norconex.importer.handler.filter.impl.TextFilter
- TextFilter(TextMatcher, TextMatcher, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.TextFilter
- TextFilter(TextMatcher, OnMatch) - Constructor for class com.norconex.importer.handler.filter.impl.TextFilter
- TextPatternTagger - Class in com.norconex.importer.handler.tagger.impl
-
Deprecated. Since 3.0.0, use RegexTagger.
- TextPatternTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated.
- TextStatisticsTagger - Class in com.norconex.importer.handler.tagger.impl
-
Analyzes the content of the supplied document and adds statistical information about its content or field as metadata fields.
- TextStatisticsTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
- TitleGeneratorTagger - Class in com.norconex.importer.handler.tagger.impl
-
Attempts to generate a title from the document content (default) or a specified metadata field.
- TitleGeneratorTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- toCondition(DateMetadataFilter.Operator, String, ZoneId) - Static method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- toJSoupParser(String) - Static method in class com.norconex.importer.util.DOMUtil
-
Gets the JSoup parser associated with the string representation.
- TOKEN_INPUT - Static variable in class com.norconex.importer.handler.ExternalHandler
- TOKEN_INPUT_META - Static variable in class com.norconex.importer.handler.ExternalHandler
- TOKEN_OUTPUT - Static variable in class com.norconex.importer.handler.ExternalHandler
- TOKEN_OUTPUT_META - Static variable in class com.norconex.importer.handler.ExternalHandler
- TOKEN_REFERENCE - Static variable in class com.norconex.importer.handler.ExternalHandler
- toString() - Method in class com.norconex.importer.doc.Doc
- toString() - Method in class com.norconex.importer.doc.DocInfo
- toString() - Method in class com.norconex.importer.handler.AbstractImporterHandler
- toString() - Method in class com.norconex.importer.handler.condition.AbstractCharStreamCondition
- toString() - Method in class com.norconex.importer.handler.condition.AbstractStringCondition
- toString() - Method in class com.norconex.importer.handler.condition.impl.BlankCondition
- toString() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.DynamicFixedDateTimeSupplier
- toString() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.DynamicFloatingDateTimeSupplier
- toString() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.StaticDateTimeSupplier
- toString() - Method in enum com.norconex.importer.handler.condition.impl.DateCondition.TimeUnit
- toString() - Method in class com.norconex.importer.handler.condition.impl.DateCondition
- toString() - Method in class com.norconex.importer.handler.condition.impl.DateCondition.ValueMatcher
- toString() - Method in class com.norconex.importer.handler.condition.impl.DOMCondition
- toString() - Method in class com.norconex.importer.handler.condition.impl.NumericCondition
- toString() - Method in class com.norconex.importer.handler.condition.impl.NumericCondition.ValueMatcher
- toString() - Method in class com.norconex.importer.handler.condition.impl.ReferenceCondition
- toString() - Method in class com.norconex.importer.handler.condition.impl.ScriptCondition
- toString() - Method in class com.norconex.importer.handler.condition.impl.TextCondition
- toString() - Method in class com.norconex.importer.handler.ExternalHandler
- toString() - Method in class com.norconex.importer.handler.filter.AbstractCharStreamFilter
- toString() - Method in class com.norconex.importer.handler.filter.AbstractDocumentFilter
- toString() - Method in class com.norconex.importer.handler.filter.AbstractOnMatchFilter
-
Deprecated.
- toString() - Method in class com.norconex.importer.handler.filter.AbstractStringFilter
- toString() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.Condition
- toString() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.DynamicFixedDateTimeSupplier
- toString() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.DynamicFloatingDateTimeSupplier
- toString() - Method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.Operator
- toString() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter.StaticDateTimeSupplier
- toString() - Method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
- toString() - Method in class com.norconex.importer.handler.filter.impl.DateMetadataFilter
- toString() - Method in class com.norconex.importer.handler.filter.impl.DOMContentFilter
-
Deprecated.
- toString() - Method in class com.norconex.importer.handler.filter.impl.DOMFilter
- toString() - Method in class com.norconex.importer.handler.filter.impl.EmptyFilter
- toString() - Method in class com.norconex.importer.handler.filter.impl.EmptyMetadataFilter
-
Deprecated.
- toString() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Condition
- toString() - Method in enum com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Operator
- toString() - Method in class com.norconex.importer.handler.filter.impl.NumericMetadataFilter
- toString() - Method in class com.norconex.importer.handler.filter.impl.ReferenceFilter
- toString() - Method in class com.norconex.importer.handler.filter.impl.RegexContentFilter
-
Deprecated.
- toString() - Method in class com.norconex.importer.handler.filter.impl.RegexMetadataFilter
-
Deprecated.
- toString() - Method in class com.norconex.importer.handler.filter.impl.RegexReferenceFilter
-
Deprecated.
- toString() - Method in class com.norconex.importer.handler.filter.impl.RejectFilter
- toString() - Method in class com.norconex.importer.handler.filter.impl.ScriptFilter
- toString() - Method in class com.norconex.importer.handler.filter.impl.TextFilter
- toString() - Method in class com.norconex.importer.handler.HandlerConsumer
- toString() - Method in class com.norconex.importer.handler.HandlerDoc
- toString() - Method in class com.norconex.importer.handler.HandlerPredicate
- toString() - Method in class com.norconex.importer.handler.ScriptRunner
- toString() - Method in class com.norconex.importer.handler.splitter.impl.CsvSplitter
- toString() - Method in class com.norconex.importer.handler.splitter.impl.DOMSplitter
- toString() - Method in class com.norconex.importer.handler.splitter.impl.PDFPageSplitter
- toString() - Method in class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
- toString() - Method in class com.norconex.importer.handler.splitter.impl.XMLStreamSplitter
- toString() - Method in class com.norconex.importer.handler.tagger.AbstractCharStreamTagger
- toString() - Method in class com.norconex.importer.handler.tagger.AbstractStringTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.CharacterCaseTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.CharsetTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.ConstantTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.CopyTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger.MatchDetails
-
Deprecated.
- toString() - Method in class com.norconex.importer.handler.tagger.impl.CountMatchesTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.CurrentDateTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.DateFormatTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.DebugTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.DeleteTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.DocumentLengthTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger.DOMExtractDetails
- toString() - Method in class com.norconex.importer.handler.tagger.impl.DOMTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.ExternalTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.FieldReportTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.ForceSingleValueTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger.HierarchyDetails
- toString() - Method in class com.norconex.importer.handler.tagger.impl.HierarchyTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.KeepOnlyTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.LanguageTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger.Merge
- toString() - Method in class com.norconex.importer.handler.tagger.impl.MergeTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.RegexTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger.RenameDetails
- toString() - Method in class com.norconex.importer.handler.tagger.impl.RenameTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger.Replacement
- toString() - Method in class com.norconex.importer.handler.tagger.impl.ReplaceTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.ScriptTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger.SplitDetails
- toString() - Method in class com.norconex.importer.handler.tagger.impl.SplitTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger.TextBetweenDetails
- toString() - Method in class com.norconex.importer.handler.tagger.impl.TextBetweenTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.TextPatternTagger
-
Deprecated.
- toString() - Method in class com.norconex.importer.handler.tagger.impl.TextStatisticsTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.TruncateTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
- toString() - Method in class com.norconex.importer.handler.tagger.impl.UUIDTagger
- toString() - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer.DOMExtractDetails
- toString() - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer.Replacement
- toString() - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer.StripBetweenDetails
- toString() - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
- toString() - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
- toString() - Method in class com.norconex.importer.ImporterConfig
- toString() - Method in class com.norconex.importer.ImporterEvent
- toString() - Method in class com.norconex.importer.ImporterRequest
- toString() - Method in class com.norconex.importer.parser.EmbeddedConfig
- toString() - Method in class com.norconex.importer.parser.GenericDocumentParserFactory
- toString() - Method in class com.norconex.importer.parser.impl.AbstractTikaParser
- toString() - Method in class com.norconex.importer.parser.impl.ExternalParser
- toString() - Method in class com.norconex.importer.parser.impl.xfdl.XFDLParser
- toString() - Method in class com.norconex.importer.parser.OCRConfig
- toString() - Method in class com.norconex.importer.parser.ParseHints
- toString() - Method in class com.norconex.importer.response.ImporterResponse
- toString() - Method in class com.norconex.importer.response.ImporterStatus
- toString() - Method in class com.norconex.importer.util.regex.RegexFieldExtractor
-
Deprecated.
- toTemporal() - Method in enum com.norconex.importer.handler.condition.impl.DateCondition.TimeUnit
- toTemporal() - Method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
- transformApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
- transformApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.transformer.AbstractDocumentTransformer
- transformApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.transformer.impl.CharsetTransformer
- transformApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.transformer.impl.DOMDeleteTransformer
- transformApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.transformer.impl.DOMPreserveTransformer
- transformApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.transformer.impl.ExternalTransformer
- transformApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- transformApplicableDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.transformer.impl.NoContentTransformer
- transformDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in class com.norconex.importer.handler.transformer.AbstractDocumentTransformer
- transformDocument(HandlerDoc, InputStream, OutputStream, ParseState) - Method in interface com.norconex.importer.handler.transformer.IDocumentTransformer
-
Transforms document content and metadata.
- transformImage(MutableImage) - Method in class com.norconex.importer.handler.transformer.impl.ImageTransformer
- transformStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
- transformStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.transformer.impl.ReduceConsecutivesTransformer
- transformStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.transformer.impl.ReplaceTransformer
- transformStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.transformer.impl.ScriptTransformer
- transformStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.transformer.impl.StripAfterTransformer
- transformStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.transformer.impl.StripBeforeTransformer
- transformStringContent(HandlerDoc, StringBuilder, ParseState, int) - Method in class com.norconex.importer.handler.transformer.impl.StripBetweenTransformer
- transformTextDocument(HandlerDoc, Reader, Writer, ParseState) - Method in class com.norconex.importer.handler.transformer.AbstractCharStreamTransformer
- transformTextDocument(HandlerDoc, Reader, Writer, ParseState) - Method in class com.norconex.importer.handler.transformer.AbstractStringTransformer
- transformTextDocument(HandlerDoc, Reader, Writer, ParseState) - Method in class com.norconex.importer.handler.transformer.impl.SubstringTransformer
- TRANSLATED_FROM - Static variable in class com.norconex.importer.doc.DocMetadata
-
Original language when translated.
- TranslatorSplitter - Class in com.norconex.importer.handler.splitter.impl
-
Translates documents using one of the supported translation APIs.
- TranslatorSplitter() - Constructor for class com.norconex.importer.handler.splitter.impl.TranslatorSplitter
-
Constructor.
- TruncateTagger - Class in com.norconex.importer.handler.tagger.impl
-
Truncates a fromField value(s) and optionally replaces the truncated portion with a hash value to help ensure uniqueness (not 100% guaranteed to be collision-free).
- TruncateTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.TruncateTagger
- TruncateTagger(TextMatcher, int) - Constructor for class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
Constructor.
- TruncateTagger(String, int) - Constructor for class com.norconex.importer.handler.tagger.impl.TruncateTagger
-
Deprecated. Since 3.0.0, use TruncateTagger(TextMatcher, int).
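The truncate-and-hash idea described for TruncateTagger can be illustrated with plain Java. This is only a sketch under stated assumptions: the class `TruncateSketch`, the method `truncateWithHash`, and the choice of an 8-digit hexadecimal `String.hashCode()` suffix are hypothetical and are not taken from the library, whose actual hashing scheme is not shown in this index.

```java
class TruncateSketch {
    // Assumption: a fixed-width, 8-hex-digit rendering of a 32-bit hash.
    private static final int HASH_WIDTH = 8;

    // Hypothetical helper, not part of the Norconex Importer API.
    // Returns a string of at most maxLength characters. When the value is
    // too long, the dropped tail is replaced by a hash of that tail so that
    // values sharing a prefix are less likely to collide.
    static String truncateWithHash(String value, int maxLength) {
        if (value.length() <= maxLength) {
            return value; // nothing to truncate
        }
        if (maxLength <= HASH_WIDTH) {
            return value.substring(0, maxLength); // no room for a hash
        }
        int keep = maxLength - HASH_WIDTH;
        String hash = String.format("%08x", value.substring(keep).hashCode());
        return value.substring(0, keep) + hash;
    }

    public static void main(String[] args) {
        System.out.println(truncateWithHash("abcdefghijklmnopqrstuvwxyz", 12));
    }
}
```

As the description notes, a hash suffix reduces but does not eliminate the chance of two truncated values being equal.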
U
- UNLIMITED_TITLE_LENGTH - Static variable in class com.norconex.importer.handler.tagger.impl.TitleGeneratorTagger
- URLExtractorTagger - Class in com.norconex.importer.handler.tagger.impl
-
Extracts unique URLs matching specific patterns in plain text content and stores them in a given field.
- URLExtractorTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.URLExtractorTagger
- UUIDTagger - Class in com.norconex.importer.handler.tagger.impl
-
Generates a random universally unique identifier (UUID) and stores it in the specified field.
- UUIDTagger() - Constructor for class com.norconex.importer.handler.tagger.impl.UUIDTagger
-
Constructor.
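What "generates a random UUID and stores it in the specified field" amounts to can be sketched with the standard `java.util.UUID` class. The class `UuidFieldSketch`, the method `tagWithUuid`, the field name `document.uuid`, and the use of a plain `Map` as a stand-in for document metadata are all assumptions made for illustration; they are not the library's API.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.UUID;

class UuidFieldSketch {
    // Hypothetical helper, not part of the Norconex Importer API.
    // Adds a random (version 4) UUID to the named field and returns it.
    static String tagWithUuid(Map<String, List<String>> metadata, String field) {
        String uuid = UUID.randomUUID().toString();
        metadata.computeIfAbsent(field, k -> new ArrayList<>()).add(uuid);
        return uuid;
    }

    public static void main(String[] args) {
        // A plain map stands in for document metadata (assumption).
        Map<String, List<String>> metadata = new HashMap<>();
        tagWithUuid(metadata, "document.uuid");
        System.out.println(metadata);
    }
}
```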
V
- ValueMatcher(Operator, double) - Constructor for class com.norconex.importer.handler.condition.impl.NumericCondition.ValueMatcher
- ValueMatcher(Operator, Supplier<ZonedDateTime>) - Constructor for class com.norconex.importer.handler.condition.impl.DateCondition.ValueMatcher
- valueOf(String) - Static method in enum com.norconex.importer.handler.condition.impl.DateCondition.TimeUnit
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.Operator
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Operator
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.norconex.importer.handler.filter.OnMatch
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.norconex.importer.handler.tagger.impl.ConstantTagger.OnConflict
-
Deprecated. Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.norconex.importer.parser.ParseState
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.norconex.importer.response.ImporterStatus.Status
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum com.norconex.importer.handler.condition.impl.DateCondition.TimeUnit
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.Operator
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.norconex.importer.handler.filter.impl.NumericMetadataFilter.Operator
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.norconex.importer.handler.filter.OnMatch
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.norconex.importer.handler.tagger.impl.ConstantTagger.OnConflict
-
Deprecated. Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.norconex.importer.parser.ParseState
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.norconex.importer.response.ImporterStatus.Status
-
Returns an array containing the constants of this enum type, in the order they are declared.
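The `valueOf(String)` and `values()` entries above are the methods the Java compiler generates for every enum. The sketch below shows their standard behavior on a stand-in enum; `EnumLookupSketch`, `Unit`, and `parseOrNull` are hypothetical names used for illustration and do not belong to the library.

```java
import java.util.Locale;

class EnumLookupSketch {
    // Hypothetical stand-in enum; not the library's TimeUnit.
    enum Unit { DAY, MONTH, YEAR }

    // valueOf(String) requires an exact, case-sensitive constant name and
    // throws IllegalArgumentException otherwise. This helper normalizes
    // the input and returns null for unknown names instead.
    static Unit parseOrNull(String name) {
        try {
            return Unit.valueOf(name.trim().toUpperCase(Locale.ROOT));
        } catch (IllegalArgumentException e) {
            return null;
        }
    }

    public static void main(String[] args) {
        // values() returns the constants in declaration order.
        for (Unit u : Unit.values()) {
            System.out.println(u.ordinal() + " " + u.name());
        }
        System.out.println(parseOrNull("year"));
    }
}
```

Wrapping `valueOf` this way is one option when names come from configuration files, where case and surrounding whitespace may vary.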
X
- XFDLParser - Class in com.norconex.importer.parser.impl.xfdl
-
Parser for PureEdge Extensible Forms Description Language (XFDL).
- XFDLParser() - Constructor for class com.norconex.importer.parser.impl.xfdl.XFDLParser
- XML_BASE_CONTENT_TYPES - Static variable in class com.norconex.importer.handler.CommonMatchers
-
Base content types for XML: "application/xml" and "text/xml".
- XML_CONTENT_TYPES - Static variable in class com.norconex.importer.handler.CommonMatchers
-
HTML, XHTML, and XML-based content types.
- XML_FEED_CONTENT_TYPES - Static variable in class com.norconex.importer.handler.CommonMatchers
-
ATOM, RDF, RSS, and strict XML content types.
- xmlContentTypes() - Static method in class com.norconex.importer.handler.CommonMatchers
-
Common content-types defining an XML document.
- xmlContentTypes(String) - Static method in class com.norconex.importer.handler.CommonRestrictions
-
Common content-types defining an XML document.
- xmlFeedContentTypes() - Static method in class com.norconex.importer.handler.CommonMatchers
-
Matcher for common content-types defining an XML feed (RSS, Atom).
- xmlFeedContentTypes(String) - Static method in class com.norconex.importer.handler.CommonRestrictions
-
Common content-types defining an XML feed (RSS, Atom).
- XMLStreamSplitter - Class in com.norconex.importer.handler.splitter.impl
-
Splits an XML document on a specific element.
- XMLStreamSplitter() - Constructor for class com.norconex.importer.handler.splitter.impl.XMLStreamSplitter
Y
- YEAR - com.norconex.importer.handler.condition.impl.DateCondition.TimeUnit
- YEAR - com.norconex.importer.handler.filter.impl.DateMetadataFilter.TimeUnit