- checksumMD5(InputStream) - Static method in class com.norconex.collector.core.checksum.ChecksumUtil
-
- checksumMD5(String) - Static method in class com.norconex.collector.core.checksum.ChecksumUtil
-
- ChecksumStageUtil - Class in com.norconex.collector.core.pipeline
-
Checksum stage utility methods.
- ChecksumUtil - Class in com.norconex.collector.core.checksum
-
Checksum utility methods.
- cleanupExecution(JobStatusUpdater, JobSuite, ICrawlDataStore) - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- clone() - Method in class com.norconex.collector.core.data.BaseCrawlData
-
- clone() - Method in interface com.norconex.collector.core.data.ICrawlData
-
Clones this reference.
- close() - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Closes a database connection.
- close() - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- close() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- close() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- COLLECTOR_CHECKSUM_DOC - Static variable in class com.norconex.collector.core.doc.CollectorMetadata
-
- COLLECTOR_CHECKSUM_METADATA - Static variable in class com.norconex.collector.core.doc.CollectorMetadata
-
- COLLECTOR_CONTENT_ENCODING - Static variable in class com.norconex.collector.core.doc.CollectorMetadata
-
- COLLECTOR_CONTENT_TYPE - Static variable in class com.norconex.collector.core.doc.CollectorMetadata
-
- COLLECTOR_IS_CRAWL_NEW - Static variable in class com.norconex.collector.core.doc.CollectorMetadata
-
Boolean flag indicating whether a document is new to the crawler that
fetched it.
- COLLECTOR_PREFIX - Static variable in class com.norconex.collector.core.doc.CollectorMetadata
-
- CollectorConfigLoader - Class in com.norconex.collector.core
-
Collector configuration loader.
- CollectorConfigLoader(Class<? extends ICollectorConfig>) - Constructor for class com.norconex.collector.core.CollectorConfigLoader
-
- CollectorException - Exception in com.norconex.collector.core
-
Runtime exception for most unrecoverable issues thrown by Collector
classes.
- CollectorException() - Constructor for exception com.norconex.collector.core.CollectorException
-
- CollectorException(String) - Constructor for exception com.norconex.collector.core.CollectorException
-
- CollectorException(Throwable) - Constructor for exception com.norconex.collector.core.CollectorException
-
- CollectorException(String, Throwable) - Constructor for exception com.norconex.collector.core.CollectorException
-
- CollectorMetadata - Class in com.norconex.collector.core.doc
-
Collector metadata with constants for common metadata field
names.
- CollectorMetadata() - Constructor for class com.norconex.collector.core.doc.CollectorMetadata
-
- CollectorMetadata(Properties) - Constructor for class com.norconex.collector.core.doc.CollectorMetadata
-
- com.norconex.collector.core - package com.norconex.collector.core
-
- com.norconex.collector.core.checksum - package com.norconex.collector.core.checksum
-
- com.norconex.collector.core.checksum.impl - package com.norconex.collector.core.checksum.impl
-
- com.norconex.collector.core.crawler - package com.norconex.collector.core.crawler
-
- com.norconex.collector.core.crawler.event - package com.norconex.collector.core.crawler.event
-
- com.norconex.collector.core.data - package com.norconex.collector.core.data
-
- com.norconex.collector.core.data.store - package com.norconex.collector.core.data.store
-
- com.norconex.collector.core.data.store.impl.jdbc - package com.norconex.collector.core.data.store.impl.jdbc
-
- com.norconex.collector.core.data.store.impl.mongo - package com.norconex.collector.core.data.store.impl.mongo
-
- com.norconex.collector.core.data.store.impl.mvstore - package com.norconex.collector.core.data.store.impl.mvstore
-
- com.norconex.collector.core.doc - package com.norconex.collector.core.doc
-
- com.norconex.collector.core.filter - package com.norconex.collector.core.filter
-
- com.norconex.collector.core.filter.impl - package com.norconex.collector.core.filter.impl
-
- com.norconex.collector.core.jmx - package com.norconex.collector.core.jmx
-
- com.norconex.collector.core.pipeline - package com.norconex.collector.core.pipeline
-
- com.norconex.collector.core.pipeline.committer - package com.norconex.collector.core.pipeline.committer
-
- com.norconex.collector.core.pipeline.importer - package com.norconex.collector.core.pipeline.importer
-
- com.norconex.collector.core.pipeline.queue - package com.norconex.collector.core.pipeline.queue
-
- com.norconex.collector.core.spoil - package com.norconex.collector.core.spoil
-
- com.norconex.collector.core.spoil.impl - package com.norconex.collector.core.spoil.impl
-
- CommitModuleStage - Class in com.norconex.collector.core.pipeline.committer
-
Common pipeline stage for committing documents.
- CommitModuleStage() - Constructor for class com.norconex.collector.core.pipeline.committer.CommitModuleStage
-
- CopyIfNullBeanUtilsBean() - Constructor for class com.norconex.collector.core.crawler.AbstractCrawler.CopyIfNullBeanUtilsBean
-
- copyProperty(Object, String, Object) - Method in class com.norconex.collector.core.crawler.AbstractCrawler.CopyIfNullBeanUtilsBean
-
- CrawlDataStoreException - Exception in com.norconex.collector.core.data.store
-
Crawl data store runtime exception.
- CrawlDataStoreException() - Constructor for exception com.norconex.collector.core.data.store.CrawlDataStoreException
-
- CrawlDataStoreException(String) - Constructor for exception com.norconex.collector.core.data.store.CrawlDataStoreException
-
- CrawlDataStoreException(Throwable) - Constructor for exception com.norconex.collector.core.data.store.CrawlDataStoreException
-
- CrawlDataStoreException(String, Throwable) - Constructor for exception com.norconex.collector.core.data.store.CrawlDataStoreException
-
- CRAWLER_FINISHED - Static variable in class com.norconex.collector.core.crawler.event.CrawlerEvent
-
The crawler completed execution (without being stopped).
- CRAWLER_RESUMED - Static variable in class com.norconex.collector.core.crawler.event.CrawlerEvent
-
The crawler resumed execution (from a previous incomplete crawl).
- CRAWLER_STARTED - Static variable in class com.norconex.collector.core.crawler.event.CrawlerEvent
-
The crawler started.
- CRAWLER_STOPPED - Static variable in class com.norconex.collector.core.crawler.event.CrawlerEvent
-
Issued when a request to stop the crawler has been fully executed
(crawler stopped).
- CRAWLER_STOPPING - Static variable in class com.norconex.collector.core.crawler.event.CrawlerEvent
-
Issued when a request to stop the crawler has been received.
- CrawlerConfigLoader - Class in com.norconex.collector.core.crawler
-
HTTP Crawler configuration loader.
- CrawlerConfigLoader(Class<? extends ICrawlerConfig>) - Constructor for class com.norconex.collector.core.crawler.CrawlerConfigLoader
-
- CrawlerEvent - Class in com.norconex.collector.core.crawler.event
-
A crawler event.
- CrawlerEvent(String, ICrawlData, Object) - Constructor for class com.norconex.collector.core.crawler.event.CrawlerEvent
-
- crawlerEvent(ICrawler, CrawlerEvent) - Method in interface com.norconex.collector.core.crawler.event.ICrawlerEventListener
-
Fired when a crawler event occurs.
- CrawlerEventManager - Class in com.norconex.collector.core.crawler.event
-
Manage event listeners and log events.
- CrawlerEventManager(ICrawler, ICrawlerEventListener[]) - Constructor for class com.norconex.collector.core.crawler.event.CrawlerEventManager
-
- CrawlState - Class in com.norconex.collector.core.data
-
Reference processing status.
- CrawlState(String) - Constructor for class com.norconex.collector.core.data.CrawlState
-
Constructor.
- createCollector(ICollectorConfig) - Method in class com.norconex.collector.core.AbstractCollectorLauncher
-
- createCrawlDataStore(boolean) - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- createCrawlDataStore(ICrawlerConfig, boolean) - Method in interface com.norconex.collector.core.data.store.ICrawlDataStoreFactory
-
Creates a new crawl data store.
- createCrawlDataStore(ICrawlerConfig, boolean) - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCCrawlDataStoreFactory
-
- createCrawlDataStore(ICrawlerConfig, boolean) - Method in class com.norconex.collector.core.data.store.impl.mongo.AbstractMongoCrawlDataStoreFactory
-
- createCrawlDataStore(ICrawlerConfig, boolean) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStoreFactory
-
- createCrawler(ICrawlerConfig) - Method in class com.norconex.collector.core.AbstractCollector
-
Creates a new crawler instance.
- createDocumentChecksum(ImporterDocument) - Method in class com.norconex.collector.core.checksum.AbstractDocumentChecksummer
-
- createDocumentChecksum(ImporterDocument) - Method in interface com.norconex.collector.core.checksum.IDocumentChecksummer
-
Creates a document checksum.
- createEmbeddedCrawlData(String, ICrawlData) - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- createIndices(MongoCollection<Document>, MongoCollection<Document>) - Method in class com.norconex.collector.core.data.store.impl.mongo.BaseMongoSerializer
-
- createIndices(MongoCollection<Document>, MongoCollection<Document>) - Method in interface com.norconex.collector.core.data.store.impl.mongo.IMongoSerializer
-
Creates Mongo indices for the given collections.
- createJDBCSerializer() - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCCrawlDataStoreFactory
-
- createJobSuite() - Method in class com.norconex.collector.core.AbstractCollector
-
- createMetadataChecksum(Properties) - Method in class com.norconex.collector.core.checksum.AbstractMetadataChecksummer
-
- createMetadataChecksum(Properties) - Method in interface com.norconex.collector.core.checksum.IMetadataChecksummer
-
Creates a metadata checksum.
- createMongoSerializer() - Method in class com.norconex.collector.core.data.store.impl.mongo.AbstractMongoCrawlDataStoreFactory
-
- GenericMetadataChecksummer - Class in com.norconex.collector.core.checksum.impl
-
Generic implementation of
IMetadataChecksummer
that uses
specified source field names and their values for the checksum.
- GenericMetadataChecksummer() - Constructor for class com.norconex.collector.core.checksum.impl.GenericMetadataChecksummer
-
- GenericSpoiledReferenceStrategizer - Class in com.norconex.collector.core.spoil.impl
-
Generic implementation of
ISpoiledReferenceStrategizer
that
offers a simple mapping between the crawl state of references that have
turned "bad" and the strategy to adopt for each.
- GenericSpoiledReferenceStrategizer() - Constructor for class com.norconex.collector.core.spoil.impl.GenericSpoiledReferenceStrategizer
-
- getActiveCount() - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Gets the number of active references (currently being processed).
- getActiveCount() - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- getActiveCount() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- getActiveCount() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- getAutoCommitBufferSize() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- getAutoCommitDelay() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- getAutoCompactFillRate() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- getBaseDownloadDir() - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- getCacheConcurrency() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- getCached(String) - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Gets the cached reference from previous time crawler was run
(e.g.
- getCached(String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- getCached(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- getCached(String) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- getCachedCollectionName() - Method in class com.norconex.collector.core.data.store.impl.mongo.AbstractMongoCrawlDataStoreFactory
-
Gets the cached collection name.
- getCachedCollectionName() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
Gets the cached collection name.
- getCachedCrawlData() - Method in class com.norconex.collector.core.pipeline.DocumentPipelineContext
-
Gets cached crawl data.
- getCachedCrawlDataSQL() - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getCachedCrawlDataSQL() - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
Gets the SQL to obtain all
ICrawlData
from the cache table.
- getCachedCrawlDataValues(String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getCachedCrawlDataValues(String) - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
- getCacheIterator() - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Gets the cache iterator.
- getCacheIterator() - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- getCacheIterator() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- getCacheIterator() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- getCacheSize() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- getCollectorConfig() - Method in class com.norconex.collector.core.AbstractCollector
-
Gets the collector configuration
- getCollectorConfig() - Method in interface com.norconex.collector.core.ICollector
-
Gets the collector configuration
- getCollectorConfigClass() - Method in class com.norconex.collector.core.AbstractCollectorLauncher
-
- getCollectorListeners() - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
- getCollectorListeners() - Method in interface com.norconex.collector.core.ICollectorConfig
-
Gets collector life cycle listeners.
- getCommitter() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getCommitter() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the Committer module configuration.
- getCompress() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- getConfig() - Method in class com.norconex.collector.core.pipeline.BasePipelineContext
-
- getConnectionDetails() - Method in class com.norconex.collector.core.data.store.impl.mongo.AbstractMongoCrawlDataStoreFactory
-
- getContent() - Method in class com.norconex.collector.core.pipeline.DocumentPipelineContext
-
- getContentChecksum() - Method in class com.norconex.collector.core.data.BaseCrawlData
-
Gets the content checksum.
- getContentChecksum() - Method in interface com.norconex.collector.core.data.ICrawlData
-
- getContentReader() - Method in class com.norconex.collector.core.pipeline.DocumentPipelineContext
-
- getContentType() - Method in class com.norconex.collector.core.data.BaseCrawlData
-
Gets the content type.
- getContentType() - Method in interface com.norconex.collector.core.data.ICrawlData
-
Gets the content type.
- getCrawlData() - Method in class com.norconex.collector.core.crawler.event.CrawlerEvent
-
Gets the crawl data holding contextual information about the
crawled reference.
- getCrawlData() - Method in class com.norconex.collector.core.pipeline.BasePipelineContext
-
- getCrawlDataStore() - Method in class com.norconex.collector.core.pipeline.BasePipelineContext
-
- getCrawlDataStoreFactory() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getCrawlDataStoreFactory() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the crawl data store factory a crawler should use.
- getCrawlDate() - Method in class com.norconex.collector.core.data.BaseCrawlData
-
Gets the crawl date.
- getCrawlDate() - Method in interface com.norconex.collector.core.data.ICrawlData
-
Gets the crawl date.
- getCrawler() - Method in class com.norconex.collector.core.pipeline.BasePipelineContext
-
- getCrawlerConfig() - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
Gets the crawler configuration
- getCrawlerConfig() - Method in interface com.norconex.collector.core.crawler.ICrawler
-
Gets the crawler configuration
- getCrawlerConfigs() - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
- getCrawlerConfigs() - Method in interface com.norconex.collector.core.ICollectorConfig
-
Gets all crawler configurations.
- getCrawlerDownloadDir() - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- getCrawlerEventManager() - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- getCrawlerEventManager() - Method in interface com.norconex.collector.core.crawler.ICrawler
-
Gets the crawler events manager.
- getCrawlerListeners() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getCrawlerListeners() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets crawler event listeners.
- getCrawlers() - Method in class com.norconex.collector.core.AbstractCollector
-
Gets all crawler instances in this collector.
- getCreateTableSQLs(String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getCreateTableSQLs(String) - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
Gets the SQLs used to create a data store table.
- getDatabaseName() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
- getDeleteCrawlDataSQL(String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getDeleteCrawlDataSQL(String) - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
Gets the SQL to delete a
ICrawlData
from the given table.
- getDeleteCrawlDataValues(String, ICrawlData) - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getDeleteCrawlDataValues(String, ICrawlData) - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
- getDocument() - Method in class com.norconex.collector.core.pipeline.DocumentPipelineContext
-
- getDocumentChecksummer() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getDocumentChecksummer() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the document checksummer.
- getDocumentFilters() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getDocumentFilters() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the document filters.
- getEventType() - Method in class com.norconex.collector.core.crawler.event.CrawlerEvent
-
Gets the event type.
- getExtensionParts() - Method in class com.norconex.collector.core.filter.impl.ExtensionReferenceFilter
-
- getExtensions() - Method in class com.norconex.collector.core.filter.impl.ExtensionReferenceFilter
-
- getFallbackStrategy() - Method in class com.norconex.collector.core.spoil.impl.GenericSpoiledReferenceStrategizer
-
- getField() - Method in class com.norconex.collector.core.filter.impl.RegexMetadataFilter
-
- getHost() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
- getId() - Method in class com.norconex.collector.core.AbstractCollector
-
- getId() - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Gets this collector unique identifier.
- getId() - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- getId() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
Gets this crawler unique identifier.
- getId() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets this crawler unique identifier.
- getId() - Method in interface com.norconex.collector.core.ICollector
-
- getId() - Method in interface com.norconex.collector.core.ICollectorConfig
-
Gets this collector unique identifier.
- getImporter() - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- getImporter() - Method in interface com.norconex.collector.core.crawler.ICrawler
-
Gets the crawler Importer module.
- getImporterConfig() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getImporterConfig() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the Importer module configuration.
- getImporterResponse() - Method in class com.norconex.collector.core.pipeline.importer.ImporterPipelineContext
-
- getInsertCrawlDataSQL(String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getInsertCrawlDataSQL(String) - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
Gets the SQL to insert a new
ICrawlData
in the given table.
- getInsertCrawlDataValues(String, ICrawlData) - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getInsertCrawlDataValues(String, ICrawlData) - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
- getJobErrorListeners() - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
- getJobErrorListeners() - Method in interface com.norconex.collector.core.ICollectorConfig
-
Gets JEF error listeners.
- getJobLifeCycleListeners() - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
- getJobLifeCycleListeners() - Method in interface com.norconex.collector.core.ICollectorConfig
-
Gets JEF job life cycle listeners.
- getJobSuite() - Method in class com.norconex.collector.core.AbstractCollector
-
Gets the job suite or null
if the the collector
was not yet started or is no longer running.
- getJobSuite() - Method in interface com.norconex.collector.core.ICollector
-
- getLogsDir() - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Gets the directory location of generated log files.
- getLogsDir() - Method in interface com.norconex.collector.core.ICollectorConfig
-
Gets the directory location of generated log files.
- getMaxDocuments() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getMaxDocuments() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the maximum number of documents that can be processed.
- getMaxParallelCrawlers() - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Gets the maximum number of crawlers that can be executed in parallel at
any given time.
- getMaxParallelCrawlers() - Method in interface com.norconex.collector.core.ICollectorConfig
-
Gets the maximum number of crawlers that can be executed in parallel at
any given time.
- getMechanism() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
Gets the authentication mechanism to use (MONGODB-CR
,
SCRAM-SHA-1
or null
to use default).
- getMetaChecksum() - Method in class com.norconex.collector.core.data.BaseCrawlData
-
- getMetaChecksum() - Method in interface com.norconex.collector.core.data.ICrawlData
-
- getMetadataFilters() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getMetadataFilters() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the metadata filters.
- getMVStoreConfig() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStoreFactory
-
- getNextQueued(MongoCollection<Document>) - Method in class com.norconex.collector.core.data.store.impl.mongo.BaseMongoSerializer
-
- getNextQueued(MongoCollection<Document>) - Method in interface com.norconex.collector.core.data.store.impl.mongo.IMongoSerializer
-
Gets the next queued DB document from the given collection.
- getNextQueuedCrawlDataSQL() - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getNextQueuedCrawlDataSQL() - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
Gets the SQL to obtain the next
ICrawlData
from the queue table.
- getNextQueuedCrawlDataValues() - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getNextQueuedCrawlDataValues() - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
- getNumThreads() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getNumThreads() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the number of threads (maximum) a crawler should use.
- getOrphansStrategy() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getOrphansStrategy() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the strategy to adopt when there are orphans.
- getPageSplitSize() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- getParentRootReference() - Method in class com.norconex.collector.core.data.BaseCrawlData
-
- getParentRootReference() - Method in interface com.norconex.collector.core.data.ICrawlData
-
- getPassword() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
- getPasswordKey() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
Gets the password encryption key.
- getPort() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
- getProcessed(String) - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Gets an already processed reference from the current crawl session.
- getProcessed(String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- getProcessed(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- getProcessed(String) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- getProcessedCount() - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Gets the number of references processed.
- getProcessedCount() - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- getProcessedCount() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- getProcessedCount() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- getProcessedURLCount() - Method in class com.norconex.collector.core.jmx.Monitoring
-
- getProcessedURLCount() - Method in interface com.norconex.collector.core.jmx.MonitoringMBean
-
- getProgressDir() - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Gets the directory location where progress files (from JEF API)
are stored.
- getProgressDir() - Method in interface com.norconex.collector.core.ICollectorConfig
-
Gets the directory location where progress files (from JEF API)
are stored.
- getQueueSize() - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Gets the size of the reference queue (number of
references left to process).
- getQueueSize() - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- getQueueSize() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- getQueueSize() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- getReference() - Method in class com.norconex.collector.core.data.BaseCrawlData
-
- getReference() - Method in interface com.norconex.collector.core.data.ICrawlData
-
Gets the unique identifier of this reference (e.g.
- getReferenceExistsSQL(String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getReferenceExistsSQL(String) - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
Gets the SQL to find if a
ICrawlData
exists in the given table.
- getReferenceExistsValues(String, String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getReferenceExistsValues(String, String) - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
- getReferenceFilters() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
Gets the reference filters
- getReferenceFilters() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the reference filters.
- getReferencesCollectionName() - Method in class com.norconex.collector.core.data.store.impl.mongo.AbstractMongoCrawlDataStoreFactory
-
Gets the references collection name.
- getReferencesCollectionName() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
Gets the references collection name.
- getReferencesCount(IMongoSerializer.Stage) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- getRegex() - Method in class com.norconex.collector.core.filter.impl.RegexMetadataFilter
-
- getRegex() - Method in class com.norconex.collector.core.filter.impl.RegexReferenceFilter
-
- getSafeDatabaseName(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
Gets a safe database name using MongoUtil, and treating a crawlerId as
the default.
- getSafeDBName(String, String) - Static method in class com.norconex.collector.core.data.store.impl.mongo.MongoUtil
-
Return or generate a DB name
If a valid dbName is provided, it is returned as is.
- getSelectCrawlDataSQL(String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCSerializer
-
- getSelectCrawlDataSQL(String) - Method in interface com.norconex.collector.core.data.store.impl.jdbc.IJDBCSerializer
-
Gets the SQL to obtain all
ICrawlData
entries in the given
table.
- getSourceFields() - Method in class com.norconex.collector.core.checksum.impl.GenericMetadataChecksummer
-
Gets the metadata fields used to construct a checksum.
- getSourceFields() - Method in class com.norconex.collector.core.checksum.impl.MD5DocumentChecksummer
-
Gets the fields used to construct a MD5 checksum.
- getSourceFieldsRegex() - Method in class com.norconex.collector.core.checksum.impl.GenericMetadataChecksummer
-
Gets the regular expression matching metadata fields used to construct
a checksum.
- getSourceFieldsRegex() - Method in class com.norconex.collector.core.checksum.impl.MD5DocumentChecksummer
-
Gets the regular expression matching metadata fields used to construct
a checksum.
- getSpoiledReferenceStrategizer() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getSpoiledReferenceStrategizer() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the spoiled state strategy resolver.
- getState() - Method in class com.norconex.collector.core.AbstractCollector
-
Gets the state of this collector.
- getState() - Method in class com.norconex.collector.core.data.BaseCrawlData
-
- getState() - Method in interface com.norconex.collector.core.data.ICrawlData
-
Gets this reference state.
- getStopOnExceptions() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getStopOnExceptions() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the exceptions we want to stop the crawler on.
- getStreamFactory() - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- getSubject() - Method in class com.norconex.collector.core.crawler.event.CrawlerEvent
-
Gets the subject of this event.
- getSuiteLifeCycleListeners() - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
- getSuiteLifeCycleListeners() - Method in interface com.norconex.collector.core.ICollectorConfig
-
Gets JEF job suite life cycle listeners.
- getTargetField() - Method in class com.norconex.collector.core.checksum.AbstractDocumentChecksummer
-
Gets the metadata field to use to store the checksum value.
- getTargetField() - Method in class com.norconex.collector.core.checksum.AbstractMetadataChecksummer
-
Gets the metadata field to use to store the checksum value.
- getURLQueueSize() - Method in class com.norconex.collector.core.jmx.Monitoring
-
- getURLQueueSize() - Method in interface com.norconex.collector.core.jmx.MonitoringMBean
-
- getUsername() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
- getWorkDir() - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- getWorkDir() - Method in interface com.norconex.collector.core.crawler.ICrawlerConfig
-
Gets the crawler working directory where many files created at
execution time are stored.
- ICollector - Interface in com.norconex.collector.core
-
- ICollectorConfig - Interface in com.norconex.collector.core
-
- ICollectorLifeCycleListener - Interface in com.norconex.collector.core
-
Listens to collector life-cycle events.
- ICrawlData - Interface in com.norconex.collector.core.data
-
A pointer that uniquely identifies a resource being processed (e.g.
- ICrawlDataStore - Interface in com.norconex.collector.core.data.store
-
Holds necessary information about all references (e.g.
- ICrawlDataStoreFactory - Interface in com.norconex.collector.core.data.store
-
Factory responsible for creating new crawl data stores.
- ICrawler - Interface in com.norconex.collector.core.crawler
-
A document crawler.
- ICrawlerConfig - Interface in com.norconex.collector.core.crawler
-
Crawler configuration.
- ICrawlerConfig.OrphansStrategy - Enum in com.norconex.collector.core.crawler
-
- ICrawlerEventListener - Interface in com.norconex.collector.core.crawler.event
-
Allows implementers to react to any crawler-specific events.
- IDocumentChecksummer - Interface in com.norconex.collector.core.checksum
-
Creates a checksum representing a a document.
- IDocumentFilter - Interface in com.norconex.collector.core.filter
-
Filter a document after the document content is fetched, downloaded,
or otherwise read or acquired.
- IJDBCSerializer - Interface in com.norconex.collector.core.data.store.impl.jdbc
-
Serializer holding necessary information to insert, load, delete and create
document reference information specific to each database tables.
- IMetadataChecksummer - Interface in com.norconex.collector.core.checksum
-
Creates a checksum representing a document based on document metadata
values obtained prior to fetching that document (e.g.
- IMetadataFilter - Interface in com.norconex.collector.core.filter
-
Filter a reference based on the metadata that could be obtained for a
document, before it was fetched, downloaded, or otherwise read or acquired
(e.g.
- IMongoSerializer - Interface in com.norconex.collector.core.data.store.impl.mongo
-
Mongo serializer.
- IMongoSerializer.Stage - Enum in com.norconex.collector.core.data.store.impl.mongo
-
- ImporterPipelineContext - Class in com.norconex.collector.core.pipeline.importer
-
- ImporterPipelineContext(ImporterPipelineContext) - Constructor for class com.norconex.collector.core.pipeline.importer.ImporterPipelineContext
-
Constructor creating a copy of supplied context.
- ImporterPipelineContext(ICrawler, ICrawlDataStore) - Constructor for class com.norconex.collector.core.pipeline.importer.ImporterPipelineContext
-
Constructor.
- ImporterPipelineContext(ICrawler, ICrawlDataStore, BaseCrawlData) - Constructor for class com.norconex.collector.core.pipeline.importer.ImporterPipelineContext
-
Constructor.
- ImporterPipelineContext(ICrawler, ICrawlDataStore, BaseCrawlData, BaseCrawlData, ImporterDocument) - Constructor for class com.norconex.collector.core.pipeline.importer.ImporterPipelineContext
-
- ImporterPipelineUtil - Class in com.norconex.collector.core.pipeline.importer
-
- ImportModuleStage - Class in com.norconex.collector.core.pipeline.importer
-
Common pipeline stage for importing documents.
- ImportModuleStage() - Constructor for class com.norconex.collector.core.pipeline.importer.ImportModuleStage
-
- initCrawlData(ICrawlData, ICrawlData, ImporterDocument) - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- IReferenceFilter - Interface in com.norconex.collector.core.filter
-
Filter a document based on its reference, before its properties or content
gets read or otherwise acquired.
- isActive(String) - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Whether the given reference is currently being processed (i.e.
- isActive(String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- isActive(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- isActive(String) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- isCacheEmpty() - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Whether there are any references the the cache from a previous crawler
run.
- isCacheEmpty() - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- isCacheEmpty() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- isCacheEmpty() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- isCaseSensitive() - Method in class com.norconex.collector.core.filter.impl.ExtensionReferenceFilter
-
- isCaseSensitive() - Method in class com.norconex.collector.core.filter.impl.RegexMetadataFilter
-
- isCaseSensitive() - Method in class com.norconex.collector.core.filter.impl.RegexReferenceFilter
-
- isCombineFieldsAndContent() - Method in class com.norconex.collector.core.checksum.impl.MD5DocumentChecksummer
-
Gets whether we are combining the fields and content checksums.
- isDelete() - Method in class com.norconex.collector.core.pipeline.importer.ImporterPipelineContext
-
Gets whether the document should be deleted.
- isDisabled() - Method in class com.norconex.collector.core.checksum.impl.GenericMetadataChecksummer
-
Whether this checksummer is disabled or not.
- isDisabled() - Method in class com.norconex.collector.core.checksum.impl.MD5DocumentChecksummer
-
Whether this checksummer is disabled or not.
- isGoodState() - Method in class com.norconex.collector.core.data.CrawlState
-
Returns whether a reference should be considered "good" (the
corresponding document is not in a "bad" state, such as being rejected
or produced an error.
- isHeadersRejected(ImporterPipelineContext) - Static method in class com.norconex.collector.core.pipeline.importer.ImporterPipelineUtil
-
- isKeep() - Method in class com.norconex.collector.core.checksum.AbstractDocumentChecksummer
-
Whether to keep the document checksum value as a new field in the
document metadata.
- isKeep() - Method in class com.norconex.collector.core.checksum.AbstractMetadataChecksummer
-
Whether to keep the metadata checksum value as a new metadata field.
- isLogsUnmanaged() - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Gets whether written logs are managed by the collector.
- isLogsUnmanaged() - Method in interface com.norconex.collector.core.ICollectorConfig
-
Gets whether written logs are managed by the collector.
- isMaxDocuments() - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- isNewOrModified() - Method in class com.norconex.collector.core.data.CrawlState
-
Returns whether a state indicates new or modified.
- isOneOf(CrawlState...) - Method in class com.norconex.collector.core.data.CrawlState
-
- isOrphan() - Method in class com.norconex.collector.core.pipeline.importer.ImporterPipelineContext
-
Gets whether the document is an orphan (no longer referenced).
- ISpoiledReferenceStrategizer - Interface in com.norconex.collector.core.spoil
-
Decides which strategy to adopt for a given reference with a bad state.
- isProcessed(String) - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Whether the given reference has been processed.
- isProcessed(String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- isProcessed(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- isProcessed(String) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- isQueued(String) - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Whether the given reference is in the queue or not
(waiting to be processed).
- isQueued(String) - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- isQueued(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- isQueued(String) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- isQueueEmpty() - Method in interface com.norconex.collector.core.data.store.ICrawlDataStore
-
Whether there are any references to process in the queue.
- isQueueEmpty() - Method in class com.norconex.collector.core.data.store.impl.jdbc.JDBCCrawlDataStore
-
- isQueueEmpty() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- isQueueEmpty() - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStore
-
- isRootParentReference() - Method in class com.norconex.collector.core.data.BaseCrawlData
-
- isRootParentReference() - Method in interface com.norconex.collector.core.data.ICrawlData
-
- isSkipped() - Method in class com.norconex.collector.core.data.CrawlState
-
- isSslEnabled() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
Gets whether to use SSL.
- isSslInvalidHostNameAllowed() - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
Gets whether invalid host names should be allowed if SSL is enabled.
- isStage(String, IMongoSerializer.Stage) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoCrawlDataStore
-
- isStopped() - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
Whether the crawler job was stopped.
- saveChecksummerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.collector.core.checksum.AbstractDocumentChecksummer
-
- saveChecksummerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.collector.core.checksum.AbstractMetadataChecksummer
-
- saveChecksummerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.collector.core.checksum.impl.GenericMetadataChecksummer
-
- saveChecksummerToXML(EnhancedXMLStreamWriter) - Method in class com.norconex.collector.core.checksum.impl.MD5DocumentChecksummer
-
- saveCollectorConfigToXML(Writer) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
- saveCrawlerConfigToXML(Writer) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- SaveDocumentStage - Class in com.norconex.collector.core.pipeline.importer
-
Common pipeline stage for saving documents.
- SaveDocumentStage() - Constructor for class com.norconex.collector.core.pipeline.importer.SaveDocumentStage
-
- saveToXML(Writer) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
- saveToXML(Writer) - Method in class com.norconex.collector.core.checksum.AbstractDocumentChecksummer
-
- saveToXML(Writer) - Method in class com.norconex.collector.core.checksum.AbstractMetadataChecksummer
-
- saveToXML(Writer) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- saveToXML(Writer) - Method in class com.norconex.collector.core.data.store.impl.jdbc.BasicJDBCCrawlDataStoreFactory
-
- saveToXML(Writer) - Method in class com.norconex.collector.core.data.store.impl.mongo.AbstractMongoCrawlDataStoreFactory
-
- saveToXML(Writer) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreCrawlDataStoreFactory
-
- saveToXML(Writer) - Method in class com.norconex.collector.core.filter.impl.ExtensionReferenceFilter
-
- saveToXML(Writer) - Method in class com.norconex.collector.core.filter.impl.RegexMetadataFilter
-
- saveToXML(Writer) - Method in class com.norconex.collector.core.filter.impl.RegexReferenceFilter
-
- saveToXML(Writer) - Method in class com.norconex.collector.core.spoil.impl.GenericSpoiledReferenceStrategizer
-
- setAutoCommitBufferSize(Integer) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- setAutoCommitDelay(Integer) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- setAutoCompactFillRate(Integer) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- setCacheConcurrency(Integer) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- setCachedCollectionName(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.AbstractMongoCrawlDataStoreFactory
-
Sets the cached collection name.
- setCachedCrawlData(BaseCrawlData) - Method in class com.norconex.collector.core.pipeline.DocumentPipelineContext
-
Sets cached crawl data.
- setCacheSize(Integer) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- setCaseSensitive(boolean) - Method in class com.norconex.collector.core.filter.impl.ExtensionReferenceFilter
-
- setCaseSensitive(boolean) - Method in class com.norconex.collector.core.filter.impl.RegexMetadataFilter
-
- setCaseSensitive(boolean) - Method in class com.norconex.collector.core.filter.impl.RegexReferenceFilter
-
- setCollectorListeners(ICollectorLifeCycleListener...) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Sets collector life cycle listeners.
- setCombineFieldsAndContent(boolean) - Method in class com.norconex.collector.core.checksum.impl.MD5DocumentChecksummer
-
Sets whether to combine the fields and content checksums.
- setCommitter(ICommitter) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- setCompress(Integer) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- setContentChecksum(String) - Method in class com.norconex.collector.core.data.BaseCrawlData
-
Sets the content checksum.
- setContentType(ContentType) - Method in class com.norconex.collector.core.data.BaseCrawlData
-
Sets the content type.
- setCrawlData(BaseCrawlData) - Method in class com.norconex.collector.core.pipeline.BasePipelineContext
-
Sets the current crawl data.
- setCrawlDataStoreFactory(ICrawlDataStoreFactory) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- setCrawlDate(Date) - Method in class com.norconex.collector.core.data.BaseCrawlData
-
Sets the crawl date.
- setCrawlerConfigs(ICrawlerConfig...) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Sets crawler configurations.
- setCrawlerListeners(ICrawlerEventListener...) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- setCrawlers(ICrawler[]) - Method in class com.norconex.collector.core.AbstractCollector
-
Add the provided crawlers to this collector.
- setDatabaseName(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
- setDelete(boolean) - Method in class com.norconex.collector.core.pipeline.importer.ImporterPipelineContext
-
Sets whether the document should be deleted.
- setDisabled(boolean) - Method in class com.norconex.collector.core.checksum.impl.GenericMetadataChecksummer
-
Sets whether this checksummer is disabled or not.
- setDisabled(boolean) - Method in class com.norconex.collector.core.checksum.impl.MD5DocumentChecksummer
-
Sets whether this checksummer is disabled or not.
- setDocument(ImporterDocument) - Method in class com.norconex.collector.core.pipeline.DocumentPipelineContext
-
Sets document.
- setDocumentChecksummer(IDocumentChecksummer) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- setDocumentFilters(IDocumentFilter...) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- setExtensions(String) - Method in class com.norconex.collector.core.filter.impl.ExtensionReferenceFilter
-
- setFallbackStrategy(SpoiledReferenceStrategy) - Method in class com.norconex.collector.core.spoil.impl.GenericSpoiledReferenceStrategizer
-
- setField(String) - Method in class com.norconex.collector.core.filter.impl.RegexMetadataFilter
-
- setHost(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
- setId(String) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Sets this collector unique identifier.
- setId(String) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
Sets this crawler unique identifier.
- setImporterConfig(ImporterConfig) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- setImporterResponse(ImporterResponse) - Method in class com.norconex.collector.core.pipeline.importer.ImporterPipelineContext
-
- setJobErrorListeners(IJobErrorListener...) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Sets JEF error listeners.
- setJobLifeCycleListeners(IJobLifeCycleListener...) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Sets JEF job life cycle listeners.
- setKeep(boolean) - Method in class com.norconex.collector.core.checksum.AbstractDocumentChecksummer
-
Sets whether to keep the document checksum value as a new field in the
document metadata.
- setKeep(boolean) - Method in class com.norconex.collector.core.checksum.AbstractMetadataChecksummer
-
Sets whether to keep the metadata checksum value as a new metadata field.
- setLogsDir(String) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Sets the directory location of generated log files.
- setLogsUnmanaged(boolean) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Sets whether written logs are managed by the collector.
- setMaxDocuments(int) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- setMaxParallelCrawlers(int) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Sets the maximum number of crawlers that can be executed in parallel at
any given time.
- setMechanism(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
Sets the authentication mechanism to use (MONGODB-CR
,
SCRAM-SHA-1
or null
to use default).
- setMetaChecksum(String) - Method in class com.norconex.collector.core.data.BaseCrawlData
-
- setMetadataFilters(IMetadataFilter...) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- setNumThreads(int) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- setOrphan(boolean) - Method in class com.norconex.collector.core.pipeline.importer.ImporterPipelineContext
-
Sets whether the document is an orphan (no longer referenced).
- setOrphansStrategy(ICrawlerConfig.OrphansStrategy) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- setPageSplitSize(Integer) - Method in class com.norconex.collector.core.data.store.impl.mvstore.MVStoreConfig
-
- setParentRootReference(String) - Method in class com.norconex.collector.core.data.BaseCrawlData
-
- setPassword(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
- setPasswordKey(EncryptionKey) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
Sets the password encryption key.
- setPort(int) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
- setProgressDir(String) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Sets the directory location where progress files (from JEF API)
are stored.
- setReference(String) - Method in class com.norconex.collector.core.data.BaseCrawlData
-
- setReferenceFilters(IReferenceFilter...) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
Sets the reference filters.
- setReferencesCollectionName(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.AbstractMongoCrawlDataStoreFactory
-
Sets the references collection name.
- setRegex(String) - Method in class com.norconex.collector.core.filter.impl.RegexMetadataFilter
-
- setRegex(String) - Method in class com.norconex.collector.core.filter.impl.RegexReferenceFilter
-
- setRootParentReference(boolean) - Method in class com.norconex.collector.core.data.BaseCrawlData
-
- setSourceFields(String...) - Method in class com.norconex.collector.core.checksum.impl.GenericMetadataChecksummer
-
Sets the metadata header fields used construct a checksum.
- setSourceFields(String...) - Method in class com.norconex.collector.core.checksum.impl.MD5DocumentChecksummer
-
Sets the fields used to construct a MD5 checksum.
- setSourceFieldsRegex(String) - Method in class com.norconex.collector.core.checksum.impl.GenericMetadataChecksummer
-
Sets the regular expression matching metadata fields used construct
a checksum.
- setSourceFieldsRegex(String) - Method in class com.norconex.collector.core.checksum.impl.MD5DocumentChecksummer
-
Sets the regular expression matching metadata fields used construct
a checksum.
- setSpoiledReferenceStrategizer(ISpoiledReferenceStrategizer) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- setSslEnabled(boolean) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
Sets whether to use SSL.
- setSslInvalidHostNameAllowed(boolean) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
Sets whether invalid host names should be allowed if SSL is enabled.
- setState(CrawlState) - Method in class com.norconex.collector.core.data.BaseCrawlData
-
- setStopOnExceptions(Class<? extends Exception>...) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
Sets the exceptions we want to stop the crawler on.
- setSuiteLifeCycleListeners(ISuiteLifeCycleListener...) - Method in class com.norconex.collector.core.AbstractCollectorConfig
-
Sets JEF job suite life cycle listeners.
- setTargetField(String) - Method in class com.norconex.collector.core.checksum.AbstractDocumentChecksummer
-
Sets the metadata field name to use to store the checksum value.
- setTargetField(String) - Method in class com.norconex.collector.core.checksum.AbstractMetadataChecksummer
-
Sets the metadata field name to use to store the checksum value.
- setUsername(String) - Method in class com.norconex.collector.core.data.store.impl.mongo.MongoConnectionDetails
-
- setWorkDir(File) - Method in class com.norconex.collector.core.crawler.AbstractCrawlerConfig
-
- SpoiledReferenceStrategy - Enum in com.norconex.collector.core.spoil
-
Markers indicating what to do with references that were once processed
properly, but failed to get a good processing state a subsequent time around.
- start(boolean) - Method in class com.norconex.collector.core.AbstractCollector
-
Start all crawlers defined in configuration.
- start(boolean) - Method in interface com.norconex.collector.core.ICollector
-
Launched all crawlers defined in configuration.
- startExecution(JobStatusUpdater, JobSuite) - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- stop() - Method in class com.norconex.collector.core.AbstractCollector
-
Stops a running instance of this Collector.
- stop(IJobStatus, JobSuite) - Method in class com.norconex.collector.core.crawler.AbstractCrawler
-
- stop() - Method in interface com.norconex.collector.core.ICollector
-
Stops a running instance of this Collector.