Package | Description |
---|---|
com.norconex.collector.core.crawler | |
com.norconex.collector.core.doc | |
com.norconex.collector.core.pipeline |
Modifier and Type | Method and Description |
---|---|
protected abstract CrawlDocInfo |
Crawler.createChildDocInfo(String embeddedReference,
CrawlDocInfo parentCrawlRef) |
CrawlDocInfo |
CrawlerEvent.getCrawlDocInfo()
Gets the crawl data holding contextual information about the
crawled reference.
|
Modifier and Type | Method and Description |
---|---|
protected Class<? extends CrawlDocInfo> |
Crawler.getCrawlDocInfoType() |
Modifier and Type | Method and Description |
---|---|
CrawlerEvent.Builder |
CrawlerEvent.Builder.crawlDocInfo(CrawlDocInfo crawlDocInfo) |
protected abstract CrawlDocInfo |
Crawler.createChildDocInfo(String embeddedReference,
CrawlDocInfo parentCrawlRef) |
protected abstract void |
Crawler.executeQueuePipeline(CrawlDocInfo ref) |
protected abstract void |
Crawler.markReferenceVariationsAsProcessed(CrawlDocInfo crawlRef) |
Modifier and Type | Method and Description |
---|---|
CrawlDocInfo |
CrawlDoc.getCachedDocInfo() |
CrawlDocInfo |
CrawlDoc.getDocInfo() |
Modifier and Type | Method and Description |
---|---|
Optional<CrawlDocInfo> |
CrawlDocInfoService.getCached(String id) |
Optional<CrawlDocInfo> |
CrawlDocInfoService.getProcessed(String id) |
Optional<CrawlDocInfo> |
CrawlDocInfoService.pollQueue() |
Modifier and Type | Method and Description |
---|---|
void |
CrawlDocInfoService.processed(CrawlDocInfo docInfo) |
void |
CrawlDocInfoService.queue(CrawlDocInfo docInfo) |
Modifier and Type | Method and Description |
---|---|
boolean |
CrawlDocInfoService.forEachActive(BiPredicate<String,CrawlDocInfo> predicate) |
boolean |
CrawlDocInfoService.forEachCached(BiPredicate<String,CrawlDocInfo> predicate) |
boolean |
CrawlDocInfoService.forEachProcessed(BiPredicate<String,CrawlDocInfo> predicate) |
boolean |
CrawlDocInfoService.forEachQueued(BiPredicate<String,CrawlDocInfo> predicate) |
Constructor and Description |
---|
CrawlDoc(DocInfo docInfo,
CrawlDocInfo cachedDocInfo,
CachedInputStream content) |
CrawlDoc(DocInfo docInfo,
CrawlDocInfo cachedDocInfo,
CachedInputStream content,
boolean orphan) |
Constructor and Description |
---|
CrawlDocInfoService(Crawler crawler,
Class<? extends CrawlDocInfo> type) |
Modifier and Type | Method and Description |
---|---|
CrawlDocInfo |
DocumentPipelineContext.getCachedDocInfo()
Gets cached crawl data.
|
CrawlDocInfo |
DocInfoPipelineContext.getDocInfo() |
CrawlDocInfo |
DocumentPipelineContext.getDocInfo() |
Constructor and Description |
---|
DocInfoPipelineContext(Crawler crawler,
CrawlDocInfo docInfo)
Constructor.
|
Copyright © 2014–2023 Norconex Inc.. All rights reserved.