Uses of Interface
com.norconex.collector.http.link.ILinkExtractor
-
Packages that use ILinkExtractor Package Description com.norconex.collector.http.crawler com.norconex.collector.http.link com.norconex.collector.http.link.impl -
-
Uses of ILinkExtractor in com.norconex.collector.http.crawler
Methods in com.norconex.collector.http.crawler that return types with arguments of type ILinkExtractor Modifier and Type Method Description List<ILinkExtractor>
HttpCrawlerConfig. getLinkExtractors()
Gets link extractors.Methods in com.norconex.collector.http.crawler with parameters of type ILinkExtractor Modifier and Type Method Description void
HttpCrawlerConfig. setLinkExtractors(ILinkExtractor... linkExtractors)
Sets link extractors.Method parameters in com.norconex.collector.http.crawler with type arguments of type ILinkExtractor Modifier and Type Method Description void
HttpCrawlerConfig. setLinkExtractors(List<ILinkExtractor> linkExtractors)
Sets link extractors. -
Uses of ILinkExtractor in com.norconex.collector.http.link
Classes in com.norconex.collector.http.link that implement ILinkExtractor Modifier and Type Class Description class
AbstractLinkExtractor
Base class for link extraction providing common configuration settings.class
AbstractTextLinkExtractor
Base class for link extraction from text documents, providing common configuration settings such as being able to apply extraction to specific documents only, and being able to specify one or more metadata fields from which to grab the text for extracting links. -
Uses of ILinkExtractor in com.norconex.collector.http.link.impl
Classes in com.norconex.collector.http.link.impl that implement ILinkExtractor Modifier and Type Class Description class
DOMLinkExtractor
Extracts links from a Document Object Model (DOM) representation of an HTML, XHTML, or XML document content based on values of matching elements and attributes.class
GenericLinkExtractor
Deprecated.Since 3.0.0, useHtmlLinkExtractor
orDOMLinkExtractor
instead.class
HtmlLinkExtractor
Html link extractor for URLs found in HTML and possibly other text files.class
RegexLinkExtractor
Link extractor using regular expressions to extract links found in text documents.class
TikaLinkExtractor
Implementation ofILinkExtractor
using Apache Tika to perform URL extractions from HTML documents.class
XMLFeedLinkExtractor
-