Uses of Interface
com.norconex.collector.http.link.ILinkExtractor
Packages that use ILinkExtractor:
    com.norconex.collector.http.crawler
    com.norconex.collector.http.link
    com.norconex.collector.http.link.impl
Uses of ILinkExtractor in com.norconex.collector.http.crawler
Methods in com.norconex.collector.http.crawler that return types with arguments of type ILinkExtractor:

List<ILinkExtractor> HttpCrawlerConfig.getLinkExtractors()
    Gets link extractors.

Methods in com.norconex.collector.http.crawler with parameters of type ILinkExtractor:

void HttpCrawlerConfig.setLinkExtractors(ILinkExtractor... linkExtractors)
    Sets link extractors.

Method parameters in com.norconex.collector.http.crawler with type arguments of type ILinkExtractor:

void HttpCrawlerConfig.setLinkExtractors(List<ILinkExtractor> linkExtractors)
    Sets link extractors.
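
The getter and the two setter overloads listed above could be exercised roughly as in the sketch below. So that it compiles without the Norconex library on the classpath, it declares minimal stand-ins for ILinkExtractor and HttpCrawlerConfig. These stand-ins are assumptions that mirror only the three signatures shown on this page. The real interface and class have further members, and their replace-versus-append behavior is not documented here, so the "replace" semantics below are illustrative only. LinkExtractorConfigDemo is a hypothetical name.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Stand-in (assumption) for com.norconex.collector.http.link.ILinkExtractor.
// The real interface declares extraction methods that are not shown here.
interface ILinkExtractor {}

// Stand-in (assumption) mirroring only the three HttpCrawlerConfig
// signatures listed on this page.
class HttpCrawlerConfig {
    private final List<ILinkExtractor> linkExtractors = new ArrayList<>();

    // Returns a read-only view of the configured extractors.
    List<ILinkExtractor> getLinkExtractors() {
        return Collections.unmodifiableList(linkExtractors);
    }

    // Varargs overload: delegates to the List overload.
    void setLinkExtractors(ILinkExtractor... linkExtractors) {
        List<ILinkExtractor> asList = Arrays.asList(linkExtractors);
        setLinkExtractors(asList);
    }

    // List overload: replaces the current extractors (assumed semantics).
    void setLinkExtractors(List<ILinkExtractor> linkExtractors) {
        // Copy first so passing in our own read-only view is safe.
        List<ILinkExtractor> copy = new ArrayList<>(linkExtractors);
        this.linkExtractors.clear();
        this.linkExtractors.addAll(copy);
    }
}

// Hypothetical demo class showing both setter overloads and the getter.
class LinkExtractorConfigDemo {
    public static void main(String[] args) {
        HttpCrawlerConfig config = new HttpCrawlerConfig();
        ILinkExtractor first = new ILinkExtractor() {};
        ILinkExtractor second = new ILinkExtractor() {};

        // Varargs form: pass extractors directly.
        config.setLinkExtractors(first, second);
        System.out.println(config.getLinkExtractors().size()); // prints 2

        // List form: pass a prepared list.
        config.setLinkExtractors(Collections.singletonList(second));
        System.out.println(config.getLinkExtractors().size()); // prints 1
    }
}
```

With the actual library, the anonymous stand-in instances would be replaced by one of the implementing classes listed on this page, such as HtmlLinkExtractor or DOMLinkExtractor.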
Uses of ILinkExtractor in com.norconex.collector.http.link
Classes in com.norconex.collector.http.link that implement ILinkExtractor:

class AbstractLinkExtractor
    Base class for link extraction providing common configuration settings.

class AbstractTextLinkExtractor
    Base class for link extraction from text documents, providing common configuration settings such as being able to apply extraction to specific documents only, and being able to specify one or more metadata fields from which to grab the text for extracting links.
Uses of ILinkExtractor in com.norconex.collector.http.link.impl
Classes in com.norconex.collector.http.link.impl that implement ILinkExtractor:

class DOMLinkExtractor
    Extracts links from a Document Object Model (DOM) representation of an HTML, XHTML, or XML document content based on values of matching elements and attributes.

class GenericLinkExtractor
    Deprecated. Since 3.0.0, use HtmlLinkExtractor or DOMLinkExtractor instead.

class HtmlLinkExtractor
    HTML link extractor for URLs found in HTML and possibly other text files.

class RegexLinkExtractor
    Link extractor using regular expressions to extract links found in text documents.

class TikaLinkExtractor
    Implementation of ILinkExtractor using Apache Tika to perform URL extractions from HTML documents.

class XMLFeedLinkExtractor