Uses of Interface
com.norconex.collector.http.link.ILinkExtractor
Packages that use ILinkExtractor
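Package | Description
com.norconex.collector.http.crawler |
com.norconex.collector.http.link |
com.norconex.collector.http.link.impl |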
Uses of ILinkExtractor in com.norconex.collector.http.crawler
Methods in com.norconex.collector.http.crawler that return types with arguments of type ILinkExtractor

Methods in com.norconex.collector.http.crawler with parameters of type ILinkExtractor
Modifier and Type | Method | Description
void | HttpCrawlerConfig.setLinkExtractors(ILinkExtractor... linkExtractors) | Sets link extractors.

Method parameters in com.norconex.collector.http.crawler with type arguments of type ILinkExtractor
Modifier and Type | Method | Description
void | HttpCrawlerConfig.setLinkExtractors(List<ILinkExtractor> linkExtractors) | Sets link extractors.
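The two setLinkExtractors overloads listed above (varargs and List) can be illustrated with a minimal, self-contained sketch. Everything in it is a hypothetical stand-in: LinkExtractorStandIn, NamedExtractor, CrawlerConfigSketch and the getLinkExtractors accessor are assumptions made for illustration, not the real ILinkExtractor or HttpCrawlerConfig, whose other members are not shown on this page. The sketch also assumes that each call replaces the previously set extractors.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for ILinkExtractor; the real interface's
// methods are not listed on this page.
interface LinkExtractorStandIn {
    String name();
}

// Hypothetical extractor that only carries a name for the demo.
class NamedExtractor implements LinkExtractorStandIn {
    private final String name;

    NamedExtractor(String name) {
        this.name = name;
    }

    @Override
    public String name() {
        return name;
    }
}

// Hypothetical stand-in mirroring only the two overloads listed above.
class CrawlerConfigSketch {
    private final List<LinkExtractorStandIn> linkExtractors = new ArrayList<>();

    // Varargs overload: wraps the array and delegates to the List overload.
    void setLinkExtractors(LinkExtractorStandIn... extractors) {
        setLinkExtractors(Arrays.asList(extractors));
    }

    // List overload: assumed to replace whatever was set before.
    void setLinkExtractors(List<LinkExtractorStandIn> extractors) {
        linkExtractors.clear();
        linkExtractors.addAll(extractors);
    }

    // Assumed accessor, added so the demo can inspect the result.
    List<LinkExtractorStandIn> getLinkExtractors() {
        return Collections.unmodifiableList(linkExtractors);
    }
}

class SetLinkExtractorsDemo {
    // Collects the names of the configured extractors, in order.
    static List<String> names(CrawlerConfigSketch config) {
        List<String> result = new ArrayList<>();
        for (LinkExtractorStandIn extractor : config.getLinkExtractors()) {
            result.add(extractor.name());
        }
        return result;
    }

    public static void main(String[] args) {
        CrawlerConfigSketch config = new CrawlerConfigSketch();

        // Varargs form: pass extractors directly.
        config.setLinkExtractors(new NamedExtractor("html"), new NamedExtractor("dom"));
        System.out.println(names(config));

        // List form: pass a prepared list.
        List<LinkExtractorStandIn> list = new ArrayList<>();
        list.add(new NamedExtractor("regex"));
        config.setLinkExtractors(list);
        System.out.println(names(config));
    }
}
```

Having the varargs overload delegate to the List overload keeps the replacement logic in one place, which is one plausible reason an API would offer both forms.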
Uses of ILinkExtractor in com.norconex.collector.http.link
Classes in com.norconex.collector.http.link that implement ILinkExtractor
Modifier and Type | Class | Description
class | | Base class for link extraction providing common configuration settings.
class | | Base class for link extraction from text documents, providing common configuration settings such as being able to apply extraction to specific documents only, and being able to specify one or more metadata fields from which to grab the text for extracting links.
Uses of ILinkExtractor in com.norconex.collector.http.link.impl
Classes in com.norconex.collector.http.link.impl that implement ILinkExtractor
Modifier and Type | Class | Description
class | | Extracts links from a Document Object Model (DOM) representation of an HTML, XHTML, or XML document content based on values of matching elements and attributes.
class | | Deprecated.
class | | Html link extractor for URLs found in HTML and possibly other text files.
class | | Link extractor using regular expressions to extract links found in text documents.
class | | Implementation of ILinkExtractor using Apache Tika to perform URL extractions from HTML documents.
class | |
HtmlLinkExtractor or DOMLinkExtractor instead.
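
As a rough illustration of the idea behind a link extractor that uses regular expressions on text documents, the sketch below pulls absolute http(s) URLs out of a string with java.util.regex. It is not the implementation from com.norconex.collector.http.link.impl: the class name RegexLinkSketch, the extractLinks method and the pattern are all assumptions chosen for brevity, and the pattern is deliberately naive (for example, it would keep trailing punctuation attached to a URL).

```java
import java.util.LinkedHashSet;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch; not the library's regex-based extractor.
class RegexLinkSketch {
    // Naive pattern: "http://" or "https://" followed by any run of
    // characters that are not whitespace, quotes, or angle brackets.
    private static final Pattern URL_PATTERN =
            Pattern.compile("https?://[^\\s\"'<>]+");

    // Returns the distinct URLs found in the text, in order of first appearance.
    static Set<String> extractLinks(String text) {
        Set<String> links = new LinkedHashSet<>();
        Matcher matcher = URL_PATTERN.matcher(text);
        while (matcher.find()) {
            links.add(matcher.group());
        }
        return links;
    }

    public static void main(String[] args) {
        String text = "See https://example.com/a and "
                + "<a href=\"http://example.org/b\">b</a> "
                + "plus https://example.com/a again";
        // Duplicates collapse because the result is a set.
        System.out.println(extractLinks(text));
    }
}
```

A LinkedHashSet is used here so that duplicates are dropped while the order of discovery is preserved.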