Class GenericLinkExtractor
java.lang.Object
com.norconex.collector.http.link.AbstractLinkExtractor
com.norconex.collector.http.link.AbstractTextLinkExtractor
com.norconex.collector.http.link.impl.HtmlLinkExtractor
com.norconex.collector.http.link.impl.GenericLinkExtractor
- All Implemented Interfaces:
ILinkExtractor, IXMLConfigurable
Deprecated.
- Since:
- 2.3.0
- Author:
- Pascal Essiembre
-
Nested Class Summary
Nested classes/interfaces inherited from class com.norconex.collector.http.link.impl.HtmlLinkExtractor
HtmlLinkExtractor.RegexPair
-
Field Summary
Fields inherited from class com.norconex.collector.http.link.impl.HtmlLinkExtractor
DEFAULT_MAX_URL_LENGTH, MAX_BUFFER_SIZE, OVERLAP_SIZE
-
Constructor Summary
Constructors
-
Method Summary
Methods inherited from class com.norconex.collector.http.link.impl.HtmlLinkExtractor
addExtractBetween, addExtractSelectors, addExtractSelectors, addLinkTag, addNoExtractBetween, addNoExtractSelectors, addNoExtractSelectors, clearLinkTags, equals, extractTextLinks, getCharset, getExtractBetweens, getExtractSelectors, getMaxURLLength, getNoExtractBetweens, getNoExtractSelectors, getSchemes, hashCode, isCommentsEnabled, isIgnoreLinkData, isIgnoreNofollow, loadTextLinkExtractorFromXML, removeLinkTag, saveTextLinkExtractorToXML, setCharset, setCommentsEnabled, setExtractBetweens, setExtractBetweens, setExtractSelectors, setExtractSelectors, setIgnoreLinkData, setIgnoreNofollow, setMaxURLLength, setNoExtractBetweens, setNoExtractBetweens, setNoExtractSelectors, setNoExtractSelectors, setSchemes, setSchemes, toString
Methods inherited from class com.norconex.collector.http.link.AbstractTextLinkExtractor
extractLinks, getFieldMatcher, loadLinkExtractorFromXML, saveLinkExtractorToXML, setFieldMatcher
Methods inherited from class com.norconex.collector.http.link.AbstractLinkExtractor
addRestriction, addRestrictions, clearRestrictions, extractLinks, getRestrictions, loadFromXML, removeRestriction, removeRestriction, saveToXML, setRestrictions
-
Constructor Details
-
GenericLinkExtractor
public GenericLinkExtractor()
Deprecated. Use HtmlLinkExtractor or DOMLinkExtractor instead.