ApacheHttpUtil |
Utility methods for fetcher implementations using Apache HttpClient.
|
ApacheRedirectCaptureStrategy |
This class is used by each crawler instance to capture the closest
redirect target whether it is part of a redirect chain or not.
|
DocImageHandler |
Handles images associated with a document (which is different than a document
being itself an image).
|
GenericRedirectURLProvider |
Provide redirect URLs by grabbing them from the HTTP Response
Location header value.
|
HstsResolver |
Class handling HSTS support for servers supporting it.
|
TrustAllX509TrustManager |
A very unsafe trust manager accepting ALL certificates.
|