Interface IRecrawlableResolver

  • All Known Implementing Classes:
    GenericRecrawlableResolver

    public interface IRecrawlableResolver
    Indicates whether a document that was successfully crawled in a previous crawling session should be recrawled. Documents not ready to be recrawled are not downloaded again (no HTTP calls are made) and are not committed.
    Since:
    2.5.0
    Author:
    Pascal Essiembre
    • Method Detail

      • isRecrawlable

        boolean isRecrawlable(HttpDocInfo prevDocInfo)
        Whether a document is recrawlable or not.
        Parameters:
        prevDocInfo - data about the previously crawled document (if any)
        Returns:
        true if recrawlable
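
        Example:
        The contract above can be illustrated with a minimal sketch of a resolver that only allows a recrawl once a minimum amount of time has passed since the previous crawl. To keep it self-contained, HttpDocInfo and IRecrawlableResolver are redeclared here as local stand-ins; the getCrawlDate() accessor, the MinAgeResolver class, and the choice to treat missing previous data as recrawlable are assumptions made for illustration, not the library's documented behavior.

```java
import java.time.Clock;
import java.time.Duration;
import java.time.Instant;
import java.time.ZoneOffset;

class RecrawlSketch {

    /** Hypothetical stand-in for the collector's HttpDocInfo (assumed fields). */
    static final class HttpDocInfo {
        private final String reference;
        private final Instant crawlDate;

        HttpDocInfo(String reference, Instant crawlDate) {
            this.reference = reference;
            this.crawlDate = crawlDate;
        }

        String getReference() { return reference; }

        // Assumed accessor: when the document was last crawled.
        Instant getCrawlDate() { return crawlDate; }
    }

    /** Local copy of the interface shape documented on this page. */
    interface IRecrawlableResolver {
        boolean isRecrawlable(HttpDocInfo prevDocInfo);
    }

    /** Hypothetical resolver: recrawl only if the previous crawl is old enough. */
    static final class MinAgeResolver implements IRecrawlableResolver {
        private final Duration minAge;
        private final Clock clock; // injected so results are reproducible

        MinAgeResolver(Duration minAge, Clock clock) {
            this.minAge = minAge;
            this.clock = clock;
        }

        @Override
        public boolean isRecrawlable(HttpDocInfo prevDocInfo) {
            // Assumption: with no previous crawl data, allow the crawl.
            if (prevDocInfo == null || prevDocInfo.getCrawlDate() == null) {
                return true;
            }
            Duration age = Duration.between(
                    prevDocInfo.getCrawlDate(), clock.instant());
            // Recrawlable once at least minAge has elapsed.
            return age.compareTo(minAge) >= 0;
        }
    }

    public static void main(String[] args) {
        Clock clock = Clock.fixed(
                Instant.parse("2024-01-10T00:00:00Z"), ZoneOffset.UTC);
        IRecrawlableResolver resolver =
                new MinAgeResolver(Duration.ofDays(7), clock);

        HttpDocInfo old = new HttpDocInfo(
                "https://example.com/old", Instant.parse("2024-01-01T00:00:00Z"));
        HttpDocInfo recent = new HttpDocInfo(
                "https://example.com/recent", Instant.parse("2024-01-08T00:00:00Z"));

        System.out.println(old.getReference() + " -> "
                + resolver.isRecrawlable(old));    // 9 days old: true
        System.out.println(recent.getReference() + " -> "
                + resolver.isRecrawlable(recent)); // 2 days old: false
    }
}
```

        Injecting a Clock rather than reading the system time directly keeps the decision deterministic, which makes a resolver like this straightforward to unit test.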