Interface IRecrawlableResolver

All Known Implementing Classes:
GenericRecrawlableResolver

public interface IRecrawlableResolver
Indicates whether a document that was successfully crawled in a previous crawling session should be recrawled. Documents not ready to be recrawled are not downloaded again (no HTTP calls are made) and are not committed.
Since:
2.5.0
Author:
Pascal Essiembre
  • Method Summary

    Modifier and Type    Method                                    Description
    boolean              isRecrawlable(HttpDocInfo prevDocInfo)    Whether a document is recrawlable or not.
  • Method Details

    • isRecrawlable

      boolean isRecrawlable(HttpDocInfo prevDocInfo)
      Whether a document is recrawlable or not.
      Parameters:
      prevDocInfo - data about the previously crawled document (if any)
      Returns:
      true if recrawlable
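
As an illustration of the contract described above, here is a minimal sketch of a custom resolver that allows a recrawl only once a document's previous crawl is older than a minimum age. It rests on several assumptions: the `HttpDocInfo` class and the `IRecrawlableResolver` interface below are simplified stand-ins declared locally so the example compiles on its own, the `getCrawlDate()` accessor is hypothetical, and `MinAgeRecrawlableResolver` is an invented name rather than a class shipped with the library. Treating a missing `prevDocInfo` as recrawlable is also an assumption, based on the "(if any)" wording of the parameter description.

```java
import java.time.Clock;
import java.time.Duration;
import java.time.Instant;

// Stand-in for the library's HttpDocInfo (assumption: only a crawl
// timestamp is modeled, and getCrawlDate() is a hypothetical accessor).
final class HttpDocInfo {
    private final Instant crawlDate;

    HttpDocInfo(Instant crawlDate) {
        this.crawlDate = crawlDate;
    }

    Instant getCrawlDate() {
        return crawlDate;
    }
}

// Local copy of the single-method interface documented above.
interface IRecrawlableResolver {
    boolean isRecrawlable(HttpDocInfo prevDocInfo);
}

// Hypothetical resolver: a document is recrawlable once its previous
// crawl is at least minAge old.
final class MinAgeRecrawlableResolver implements IRecrawlableResolver {
    private final Duration minAge;
    private final Clock clock;

    MinAgeRecrawlableResolver(Duration minAge, Clock clock) {
        this.minAge = minAge;
        this.clock = clock;
    }

    @Override
    public boolean isRecrawlable(HttpDocInfo prevDocInfo) {
        // Assumption: with no previous crawl data there is nothing to
        // compare against, so the document is allowed to be crawled.
        if (prevDocInfo == null || prevDocInfo.getCrawlDate() == null) {
            return true;
        }
        Duration age = Duration.between(prevDocInfo.getCrawlDate(), clock.instant());
        // Recrawl only when the previous crawl is old enough.
        return age.compareTo(minAge) >= 0;
    }
}
```

Passing a `Clock` into the constructor keeps the age check deterministic in tests; in a running crawler, `Clock.systemUTC()` would be the natural choice.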