Knowing more about the sequence of events taking place can help you
better configure your crawling solution.
The following flowchart details how each URL encountered is processed.
While it does not cover all available features, it will give
you a better idea of what's going on under the hood.
Click on a shape to get additional information.
What about deletions?
The diagram covers "upserts" only. Your Committer can also receive deletion
requests. The conditions triggering deletions are many and are greatly
influenced by configuration options. A few examples that may
apply:
"Orphan" pages detected.
"Not Found" pages detected.
Pages generating errors.
Selected crawler events.
New filtering rules (e.g., robots.txt, custom, etc.).