Class ReferenceFilter

  • All Implemented Interfaces:
    IXMLConfigurable, IDocumentFilter, IOnMatchFilter, IImporterHandler

    public class ReferenceFilter
    extends AbstractDocumentFilter

    Accepts or rejects a document based on its reference (e.g. URL).

    Can be used both as a pre-parse or post-parse handler.

    XML configuration usage:

    
    <handler
        class="com.norconex.importer.handler.filter.impl.ReferenceFilter"
        onMatch="[include|exclude]">
      <!-- multiple "restrictTo" tags allowed (only one needs to match) -->
      <restrictTo>
        <fieldMatcher>(field-matching expression)</fieldMatcher>
        <valueMatcher>(value-matching expression)</valueMatcher>
      </restrictTo>
      <valueMatcher>(expression of reference value to match)</valueMatcher>
    </handler>

    XML usage example:

    
    <handler
        class="ReferenceFilter"
        onMatch="exclude">
      <valueMatcher
          method="regex">
        .*&#47;login/.*
      </valueMatcher>
    </handler>

    The above eample reject documents having "/login/" in their reference.

    Since:
    3.0.0
    Author:
    Pascal Essiembre