public interface IDocumentChecksummer
Two ImporterDocument instances can hold different values but be deemed logically the same.
Such documents do not have to be equal, but they should return the same checksum.
An example of this is two different URLs pointing to the same document, where only a
single instance should be kept.
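To illustrate the "different values, same checksum" idea, here is a minimal sketch. It does not use the actual Norconex classes: `SketchDocument` is a hypothetical stand-in for ImporterDocument, and hashing only the content with MD5 is an assumption chosen for illustration, not necessarily what any shipped implementation does.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

final class ContentChecksumSketch {

    /** Hypothetical stand-in for ImporterDocument: a reference (URL) plus its content. */
    static final class SketchDocument {
        final String reference;
        final String content;

        SketchDocument(String reference, String content) {
            this.reference = reference;
            this.content = content;
        }
    }

    /** Hashes only the content, so the reference has no effect on the checksum. */
    static String createDocumentChecksum(SketchDocument document) {
        try {
            MessageDigest md5 = MessageDigest.getInstance("MD5");
            byte[] digest = md5.digest(document.content.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder();
            for (byte b : digest) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("MD5 is not available", e);
        }
    }

    public static void main(String[] args) {
        // Two different URLs serving the same content (made-up example values).
        SketchDocument a = new SketchDocument("http://example.com/page", "Same body");
        SketchDocument b = new SketchDocument("http://example.com/page?ref=home", "Same body");
        // The documents differ, yet their checksums match.
        System.out.println(createDocumentChecksum(a).equals(createDocumentChecksum(b))); // prints "true"
    }
}
```

A caller that sees a checksum it has already encountered could then skip the second document and keep a single instance.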
Implementations that also implement IXMLConfigurable should offer the following
XML configuration usage:
<documentChecksummer class="(class)" keep="[false|true]" targetField="(optional metadata field to store the checksum)" />
targetField is ignored unless the keep attribute is set to true.
See also: AbstractDocumentChecksummer
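The interaction between keep and targetField can be sketched as follows. This is an illustration under stated assumptions, not the library's code: the `storeChecksum` helper, the plain `Map` used as metadata, and the `DEFAULT_FIELD` name are all hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

final class KeepChecksumSketch {

    /** Hypothetical fallback field name, used when keep is true but no targetField is given. */
    static final String DEFAULT_FIELD = "sketch.checksum";

    /**
     * Stores the checksum in the metadata only when keep is true.
     * When keep is false, targetField is ignored and nothing is stored.
     */
    static void storeChecksum(
            Map<String, String> metadata, String checksum, boolean keep, String targetField) {
        if (!keep) {
            return; // targetField has no effect here
        }
        String field = (targetField == null || targetField.isEmpty())
                ? DEFAULT_FIELD
                : targetField;
        metadata.put(field, checksum);
    }

    public static void main(String[] args) {
        Map<String, String> metadata = new HashMap<>();
        storeChecksum(metadata, "abc123", false, "myChecksum");
        System.out.println(metadata.containsKey("myChecksum")); // prints "false"
        storeChecksum(metadata, "abc123", true, "myChecksum");
        System.out.println(metadata.get("myChecksum")); // prints "abc123"
    }
}
```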
Modifier and Type | Method and Description |
---|---|
String | createDocumentChecksum(ImporterDocument document): Creates a document checksum. |
String createDocumentChecksum(ImporterDocument document)
document - an HTTP document

Copyright © 2014–2021 Norconex Inc. All rights reserved.