Norconex GSA Committer

Configuration

When used with a Norconex Collector, you can use the following XML to configure Google Search Appliance as the <committer> section of your Norconex Collector configuration:

<committer class="com.norconex.committer.gsa.GsaCommitter">
    <feedUrl>...</feedUrl>
    <sourceReferenceField keep="[false|true]">...</sourceReferenceField>
    <sourceContentField keep="[false|true]">...</sourceContentField>
    <targetReferenceField>...</targetReferenceField>
    <targetContentField>...</targetContentField>
    <queueDir>...</queueDir>
    <queueSize>...</queueSize>
    <commitBatchSize>...</commitBatchSize>
    <maxRetries>...</maxRetries>
    <maxRetryWait>...</maxRetryWait>        
</committer>

Tag descriptions:

Tag Description
feedUrl GSA feed URL.
sourceReferenceField Name of source field that will be mapped to the GSA id field. Default is the document reference the Committer stores as document.reference. The metadata source field is deleted, unless keep is set to true
sourceContentField Source field name for a document content/body. Default is not a field, but rather the document body content. Once re-mapped, the metadata source field is deleted, unless keep is set to true.
targetReferenceField Target field name for the document reference.
targetContentField Target field name for a document content/body. Default is content.
queueDir Optional path where to queue files before sending them to GSA. Default is ./committer-queue.
queueSize Optional maximum queue size before sending document to GSA. Default is 1000.
commitBatchSize Optional maximum of documents to send to GSA at once. Default is 100.
maxRetries Maximum retries upon commit failures. Default is 0 (no retry).
maxRetryWait Maximum delay (millisecond) between retries. Default is 0 (no delay).