Norconex Collector Core

Open-Source Collector Core Library

Documentation Download 1.10.0

Core Collector library

The Norconex Collector Core is a library containing features and reusable code shared between all collectors. It is not an executable application.

If you are looking for a web crawler, filesystem crawler or else, check out Norconex HTTP Collector or Norconex Filesystem Collector.

Use this library if you are interested to build a new collector, or if you want to replace the Collector Core version you have with your downloaded Collector zip file with a more recent.

Refer to the API documentation for many of the features that are shared features between all collectors.

Latest news

Norconex HTTP Collector 3.0.0 snapshots available
2020-09-07
Development builds of upcoming version 3 now available to experiment with. More...

opensource.norconex.com
2020-09-07
All Norconex open-source projects are now grouped under the same domain. More...

Norconex HTTP and FileSystem Collectors 2.9.0 released
2019-12-22
New URL normalization rules, support for CMIS protocol, ACL extraction from more sources, etc. More...

Norconex Collector Core 1.10.0 released
2019-12-22
Unmanaged logs, max parallel crawlers, etc. More...

Norconex Importer 2.10.0 released
2019-12-22
New FieldReportTagger, Tika upgrade, fixes, etc. More...