Class HttpCollector


  • public class HttpCollector
    extends Collector
    Main application class. Instances of this class can hold several crawler, running at once. This is convenient when there are configuration setting to be shared amongst crawlers. When you have many crawler jobs defined that have nothing in common, it may be best to configure and run them separately, to facilitate troubleshooting. There is no set rules for this, experimenting with your target sites will help you.
    Author:
    Pascal Essiembre
    • Constructor Detail

      • HttpCollector

        public HttpCollector()
        Creates a non-configured HTTP collector.
      • HttpCollector

        public HttpCollector​(HttpCollectorConfig collectorConfig)
        Creates and configure an HTTP Collector with the provided configuration.
        Parameters:
        collectorConfig - HTTP Collector configuration