64 Forks
156 Stars
156 Watchers

collector-http

Norconex Web Crawler (or spider) is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.

How to download and setup collector-http

Open terminal and run command
git clone https://github.com/Norconex/collector-http.git
git clone is used to create a copy or clone of collector-http repositories. You pass git clone a repository URL.
it supports a few different network protocols and corresponding URL formats.

Also you may download zip file with collector-http https://github.com/Norconex/collector-http/archive/master.zip

Or simply clone collector-http with SSH
[email protected]:Norconex/collector-http.git

If you have some problems with collector-http

You may open issue on collector-http support forum (system) here: https://github.com/Norconex/collector-http/issues