27 Forks
94 Stars
94 Watchers

scaleable-crawler-with-docker-cluster

A scalable and efficient crawler built on a Docker cluster. It can crawl a million pages in 2 hours on a single machine.

How to download and set up scaleable-crawler-with-docker-cluster

Open a terminal and run the following command:
git clone https://github.com/tonywangcn/scaleable-crawler-with-docker-cluster.git
git clone creates a local copy (a clone) of the scaleable-crawler-with-docker-cluster repository. You pass git clone a repository URL.
It supports several network protocols and the corresponding URL formats.

You can also download scaleable-crawler-with-docker-cluster as a zip file: https://github.com/tonywangcn/scaleable-crawler-with-docker-cluster/archive/master.zip

Or simply clone scaleable-crawler-with-docker-cluster with SSH
git@github.com:tonywangcn/scaleable-crawler-with-docker-cluster.git
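
The download options above differ only in the form of the URL. The following is a minimal sketch, not part of the project itself, that builds the HTTPS, SSH, and zip URLs from the owner and repository name. The variable names (owner, repo, https_url, ssh_url, zip_url) are illustrative assumptions, and the commands that actually fetch code are left commented out because they need network access.

```shell
#!/bin/sh
# Illustrative variables only; the owner and repo values come from the URLs above.
owner="tonywangcn"
repo="scaleable-crawler-with-docker-cluster"

# The three URL forms for the same repository.
https_url="https://github.com/${owner}/${repo}.git"
ssh_url="git@github.com:${owner}/${repo}.git"
zip_url="https://github.com/${owner}/${repo}/archive/master.zip"

# Print one URL per line so they can be inspected or copied.
printf '%s\n' "$https_url" "$ssh_url" "$zip_url"

# Uncomment one of the following to fetch the code (requires network access):
# git clone "$https_url"
# git clone "$ssh_url"                    # needs an SSH key registered with GitHub
# curl -L -o "${repo}.zip" "$zip_url"     # downloads the master branch archive
```

Cloning over HTTPS is usually the simplest choice for read-only access, while SSH is convenient if you already have a key set up and plan to push changes.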

If you have problems with scaleable-crawler-with-docker-cluster

You can open an issue in the scaleable-crawler-with-docker-cluster issue tracker: https://github.com/tonywangcn/scaleable-crawler-with-docker-cluster/issues

Repositories similar to scaleable-crawler-with-docker-cluster

Alternatives and analogs of scaleable-crawler-with-docker-cluster:

tensorflow, scrapy, CNTK, diaspora, Qix, handson-ml, Sasila, Price-monitor, infinit, diplomat, olric, qTox, LightGBM, h2o-3, catboost, distributed, tns, webmagic, colly, headless-chrome-crawler, scrapy-cluster, Lulu, newcrawler, scrapple, goose-parser, arachnid, gopa, scrapy-zyte-smartproxy, EvaEngine.js, dgraph