DeadPool
该项目是一个使用celery作为主体框架的爬虫应用,能够灵活的添加爬虫任务,并且同时运行多站点的爬虫工作,所有组件都能够原生支持规模并发和分布式,加上celery原生的分布式调用,实现大规模并发。
How to download and setup DeadPool
Open terminal and run command
git clone https://github.com/Ryuchen/DeadPool.git
git clone is used to create a copy or clone of DeadPool repositories.
You pass git clone a repository URL. it supports a few different network protocols and corresponding URL formats.
Also you may download zip file with DeadPool https://github.com/Ryuchen/DeadPool/archive/master.zip
Or simply clone DeadPool with SSH
[email protected]:Ryuchen/DeadPool.git
If you have some problems with DeadPool
You may open issue on DeadPool support forum (system) here: https://github.com/Ryuchen/DeadPool/issuesSimilar to DeadPool repositories
Here you may see DeadPool alternatives and analogs
scrapy Sasila Price-monitor webmagic colly headless-chrome-crawler Lulu newcrawler scrapple goose-parser arachnid gopa scrapy-zyte-smartproxy node-crawler arachni newspaper webster spidy N2H4 easy-scraping-tutorial antch pomp talospider podcastcrawler FileMasta lux scrapy-redis haipproxy DotnetSpider TumblThree