scrapy-distributed
A series of distributed components for Scrapy, including RabbitMQ-based, Kafka-based, and RedisBloom-based components.
How to download and set up scrapy-distributed
Open a terminal and run the following command:
git clone https://github.com/Insutanto/scrapy-distributed.git
git clone creates a local copy (a clone) of the scrapy-distributed repository.
You pass git clone a repository URL. It supports several network protocols, such as HTTPS and SSH, each with its own URL format.
You can also download scrapy-distributed as a zip file: https://github.com/Insutanto/scrapy-distributed/archive/master.zip
Or clone scrapy-distributed over SSH:
git clone git@github.com:Insutanto/scrapy-distributed.git
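The HTTPS, zip, and SSH options above differ only in how the URL is written. The following is a minimal Python sketch of those formats; the helper name `repo_urls` and its parameters are hypothetical and are not part of scrapy-distributed or git, and the `master` branch default is taken from the zip link above.

```python
def repo_urls(owner: str, repo: str, branch: str = "master") -> dict:
    """Build the common GitHub URL forms for a repository.

    Hypothetical helper for illustration only; it performs no network access.
    """
    base = f"https://github.com/{owner}/{repo}"
    return {
        "https": f"{base}.git",                       # git clone over HTTPS
        "ssh": f"git@github.com:{owner}/{repo}.git",  # git clone over SSH
        "zip": f"{base}/archive/{branch}.zip",        # zip archive download
        "issues": f"{base}/issues",                   # issue tracker
    }


urls = repo_urls("Insutanto", "scrapy-distributed")
for name, url in urls.items():
    print(f"{name}: {url}")
```

Either the "https" or the "ssh" value can be passed to git clone; the SSH form assumes an SSH key is already registered with your GitHub account.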
If you have problems with scrapy-distributed
You can open an issue on the scrapy-distributed issue tracker: https://github.com/Insutanto/scrapy-distributed/issues
Repositories similar to scrapy-distributed
Alternatives and analogs to scrapy-distributed:
scrapy requests-html technology-talk Sasila Price-monitor webmagic colly headless-chrome-crawler Embed artoo instagram-scraper django-dynamic-scraper scrapy-cluster Lulu newcrawler panther facebook_data_analyzer ImageScraper scrapple parsel nickjs jsoup-annotations jekyll Musoq goose-parser arachnid lambdasoup gopa geeksforgeeks.pdf scrapy-zyte-smartproxy