Squidwarc
Squidwarc is a high-fidelity, user-scriptable archival crawler that uses Chrome or Chromium, either headless or with a visible browser window.
How to download and set up Squidwarc
Open a terminal and run the following command:
git clone https://github.com/N0taN3rd/Squidwarc.git
git clone creates a local copy (a clone) of the Squidwarc repository.
You pass git clone a repository URL. It supports several network protocols, each with its own URL format.
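As a rough illustration of the URL formats mentioned above, the sketch below builds the HTTPS form and the SSH (scp-like) form of the same GitHub repository address. The variable names are assumptions made for this example and are not part of Squidwarc; nothing is cloned unless one of the commented-out commands at the end is enabled.

```shell
#!/bin/sh
# Illustrative variables for this example only (not part of Squidwarc).
owner="N0taN3rd"
repo="Squidwarc"

# HTTPS form: works without an SSH key.
https_url="https://github.com/${owner}/${repo}.git"

# SSH (scp-like) form: requires an SSH key registered with GitHub.
ssh_url="git@github.com:${owner}/${repo}.git"

echo "HTTPS: ${https_url}"
echo "SSH:   ${ssh_url}"

# Uncomment one of the lines below to clone over the chosen protocol.
# git clone "${https_url}"
# git clone "${ssh_url}"
```

Both forms point at the same repository; the choice mainly depends on whether an SSH key is already set up for the GitHub account.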
Alternatively, you can download Squidwarc as a zip file: https://github.com/N0taN3rd/Squidwarc/archive/master.zip
Or clone Squidwarc over SSH:
git clone git@github.com:N0taN3rd/Squidwarc.git
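For readers who prefer the zip archive over a clone, the download could be scripted roughly as follows. This is a minimal sketch, assuming curl and unzip are installed; the helper function name is hypothetical and is not shipped with Squidwarc.

```shell
#!/bin/sh
# Zip archive URL of the master branch.
zip_url="https://github.com/N0taN3rd/Squidwarc/archive/master.zip"

# Hypothetical helper for this example: download and unpack the archive.
download_squidwarc_zip() {
    # -L follows redirects to the host that actually serves the archive.
    curl -L -o Squidwarc.zip "${zip_url}" || return 1
    # -q keeps unzip quiet while extracting into the current directory.
    unzip -q Squidwarc.zip
}

# Usage (requires network access, so it is left commented out here):
# download_squidwarc_zip
```

GitHub branch archives usually unpack into a directory named after the repository and branch (here, most likely Squidwarc-master), but it is worth checking the extracted name before relying on it. Unlike a clone, the unpacked archive contains no git history.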
If you have problems with Squidwarc
You can open an issue on the Squidwarc issue tracker: https://github.com/N0taN3rd/Squidwarc/issues
Repositories similar to Squidwarc
Squidwarc alternatives and analogs:
scrapy, Sasila, Price-monitor, webmagic, colly, headless-chrome-crawler, Lulu, newcrawler, scrapple, goose-parser, arachnid, gopa, scrapy-zyte-smartproxy, node-crawler, arachni, newspaper, isp-data-pollution, webster, cdp4j, spidy, stopstalk-deployment, N2H4, memorious, easy-scraping-tutorial, antch, pomp, Harvester, diffbot-php-client, talospider, corpuscrawler