crawlerr
A simple and fully customizable web crawler/spider for Node.js with server-side DOM. Comes with elegant and hell-simple APIs.
How to download and setup crawlerr
Open terminal and run command
git clone https://github.com/Bartozzz/crawlerr.git
git clone is used to create a copy or clone of crawlerr repositories.
You pass git clone a repository URL. it supports a few different network protocols and corresponding URL formats.
Also you may download zip file with crawlerr https://github.com/Bartozzz/crawlerr/archive/master.zip
Or simply clone crawlerr with SSH
[email protected]:Bartozzz/crawlerr.git
If you have some problems with crawlerr
You may open issue on crawlerr support forum (system) here: https://github.com/Bartozzz/crawlerr/issuesSimilar to crawlerr repositories
Here you may see crawlerr alternatives and analogs
scrapy Sasila Price-monitor webmagic colly headless-chrome-crawler Lulu newcrawler scrapple goose-parser arachnid gopa scrapy-zyte-smartproxy node-crawler arachni newspaper webster spidy N2H4 easy-scraping-tutorial antch pomp talospider podcastcrawler FileMasta lux scrapy-redis haipproxy DotnetSpider TumblThree