crawler
Libraries and scripts for crawling the TYPO3 page tree. Used for re-caching, re-indexing, publishing applications etc.
How to download and setup crawler
Open terminal and run command
git clone https://github.com/tomasnorre/crawler.git
git clone is used to create a copy or clone of crawler repositories.
You pass git clone a repository URL. it supports a few different network protocols and corresponding URL formats.
Also you may download zip file with crawler https://github.com/tomasnorre/crawler/archive/master.zip
Or simply clone crawler with SSH
[email protected]:tomasnorre/crawler.git
If you have some problems with crawler
You may open issue on crawler support forum (system) here: https://github.com/tomasnorre/crawler/issuesSimilar to crawler repositories
Here you may see crawler alternatives and analogs
scrapy Sasila Price-monitor webmagic colly headless-chrome-crawler Lulu newcrawler scrapple goose-parser arachnid gopa scrapy-zyte-smartproxy node-crawler arachni newspaper webster spidy N2H4 easy-scraping-tutorial antch pomp talospider podcastcrawler FileMasta lux scrapy-redis haipproxy DotnetSpider TumblThree