OceanMonkey
OceanMonkey is a High-Level Distributed Web Crawling and Web Scraping framework base on multi-process and multi-coroutines, used to crawl websites and extract structured data from their pages like the classical scrapy framework.
How to download and setup OceanMonkey
Open terminal and run command
git clone https://github.com/chipscoco/OceanMonkey.git
git clone is used to create a copy or clone of OceanMonkey repositories.
You pass git clone a repository URL. it supports a few different network protocols and corresponding URL formats.
Also you may download zip file with OceanMonkey https://github.com/chipscoco/OceanMonkey/archive/master.zip
Or simply clone OceanMonkey with SSH
[email protected]:chipscoco/OceanMonkey.git
If you have some problems with OceanMonkey
You may open issue on OceanMonkey support forum (system) here: https://github.com/chipscoco/OceanMonkey/issuesSimilar to OceanMonkey repositories
Here you may see OceanMonkey alternatives and analogs
scrapy requests-html Sasila webmagic colly headless-chrome-crawler Embed artoo instagram-scraper django-dynamic-scraper scrapy-cluster Lulu newcrawler panther facebook_data_analyzer ImageScraper scrapple parsel nickjs jsoup-annotations jekyll Musoq goose-parser arachnid lambdasoup crawler geeksforgeeks.pdf scrapy-zyte-smartproxy sqrape comic-dl