29 Forks
151 Stars
151 Watchers

comcrawl

A python utility for downloading Common Crawl data

How to download and setup comcrawl

Open terminal and run command
git clone https://github.com/michaelharms/comcrawl.git
git clone is used to create a copy or clone of comcrawl repositories. You pass git clone a repository URL.
it supports a few different network protocols and corresponding URL formats.

Also you may download zip file with comcrawl https://github.com/michaelharms/comcrawl/archive/master.zip

Or simply clone comcrawl with SSH
[email protected]:michaelharms/comcrawl.git

If you have some problems with comcrawl

You may open issue on comcrawl support forum (system) here: https://github.com/michaelharms/comcrawl/issues