nutch

Apache Nutch is an extensible and scalable web crawler.

How to download and set up nutch

Open a terminal and run the following command:
git clone https://github.com/apache/nutch.git
git clone creates a local copy of the nutch repository from the repository URL you pass to it.
Git supports several network protocols, each with its own URL format (for example HTTPS and SSH).
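The two URL formats mentioned on this page can be sketched as a small shell helper. This is only an illustration: the function name nutch_clone_url is hypothetical, and the SSH form assumes GitHub's usual git@github.com:owner/repo.git pattern.

```shell
#!/bin/sh
# Hypothetical helper: print the clone URL for apache/nutch
# in the requested protocol ("https" or "ssh").
nutch_clone_url() {
  case "$1" in
    https) printf '%s\n' "https://github.com/apache/nutch.git" ;;
    ssh)   printf '%s\n' "git@github.com:apache/nutch.git" ;;
    *)     printf 'unknown protocol: %s\n' "$1" >&2; return 1 ;;
  esac
}

# Usage (requires git and network access; SSH also needs a key
# registered with your GitHub account):
#   git clone "$(nutch_clone_url https)"
#   git clone --depth 1 "$(nutch_clone_url https)"   # shallow clone: latest commit only
```

HTTPS needs no credentials for a read-only clone of a public repository, so it is probably the simpler choice unless you already have SSH keys set up.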

Alternatively, download the repository as a zip archive: https://github.com/apache/nutch/archive/master.zip
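If you prefer the command line to a browser, the zip download might look like the sketch below. The helper names nutch_archive_url and fetch_nutch_zip are hypothetical, and the sketch assumes curl and unzip are installed; only the master archive URL comes from this page, so treating other branch names the same way is an assumption.

```shell
#!/bin/sh
# Hypothetical helper: print the zip archive URL for a branch of apache/nutch.
# Defaults to "master", the only branch URL given on this page.
nutch_archive_url() {
  branch="${1:-master}"
  printf 'https://github.com/apache/nutch/archive/%s.zip\n' "$branch"
}

# Hypothetical helper: download the archive and unpack it in the current
# directory (requires curl, unzip, and network access).
fetch_nutch_zip() {
  branch="${1:-master}"
  # -f fails on HTTP errors; -L follows redirects to the download host
  curl -fL -o "nutch-$branch.zip" "$(nutch_archive_url "$branch")" &&
    unzip -q "nutch-$branch.zip"
}

# Usage:
#   fetch_nutch_zip          # downloads and unpacks the master archive
```

Unlike a clone, the zip archive carries no Git history, so it suits a quick look at the source more than ongoing development.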

Or clone nutch over SSH using this URL:
git@github.com:apache/nutch.git

If you run into problems with nutch

You can open an issue on the nutch issue tracker: https://github.com/apache/nutch/issues

Repositories similar to nutch

The following repositories are listed as alternatives and analogs to nutch:

angular, flask, scrapy, CNTK, NativeScript, flutter, zxing, jadx, fastjson, libgdx, Android-CleanArchitecture, selenium, sinatra, graal, Anki-Android, echo, iris, spring-boot, vapor, cakephp, aws-doc-sdk-examples, java-design-patterns, RxJava, elasticsearch, guava, interviews, dubbo, generator-jhipster, jenkins, ExoPlayer