140 Forks
402 Stars
402 Watchers

sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

How to download and setup sparkler

Open terminal and run command
git clone https://github.com/USCDataScience/sparkler.git
git clone is used to create a copy or clone of sparkler repositories. You pass git clone a repository URL.
it supports a few different network protocols and corresponding URL formats.

Also you may download zip file with sparkler https://github.com/USCDataScience/sparkler/archive/master.zip

Or simply clone sparkler with SSH
[email protected]:USCDataScience/sparkler.git

If you have some problems with sparkler

You may open issue on sparkler support forum (system) here: https://github.com/USCDataScience/sparkler/issues

Similar to sparkler repositories

Here you may see sparkler alternatives and analogs

 learn-anything    etcd    nsq    Qix    elasticsearch    dubbo    incubator-mxnet    hraftd    diplomat    js    elasticell    olric    translations    scalecube-services    finagle    MHTextSearch    neutrino    mgmt    burry.sh    gosiris    dbtester    bit    rqlite    atomix    copycat    raft-rs    PySyncObj    raft    ra    verdi-raft