2 repositories on SrcLog
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Models, and associated helper code for GSOC 2017 project Tensorflow Image to Text in Apache Tika