3 repositories on SrcLog
DOM Based Content Extraction via Text Density
Helpers and stuff for building web crawlers.
Landing page for http://imscraping.ninja