Most popular crawler repositories and open source projects
newspaper
newspaper3k is a news, full-text, and article metadata extraction in P...
2130 14668 14668
awesome-web-scraping
List of libraries, tools and APIs for web scraping and data processing...
814 7082 7082
browser-fingerprinting
Analysis of Bot Protection systems with available countermeasures 🚿....
225 4247 4247
DotnetSpider
DotnetSpider, a .NET standard web crawling library. It is lightweight,...
1002 3673 3673
Crawler_Illegal_Cases_In_China
Collection of China illegal cases about web crawler. This project is used to compile all...
250 3101 3101
GoogleScraper
A Python module to scrape several search engines (like Google, Yandex,...
745 2683 2683