Most popular crawler repositories and open-source projects

puppeteer-sharp hardkoded C#

Headless Chrome .NET API

3.9k 483 3.9k

feapder Boris-code Python

🚀🚀🚀 feapder is an easy-to-use, powerful Python crawler framework. Built-in AirSpider, Spider, TaskSpider, Batc...

3.7k 542 3.7k

RED_HAWK Tuhinshubhra PHP

All-in-one tool for information gathering, vulnerability scanning, and crawling. A must-have tool for all penetration testers

3.6k 938 3.6k

mdcx sqzw-x Python

Movie metadata scraper

3.6k 464 3.6k

toapi elliotgao2 Python

Every web site provides APIs.

3.6k 238 3.6k

cariddi edoardottt Go

Take a list of domains, crawl URLs, and scan for endpoints, secrets, API keys, file extensions, tokens, and more

3.4k 292 3.4k

Python3-Spider wkunzhi Python

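Python crawlers in practice: simulated login to major websites, including but not limited to slider CAPTCHA verification, Pinduoduo, Meituan, Baidu, bilibili, Dianping, and Taobao. If you like it, please star ❤️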

3.4k 1k 3.4k

NGCBot ngc660sec

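A WeChat bot based on the ✨HOOK mechanism, supporting 🌱scheduled security news pushes [FreeBuf, Xianzhi, Anquanke, Qi An Xin Attack and Defense Community], 👯KFC copywriting, ⚡vulnerability lookup, ⚡phone number location lookup, ⚡knowledge base query...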

3.3k 484 3.3k

crawlergo Qianlitp Go

A powerful browser crawler for web vulnerability scanners

3k 497 3k

gospider jaeles-project Go

Gospider - Fast web spider written in Go

2.9k 335 2.9k

DecryptLogin CharlesPikachu Python

DecryptLogin: APIs for logging in to some websites using requests.

2.9k 748 2.9k

owllook howie6879 Python

owllook: a search engine for novels

2.8k 756 2.8k

google-play-scraper facundoolano JavaScript

Node.js scraper to get data from Google Play

2.8k 706 2.8k

crawler spatie PHP

https://spatie.be/docs/crawler

2.8k 368 2.8k

GoogleScraper NikolaiT HTML

A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.

2.8k 750 2.8k

geziyor geziyor Go

Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.

2.8k 157 2.8k

FinalRecon thewhiteh4t Python

All In One Web Recon

2.7k 488 2.7k

QueryList jae-jae PHP

🕷 The elegant, progressive PHP crawler framework!

2.7k 429 2.7k

gecco xtuhcy Java

Easy-to-use, lightweight web crawler

2.5k 876 2.5k

instagram-scraper realsirjoe Python

Scrapes media, likes, followers, tags, and all metadata. Inspired by instagram-php-scraper, bot

2.5k 398 2.5k

xalpha refraction-ray Python

Fund investment management and backtesting engine

2.5k 470 2.5k

lianjia-beike-spider jumper2014 Python

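Housing price crawler for Lianjia and Beike, collecting housing price data (residential communities, second-hand homes, rentals, new homes) for 21 major Chinese cities, including Beijing, Shanghai, Guangzhou, and Shenzhen. Stable, reliable, and fast! Supports CSV, MySQL, MongoDB, Excel, js...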

2.5k 645 2.5k

grab lorien Python

Web Scraping Framework

2.5k 278 2.5k

news-please fhamborg Python

news-please - an integrated web crawler and information extractor for news that just works

2.4k 452 2.4k

spider spider-rs Rust

Web crawler and scraper for Rust

2.4k 197 2.4k

Leaked-GPTs friuns2 Python

Leaked GPTs prompts. Bypass the 25-message limit or try out GPTs without a Plus subscription.

2.4k 389 2.4k

Crawler-Detect JayBizzle PHP

🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent

2.3k 277 2.3k

abot sjdirect C#

Cross-platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

2.3k 552 2.3k

vulnx anouarbensaad Python

vulnx 🕷️ is an intelligent bot shell that can perform automatic injection and help researchers detect security vulnerabilities in CMS systems. It can perform a...

2.1k 362 2.1k

goclone goclone-dev Go

Website Cloner - Uses powerful goroutines to clone websites to your computer within seconds.

2.1k 385 2.1k

skycaiji zorlan PHP

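SkyCaiji (蓝天采集器) is a free, open-source crawler system. Data can be collected simply by pointing and clicking to edit rules. It runs locally, on shared hosting, or on cloud servers, can collect almost any type of web page, and integrates seamlessly with all kinds of CMS site-building...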

2.1k 609 2.1k

gocrawl PuerkitoBio Go

Polite, slim and concurrent web crawler.

2.1k 194 2.1k

gain elliotgao2 Python

Web crawling framework based on asyncio.

2k 205 2k

SCrawler AAndyProgram Visual Basic .NET

🏳️‍🌈 Media downloader from any sites, including Twitter, Reddit, Instagram, BlueSky, TikTok, Threads, Facebook, OnlyFans, YouTube, Pinterest, PornHub...

2k 140 2k