Topic

crawler

Repositories (1232)

zhihu-login
zhihu-login zkqiang Python

知乎模拟登录,支持提取验证码和保存 Cookies

355
Rcrawler
Rcrawler salimk R

An R web crawler and scraper

353
supercrawler
supercrawler brendonboshell JavaScript

A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limi...

351
91porn-api
91porn-api colikno JavaScript

🌭💦 91porn爬虫在线无限制API接口(永久有效,口令每日更新) 及 在线web预览

350
spidy
spidy rivermont Python

The simple, easy to use command line web crawler.

349
magic_google
magic_google howie6879 Python

Google search results crawler, get google search results that you need

345
tsec
tsec Asoul

台灣上市上櫃股票爬蟲 Taiwan Stock Exchange Crawler

344
hQuery.php
hQuery.php duzun PHP

An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.

342
xcrawler
xcrawler yan68 PHP

快速、简洁且强大的PHP爬虫框架

338
Search-Engines-Scraper
Search-Engines-Scraper tasos-py Python

Search google, bing, yahoo, and other search engines with python

338
Moodle-DL
Moodle-DL C0D3D3V Python

Moodle-DL downloads course content fast from Moodle (eg. lecture pdfs)

338
ppspider
ppspider xiyuan-fengyu TypeScript

web spider built by puppeteer, support task-queue and task-scheduling by decorators,support nedb / mongodb, support data visualization; 基于puppetee...

336
polite
polite dmi3kno R

Be nice on the web

327
tiktok-downloader
tiktok-downloader krypton-byte Python

Tiktok Downloader/Scraper using requests & bs4

321
telegram-crawler
telegram-crawler MarshalX Python

🕷 Automatically detect changes made to the official Telegram sites, clients and servers.

318
Free_Proxy_Website
Free_Proxy_Website cyubuchen Python

获取免费socks/https/http代理的网站集合

316
lightnovel_epub
lightnovel_epub JeffersonQin Python

🍭 epub generator for (light)novels (轻)小说 epub 生成器,支持站点:轻之国度、轻小说文库

314
Laravel-Crawler-Detect
Laravel-Crawler-Detect JayBizzle PHP

A Laravel wrapper for CrawlerDetect - the web crawler detection library

313
crawler
crawler infinilabs Go

🕷️ An easy-to-use spider written in Golang. (previous named GOPA.)

309
nudecrawler
nudecrawler yaroslaff Python

Crawl telegra.ph searching for nudes!

305
line-bot-tutorial
line-bot-tutorial twtrubiks Python

line-bot-tutorial use python flask

298
Sasila
Sasila da2vin Python

一个灵活、友好的爬虫框架

296
pychromeless
pychromeless jairovadillo Python

Python Lambda Chrome Automation (naming pending)

294
chinese-fund-crawler
chinese-fund-crawler jackluson Python

中国场外基金数据爬取&汇总分析

290
PulsarRPA
PulsarRPA platonai Kotlin

Automate webpages at scale, scrape web data completely and accurately with high performance, distributed RPA.

287
Instagram-Bot
Instagram-Bot mustafadalga Python

An Instagram bot developed using the Selenium Framework

281
Python-Web-Scraping-Tutorial
Python-Web-Scraping-Tutorial oxylabs Python

In this Python Web Scraping Tutorial, we will outline everything needed to get started with web scraping. We will begin with simple examples and move...

280
awesome-java-crawler
awesome-java-crawler rockswang

本仓库收集整理爬虫相关资源,开发语言以Java为主

276
oddish
oddish puppylpg Python

Crawl csgo skin info from `buff.163.com` and steam, then find the most suitable one to buy from the former and to sell to the latter.

276
Fast-LianJia-Crawler
Fast-LianJia-Crawler CaoZ Python

直接通过链家 API 抓取数据的极速爬虫,宇宙最快~~ 🚀

274
th-music-video-generator
th-music-video-generator Jasonnor JavaScript

Touhou Project random music video generator/player, crawling image and video from websites to generate MV.

274
Strong-Web-Crawler
Strong-Web-Crawler microfisher C#

基于C#.NET+PhantomJS+Sellenium的高级网络爬虫程序。可执行Javascript代码、触发各类事件、操纵页面Dom结构。

272
sitemap-generator-cli
sitemap-generator-cli lgraubner JavaScript

Creates an XML-Sitemap by crawling a given site.

268
go-movies
go-movies hezhizheng Go

golang spider Crawler 爬虫 电影

268
scrapper
scrapper amerkurev Python

Web scraper with a simple REST API living in Docker and using a Headless browser and Readability.js for parsing.

266
Github-spider
Github-spider chenjiandongx Python

Github 仓库及用户分析爬虫

266
weiboPicDownloader
weiboPicDownloader yAnXImIN Java

免登录下载微博图片 爬虫 Download Weibo Images without Logging-in

264
Gorecon
Gorecon devanshbatham Go

Gorecon is a All in one Reconnaissance Tool , a.k.a swiss knife for Reconnaissance , A tool that every pentester/bughunter might wanna consider into...

263
antch
antch antchfx Go

Antch, a fast, powerful and extensible web crawling & scraping framework for Go

263
algoliasearch-netlify
algoliasearch-netlify algolia TypeScript

Official Algolia Plugin for Netlify. Index your website to Algolia when deploying your project to Netlify with the Algolia Crawler

259
weibo_terminator_workflow
weibo_terminator_workflow lucasjinreal Python

Update Version of weibo_terminator, This is Workflow Version aim at Get Job Done!

258
Selenops
Selenops zntfdr Swift

A Swift Web Crawler 🕷

257
ok_ip_proxy_pool
ok_ip_proxy_pool cwjokaka Python

🍿爬虫代理IP池(proxy pool) python🍟一个还ok的IP代理池

256
arachnid
arachnid zrashwani PHP

Crawl all unique internal links found on a given website, and extract SEO related information - supports javascript based sites

255
bitextor
bitextor bitextor Python

Bitextor generates translation memories from multilingual websites

255
wencai
wencai GraySilver JavaScript

This is a wencai crawler.(i问财的策略回测接口的Pythonic工具包)

252
Tumblr_Crawler
Tumblr_Crawler sparrow629 Python

This is a Multi-thread crawler for Tumblr.

251
FileSensor
FileSensor Xyntax Python

Dynamic file detection tool based on crawler 基于爬虫的动态敏感文件探测工具

250
chromium_for_spider
chromium_for_spider myvyang HTML

dynamic crawler for web vulnerability scanner

250
Sub
Sub Leon406 Kotlin

节点爬取,筛选, 支持Clash,base64订阅解析,自动生成可用的ss, ssr, v2ray, trojan节点. 已集成Github Action,每天8-24,定时更新.

249