Most popular crawler repositories and open source projects

tiktok-scraper-php snuzi PHP

Tiktok (Musically) PHP scraper

70 30 70

spider jhao104 Python

python crawler spider

70 26 70

lrabbit_scrapy litter-rabbit Python

a quick start python mutil thread crawl

70 1 70

slime nekolr Java

🍰 A visual crawler management platform

70 29 70

ipfs-crawler trudi-group Go

A crawler for the IPFS network, code for our paper (https://arxiv.org/abs/2002.07747). Also holds scripts to evaluate the obtained data and make simil...

70 16 70

meta-spy DEENUU1 Python

👾 CLI MetaSpy (Facebook, Instagram) scraper and crawler - instagram account, facebook accounts, pages and search

70 18 70

BOJ-AutoCommit ISKU Python

When you solve the problem of Baekjoon Online Judge, it automatically commits and pushes to the remote repository.

69 12 69

rubium vifreefly Ruby

Antidetect Headless Chrome Browser for Ruby Web Scraping and Automation

69 0 69

Crawling-CV-Conference-Papers seanywang0408 Jupyter Notebook

Crawling CV conference papers with Python.

69 5 69

crawlerdetect x-way Go

Golang module to detect bots and crawlers via the user agent

69 7 69

awesome-fingerprinting embeddinglayer

A collection of browser fingerprinting projects, research, and resources. Intended as a way to aggregate research surrounding the subject.

68 6 68

python-crawler ityouknow Python

Python Crawler

68 51 68

robotstxt ropensci R

R 📦 for parsing and checking robots.txt files 🤖

68 10 68

Tor_Spider absingh31 Python

Python project to crawl and scrap the lesser known deep web or one can say dark web. Just provide the onion link and get started.

68 16 68

MusicTagger Mai-icy Python

一个可以补全mp3，flac文件元数据的图形化界面。还可以下载歌词

67 6 67

chat-plugin-web-crawler lobehub HTML

🧩 / 🕸 WebsiteCrawler - This plugin automatically crawls the main content of a specified URL webpage and uses it as context input.

67 24 67

hproxy howie6879 Python

hproxy - Asynchronous IP proxy pool, aims to make getting proxy as convenient as possible.(异步爬虫代理池)

67 14 67

SecretScraper PadishahIII Python

SecretScraper is a web scraper that crawl through target websites, scrape from http response and extract secret information via regular expression.

67 4 67

bilibili_comment_crawl lemonmindyes Python

爬取bilibili视频下的评论，最新出品！！！⚠本代码只适用于学习，做其他事情概不负责！！！

67 5 67

zhihu-crawler NightMarcher Python

徒手实现定时爬取知乎，从中发掘有价值的信息，并可视化爬取的数据作网页展示。

67 9 67

lezhin-comics-downloader ImSejin Java

📥 Downloader for lezhin comics

67 9 67

OpenCrawler merwin-asm Python

Open Crawler || Open Source Crawler

67 9 67

mtywatch jufeng-2022

一句话监控网页内容变化，AI | 爬虫 | 网页监控 | 网页更新提醒 | 网页内容订阅

67 12 67

js_block webcoding HTML

研究学习各种拦截：反爬虫、拦截ad、防广告注入、斗黄牛等

66 17 66

nest-crawler saltyshiomix TypeScript

An easiest crawling and scraping module for NestJS

66 9 66

dht-crawler hijkzzz Go

A DHT Crawler based on Goroutine

65 4 65

bolsa gicornachini Python

Biblioteca feita em Python com o objetivo de facilitar o acesso a dados de seus investimentos na bolsa de valores(B3/CEI) através do Portal CEI.

65 18 65

PSGameSpider RavelloH JavaScript

自动爬取所有PlayStationStore中的所有游戏信息，包括封面、描述、价格、评分等，生成网页并索引 # # # Automatically crawl all game infos in all playstation...

65 6 65

datacrawl DataCrawl-AI Python

A simple and easy to use web crawler for Python

64 11 64

github-trending doforce Python

GitHub trending repositories and developers APIs for real time, powered by crawlers | 通过爬虫获取 GitHub 热门项目和开发者的实时 API

64 26 64

Chemrtron cho45 TypeScript

Chemr is a document viewer; fuzzy match incremental search.

64 14 64

crawdad schollz Go

Cross-platform persistent and distributed web crawler :crab:

64 9 64

medium-crawler NISH1001 Python

A crawler for scraping posts from medium.com

64 13 64

bthello rehe0x Python

Python3 DHT 磁力种子爬虫种子解析种子搜索演示地址

64 31 64

Auto_Shadowsocks VonSdite Python

Shadowsocks. 科学上网, 仅供学习。是免费的服务器，可能存在科学上网不稳定。

63 17 63

SoFIFA DiogoDantas Jupyter Notebook

A SoFIFA webcrawler and Machine Learning prediction

63 11 63

ZhihuVAPI cheezone Python

优雅地玩知乎

62 13 62

koshort koshort Python

(deprecated) :cat: koshort is a Python package for Korean internet spoken language crawling and processing... or maybe Korean domestic cat.

62 10 62

sciBASIC xieguigang Visual Basic .NET

sciBASIC# is a kind of dialect language which is derive from the native VB.NET language, and written for the data scientist.

62 29 62

Taiwan-Stocks smalldan1022 Python

台灣上市櫃公司爬蟲，分析盤後股票趨勢以及繪製K線圖、均線圖、三大法人成交量

62 18 62

Java-Carwler-Technology soberqian Java

网络数据采集技术—Java网络爬虫 (书稿完整代码，涉及网络爬虫的各种技术和知识点)

62 21 62

m3u8Downloader mrzhangfelix Python

meijuba.net,Python crawler,M3U8格式视频下载，桌面应用

62 21 62

crawler-project Albert-W Go

Google资深工程师深度讲解Go语言爬虫项目。

62 31 62

novel-downloader yjqiang Python

万能小说下载器

62 19 62

paperCrawler sucv Python

This is a Scrapy-based web-spider. It scrapes papers from TOP conferences and journals.

61 7 61

TumblTwo johanneszab C#

TumblTwo, an Improved Fork of TumblOne, a Tumblr Downloader.

61 14 61

crawler_JD_what_worthy_buying HarborZeng Python

爬取京东商品所有评论，利用情感分析，判断商品是否值得买

61 13 61

aio-vextractor panoslin Python

解析视频网站/APP/H5 页面视频信息。支持抖音、腾讯视频、YouTube、Instagram 等40余个网站与APP

61 15 61

webrtc-local-ip-leak niespodd HTML

Oh no, stop this. You can see my local IP address 😲! Use `foundation` attribute against CRC32 lookup table to reveal local IP address of a Chrome/Chr...

61 6 61

a11y-sitechecker forsti0506 TypeScript

Automatic accessibility checker with website crawling + screenshots for easy use

61 10 61

crawler

Repositories (1431)