Topic

crawler

Repositories (1232)

js-cookie-monitor-debugger-hook
js-cookie-monitor-debugger-hook JSREI JavaScript

js cookie逆向利器:js cookie变动监控可视化工具 & js cookie hook打条件断点

41
noscrape
noscrape schoenbergerb TypeScript

This repository is deprecated

41
aristotle
aristotle egcodes Python

highly customizable news collector

40
php-crawler
php-crawler elboletaire PHP

:spider: A simple crawler (spider) writen in php just for fun, with zero dependencies

40
laundry
laundry endquote JavaScript

Data laundering tools

40
USTBCrawlers
USTBCrawlers nladuo Python

那些年,我爬过的北科。一个由浅入深的定向爬虫教程。

40
ncrawler
ncrawler kant2002 C#

Web Crawler written in C#

40
HttpProxy
HttpProxy asche910 Java

JAVA实现的IP代理池,支持HTTP与HTTPS两种方式

40
sponge
sponge spypunk Kotlin

sponge is a website crawler and links downloader command-line tool

40
MahjongKit
MahjongKit erreurt Python

Riichi Mahjong Kit: (1) Game log crawler (sqlite3, json, bs4); (2) Game log preprocessor; (3) Deterministic algorithms library

40
lezhin-comics-downloader
lezhin-comics-downloader ImSejin Java

📥 Downloader for lezhin comics

40
scrapingant-client-python
scrapingant-client-python ScrapingAnt Python

ScrapingAnt API client for Python.

40
ICLR2023-OpenReviewData
ICLR2023-OpenReviewData fedebotu Jupyter Notebook

Crawl & Visualize ICLR 2023 Data from OpenReview

40
cewler
cewler roys Python

CeWLeR - Custom Word List generator Redefined. CeWL alternative in Python, based on the Scrapy framework.

40
insecres
insecres kkomelin Go

A console tool that finds insecure resources on HTTPS sites

39
SpiderWho
SpiderWho lanrat Python

A very fast whois crawler

39
podcastcrawler
podcastcrawler podcastcrawler PHP

PHP library to find podcasts

39
WebCrawler
WebCrawler zhk0603 C#

一个轻量级、快速、多线程、多管道、灵活配置的网络爬虫。

39
LuoguCrawler
LuoguCrawler himself65 Python

一个python爬虫来爬取洛谷各种信息

39
AppCrawler
AppCrawler tongtzeho Python

Android应用市场网络爬虫

39
crawel
crawel MrXujiang JavaScript

基于Apify+node+react搭建的有点意思的爬虫平台

39
dijnet-bot
dijnet-bot juzraai JavaScript

Az összes számlád még egy helyen :)

39
Domainker
Domainker BitTheByte Python

BugBounty Tool

38
TripAdvisor_crawler
TripAdvisor_crawler Tang-Li-Jen Python

Python Crawler: Scrape Data From Tripadvisor

38
BaiduImageCrawler
BaiduImageCrawler flexwang-zz Python

A multithreaded tool for downloading search results of Baidu image search.

38
leboncoin-crawler
leboncoin-crawler rfussien HTML

Crawler for leboncoin.fr

38
ArticleSpider
ArticleSpider hackfengJam Python

Crawling zhihu, jobbole, lagou by Scrapy, and using Elasticsearch+Django to build a Search Engine website --- README_zh.md (including: implementation...

38
CrawlerSamples
CrawlerSamples VAllens C#

This is a Puppeteer+AngleSharp crawler console app samples, used C# 7.1 coding and dotnet core build.

38
tiktok-crawler
tiktok-crawler hackertogether Python

This is a Tiktok Crawler App.

38
DeadPool
DeadPool Ryuchen Python

该项目是一个使用celery作为主体框架的爬虫应用,能够灵活的添加爬虫任务,并且同时运行多站点的爬虫工作,所有组件都能够原生支持规模并发和分布式,加上celery...

38
SeleniumLogin
SeleniumLogin CharlesPikachu Python

Login some website using selenium.

38
crawlerdetect
crawlerdetect x-way Go

Golang module to detect bots and crawlers via the user agent

38
papercut
papercut armand1m TypeScript

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Cachin...

38
novel-downloader
novel-downloader yjqiang Python

万能小说下载器

38
ProxyScan
ProxyScan Its-Vichy Go

🔎 scan the internet to find "private" proxies.

38
CygnusX1
CygnusX1 datnnt1997 Python

A multithreaded tool for searching and downloading images from popular search engines. It is straightforward to set up and run!

38
chan-downloader
chan-downloader mariot Rust

CLI to download all images/webms in a 4chan thread

38
EH-PDF
EH-PDF Galgamer-org Python

將一個 E-Hentai 畫廊下載並轉換成 PDF,方便在 Kindle 上閱讀 以及在 iPad 上閱讀並作筆記,,,

38
BiliBili-Manga-Downloader
BiliBili-Manga-Downloader BiliBili-Manga-Downloader-Dev-Team Python

一个好用的哔哩哔哩漫画下载器,拥有图形界面,支持关键词搜索漫画,多线程下载,多种保存格式,本地漫画管理,一键检查更新! keywords:B站;B漫;漫画下载;B...

38
CobWeb-lnx
CobWeb-lnx GoncaloMark Python

CobWeb is a Python library for web scraping. The library consists of two classes: Spider and Scraper.

38
scrapy-zyte-api
scrapy-zyte-api scrapy-plugins Python

Zyte API integration for Scrapy

38
auto_crawler_ptt_beauty_image
auto_crawler_ptt_beauty_image twtrubiks Python

Auto Crawler Ptt Beauty Image Use Python Schedule

37
Spider
Spider xiantang Python

web crawler

37
NodeSpider
NodeSpider Bin-Huang TypeScript

[DEPRECATED] Simple, flexible, delightful web crawler/spider package

37
UniversityRecruitment-sSurvey
UniversityRecruitment-sSurvey Maicius Python

用严肃的数据来回答“什么样的企业会到什么样的大学招聘”?

37
medium-stat-box
medium-stat-box kylemocode TypeScript

Practical pinned gist which show your latest medium status 📌

37
d00r
d00r CYB3RMX Python

Simple directory brute-force tool written with python.

37
fii
fii riquellopes HTML

API para recuperar informações sobre FII

37
lolcrawler
lolcrawler jonaslejon Python

Headless web crawler for bugbounty and penetration-testing/redteaming

37
crawlhtmltopdf
crawlhtmltopdf osdodo Python

一个将runoob.com转换为PDF的爬虫

37