Most popular crawler repositories and open source projects

findmeaflat adriankumpf JavaScript

Get notified of new listings on popular German real estate portals.

23 5 23

googleplay_api alessandrodd Python

Google Play Unofficial Python 3 API Library

23 8 23

Helios stefan2200 Python

A Python based Web Application security scanner

23 11 23

app-crawler maguowei Python

crawling App by uiautomator2 & mitmproxy

23 9 23

indieweb-search capjamesg Python

Source code for the IndieWeb search engine.

23 2 23

doc_crawler.py Siltaar

Explore a website recursively and download all the wanted documents (PDF, ODT…)

22 7 22

okcoin-socket-crawler Asoul Python

A okcoin crawler based on websocket, save data to mysql

22 9 22

Gumo nvk681 JavaScript

A crawler that extracts data from a dynamic webpage. Written in node js.

22 0 22

spider-video tibaiwan JavaScript

Node 批量爬取头条视频

22 7 22

RedBetter-WM2 Mechazawa Python

Better.php crawler for Redacted that uses WhatManager

22 0 22

exoskeleton RuedigerVoigt Python

A Python framework to build polite, but tenacious crawlers / scrapers with a MariaDB backend

22 1 22

linkedin-public-dir-companies robertoarruda Python

Crawler and scraper of the public directory of companies on LinkedIn.

22 5 22

minicrawler testomato C

Multiplexing web client supporting HTTP/2 and WHATWG URL compliant parser written in C

22 5 22

tistore Kagami JavaScript

:camera: Tistory photo grabber

22 10 22

libp2p-dht-scrape-aas alanshaw Go

🧹 A libp2p DHT scraper as a service allowing anyone to collect, consume and use to generate useful reports & visualisations.

22 3 22

httpsuite whoamisec75 Python

A toolkit for web reconnaissance, it's fast and easy to use.

22 8 22

sinaCrawlerV HubQin Python

backup posts and comments of specify user in sina

21 1 21

hupu_spider kongtrio Python

虎扑步行街爬虫

21 8 21

ParseMyCF-contest JanaSabuj Python

A personal submission codeforces parser for CF, parsed by individual contests.The user is prompted for the username and has the flexibilty to parse la...

21 8 21

cea beetcb JavaScript

高校高校统一身份认证 Node.js 优雅可扩展示例，已集成今日校园签到(支持多平台一键部署)

21 10 21

crawling-framework tokenmill Java

Easily crawl news portals or blog sites using Storm Crawler.

21 4 21

akka-react-cloudant IBM CSS

A Soccer Dashboard created by scraping EPL website using Akka backend and ReactJS frontend and IBM Cloudant for object storage. IBM Cloud Foundry is u...

21 18 21

rankr endlessdev TypeScript

🇰🇷 Realtime integrated information analysis service

21 2 21

SlackWebhooksGithubCrawler Gruppio JavaScript

Search for Slack Webhooks token publicly exposed on Github

21 1 21

web-crawljs kayslay JavaScript

web crawler for Nodejs

21 4 21

QQSpider FanhuaandLuomu Python

爬取QQ用户信息（qq号、昵称、生日、地址等基本信息）并做简要analysis。

21 4 21

crawler mediamonks PHP

Crawl your own website with various clients for SEO and indexing purposes.

21 4 21

ZhengFang_System_Spider ZYSzys Python

:bug:一只登录正方教务管理系统，爬取数据的小爬虫

21 2 21

html-article-extractor woojubb JavaScript

A web page content extractor

21 1 21

MovieRater Asing1001 TypeScript

A useful website for finding movie's rating in Chinese and English. By crawling Yahoo, Ptt, IMDB.

21 2 21

covid-19-crawler LiveCoronaDetector Python

코로나 확진자 수/정보 크롤링

21 10 21

taiwanlottery yiyu0x Python

Taiwan Lottery Crawler 🐛 (Crawler for Various Types of Lotteries in Taiwan)

21 4 21

xianzhi_articles h4ckdepy

先知文章爬虫项目-[包含2021年7月之前所有文章]

21 5 21

actor-youtube-scraper bernardro JavaScript

Apify actor to scrape Youtube search results. You can set the maximum videos to scrape per page as well as the date from which to start scraping.

21 16 21

proxycrawl-php crawlbase PHP

ProxyCrawl PHP library for scraping and crawling websites

21 5 21

scrapy_poetry Imposingapple Python

本项目使用scrapy对古诗文网进行爬虫，获取不同分类（爱情、七夕等）的宋词的：词牌、作者、正文、注释、创作背景。

21 0 21

estate-crawler nstapelbroek Python

Scraping the real estate agencies for up-to-date house listings as soon as they arrive!

21 5 21

lopez tokahuke Rust

Crawling and scraping the Web for fun and profit

21 3 21

Codeforces-AutoCommit ISKU Python

When you solve the problem of the Codeforces site, it automatically commits and pushes to the remote repository.

21 5 21

book-spider Cansiny0320 TypeScript

🎉 开箱即用的高性能可自定义的笔趣阁小说爬虫快速下载无广告小说

21 4 21

Crawler ggfgh Python

整理本人在2021年10月-12月期间写的一些爬虫demo，比如用于渗透测试中SQL注入的URL收集脚本(爬取必应和百度搜索结果的URL)，子域名爆破demo，各大高校漏洞信息收...

21 5 21

telegram-member-inviter mjavadhpour Python

Crawling client's groups and channels to invite their members to a target group.

21 15 21

vermouth yasongxu Python

A torrent site written in the python language & douban scraper

20 11 20

crawl crackcomm Go

Lightweight library for scalable crawlers in Go.

20 4 20

crawler xbynet Java

A simple and flexible web crawler framework for java.

20 5 20

scrapy-azuresearch-crawler-samples yokawasa Python

Scrapy as a Web Crawler for Azure Search Samples

20 7 20

botanalyse gtbotsonar

botsonar analyse open api

20 3 20

domfind diogo-fernan Python

A Python DNS crawler to find identical domain names under different TLDs.

20 3 20

hero iflycn Python

百万英雄答题助手 - 兼容全部答题 APP

20 7 20

flutter_spider_fx Deali-Axy Dart

Flutter爬虫框架，帮助开发者快速在移动设备上构建爬虫，单线程版本

20 1 20

crawler

Repositories (1431)