Topic

crawler

Repositories (1431)

findmeaflat
findmeaflat adriankumpf JavaScript

Get notified of new listings on popular German real estate portals.

23
googleplay_api
googleplay_api alessandrodd Python

Google Play Unofficial Python 3 API Library

23
Helios
Helios stefan2200 Python

A Python-based web application security scanner

23
app-crawler
app-crawler maguowei Python

Crawl apps using uiautomator2 and mitmproxy

23
indieweb-search
indieweb-search capjamesg Python

Source code for the IndieWeb search engine.

23
doc_crawler.py
doc_crawler.py Siltaar

Explore a website recursively and download all the wanted documents (PDF, ODT…)

22
okcoin-socket-crawler
okcoin-socket-crawler Asoul Python

An OKCoin crawler based on WebSocket that saves data to MySQL

22
Gumo
Gumo nvk681 JavaScript

A crawler that extracts data from a dynamic webpage. Written in Node.js.

22
spider-video
spider-video tibaiwan JavaScript

Batch-crawl Toutiao videos with Node

22
RedBetter-WM2
RedBetter-WM2 Mechazawa Python

Better.php crawler for Redacted that uses WhatManager

22
exoskeleton
exoskeleton RuedigerVoigt Python

A Python framework to build polite, but tenacious crawlers / scrapers with a MariaDB backend

22
linkedin-public-dir-companies
linkedin-public-dir-companies robertoarruda Python

Crawler and scraper of the public directory of companies on LinkedIn.

22
minicrawler
minicrawler testomato C

Multiplexing web client supporting HTTP/2 and WHATWG URL compliant parser written in C

22
tistore
tistore Kagami JavaScript

:camera: Tistory photo grabber

22
libp2p-dht-scrape-aas
libp2p-dht-scrape-aas alanshaw Go

🧹 A libp2p DHT scraper as a service allowing anyone to collect, consume and use to generate useful reports & visualisations.

22
httpsuite
httpsuite whoamisec75 Python

A fast, easy-to-use toolkit for web reconnaissance.

22
sinaCrawlerV
sinaCrawlerV HubQin Python

Back up the posts and comments of a specified user on Sina

21
hupu_spider
hupu_spider kongtrio Python

Crawler for Hupu's Buxingjie (Pedestrian Street) forum

21
ParseMyCF-contest
ParseMyCF-contest JanaSabuj Python

A personal Codeforces submission parser, parsed by individual contests. The user is prompted for the username and has the flexibility to parse la...

21
cea
cea beetcb JavaScript

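An elegant, extensible Node.js example of unified university identity authentication, with Campus Daily (今日校园) check-in integrated (supports one-click deployment on multiple platforms)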

21
crawling-framework
crawling-framework tokenmill Java

Easily crawl news portals or blog sites using Storm Crawler.

21
akka-react-cloudant
akka-react-cloudant IBM CSS

A soccer dashboard created by scraping the EPL website, using an Akka backend, a ReactJS frontend, and IBM Cloudant for object storage. IBM Cloud Foundry is u...

21
rankr
rankr endlessdev TypeScript

🇰🇷 Realtime integrated information analysis service

21
SlackWebhooksGithubCrawler
SlackWebhooksGithubCrawler Gruppio JavaScript

Search for Slack webhook tokens publicly exposed on GitHub

21
web-crawljs
web-crawljs kayslay JavaScript

Web crawler for Node.js

21
QQSpider
QQSpider FanhuaandLuomu Python

Crawls QQ user information (basic details such as QQ number, nickname, birthday, and address) and performs a brief analysis.

21
crawler
crawler mediamonks PHP

Crawl your own website with various clients for SEO and indexing purposes.

21
ZhengFang_System_Spider
ZhengFang_System_Spider ZYSzys Python

:bug: A small crawler that logs in to the ZhengFang academic administration system and scrapes data

21
html-article-extractor
html-article-extractor woojubb JavaScript

A web page content extractor

21
MovieRater
MovieRater Asing1001 TypeScript

A useful website for finding movie ratings in Chinese and English by crawling Yahoo, PTT, and IMDb.

21
covid-19-crawler
covid-19-crawler LiveCoronaDetector Python

Crawls COVID-19 confirmed case counts and information

21
taiwanlottery
taiwanlottery yiyu0x Python

Taiwan Lottery Crawler 🐛 (Crawler for Various Types of Lotteries in Taiwan)

21
xianzhi_articles
xianzhi_articles h4ckdepy

Xianzhi article crawler project [includes all articles published before July 2021]

21
actor-youtube-scraper
actor-youtube-scraper bernardro JavaScript

Apify actor to scrape YouTube search results. You can set the maximum number of videos to scrape per page as well as the date from which to start scraping.

21
proxycrawl-php
proxycrawl-php crawlbase PHP

ProxyCrawl PHP library for scraping and crawling websites

21
scrapy_poetry
scrapy_poetry Imposingapple Python

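This project uses Scrapy to crawl Gushiwen (古诗文网), collecting the tune name (cipai), author, text, annotations, and background of composition for Song ci poems in various categories (love, Qixi Festival, etc.).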

21
estate-crawler
estate-crawler nstapelbroek Python

Scraping the real estate agencies for up-to-date house listings as soon as they arrive!

21
lopez
lopez tokahuke Rust

Crawling and scraping the Web for fun and profit

21
Codeforces-AutoCommit
Codeforces-AutoCommit ISKU Python

When you solve a problem on Codeforces, it automatically commits and pushes to the remote repository.

21
book-spider
book-spider Cansiny0320 TypeScript

🎉 An out-of-the-box, high-performance, customizable Biquge novel crawler for quickly downloading ad-free novels

21
Crawler
Crawler ggfgh Python

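A collection of crawler demos I wrote between October and December 2021, such as a URL-collection script for SQL injection in penetration testing (crawls URLs from Bing and Baidu search results), a subdomain brute-forcing demo, vulnerability information for major universities...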

21
telegram-member-inviter
telegram-member-inviter mjavadhpour Python

Crawling client's groups and channels to invite their members to a target group.

21
vermouth
vermouth yasongxu Python

A torrent site written in Python, plus a Douban scraper

20
crawl
crawl crackcomm Go

Lightweight library for scalable crawlers in Go.

20
crawler
crawler xbynet Java

A simple and flexible web crawler framework for Java.

20
scrapy-azuresearch-crawler-samples
scrapy-azuresearch-crawler-samples yokawasa Python

Scrapy as a Web Crawler for Azure Search Samples

20
botanalyse
botanalyse gtbotsonar

Botsonar analysis open API

20
domfind
domfind diogo-fernan Python

A Python DNS crawler to find identical domain names under different TLDs.

20
hero
hero iflycn Python

Quiz assistant for Million Heroes (百万英雄), compatible with all quiz apps

20
flutter_spider_fx
flutter_spider_fx Deali-Axy Dart

A Flutter crawler framework that helps developers quickly build crawlers on mobile devices (single-threaded version)

20