Topic

crawler

Repositories (1232)

soccer-scrape
soccer-scrape o8e JavaScript

📃 Scrape football data from Bet365

27
php-google
php-google howie6879 PHP

Google search results crawler in PHP: fetch the Google search results you need

27
AyugeSpiderTools
AyugeSpiderTools shengchenyang Python

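Scrapy extension library: its main purpose is to spare Scrapy developers from writing item, pipeline, middleware, and similar modules for common scenarios; combined with templating and a set of commonly used development methods, it lets crawler dev...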

27
PyperGrabber
PyperGrabber pykong Python

Fetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.

27
Android-Apps-Downloader
Android-Apps-Downloader harismuneer Python

📱 A tool to download Android apps from the Google Play Store and the Xiaomi App Store (the famous Chinese store).

27
generic-seeder
generic-seeder team-exor C++

Generic altcoin DNS seeder. Compatible with virtually any cryptocurrency cloned from bitcoin. Built-in lightweight DNS server ~ Cloudflare DNS support...

27
iranian-news-agencies-crawler
iranian-news-agencies-crawler hamid JavaScript

A crawler to fetch the latest news from Iranian (Persian) news agencies.

27
selenium_facebook_scraper
selenium_facebook_scraper Mhmd-Hisham Python

A simple Python 3 script used to download a user's friend list from Facebook.

27
botcity-framework-web-python
botcity-framework-web-python botcity-dev Python

BotCity Framework Web - Python

27
pinscrape
pinscrape iamatulsingh Python

A simple library to scrape Pinterest images written in Python

27
YourLesson
YourLesson Lewin671 Python

Shenzhen University course-grabbing (course registration) system

27
spider
spider GeoffZhu JavaScript

A web spider framework

27
serverless-crawler-demo
serverless-crawler-demo novemberde JavaScript

Serverless Architecture Crawler demo

26
PY-Login
PY-Login PY-Trade Python

Simulates logging in to various websites and drives their APIs to do all sorts of unmentionable things

26
pimcore-lucene-search
pimcore-lucene-search dachcom-digital PHP

Pimcore Website Indexer (powered by Zend Search Lucene)

26
od-database-crawler
od-database-crawler terorie Go

OD-Database Go crawler

26
LeetCodeCrawler
LeetCodeCrawler ZhaoxiZhang Java

A tool for crawling the description and accepted submitted code of problems on the LeetCode and LeetCode-Cn website.

26
narr
narr IljaN Go
26
nivinEdu
nivinEdu nivin-studio

拟物校园 (nivinEdu), an open-source solution for bringing university academic administration to mobile.

26
BestBuy-Parser
BestBuy-Parser gamemann Python

A personal tool using Python's Scrapy framework to scrape Best Buy's product pages for RTX 3080 TIs and notify if available/not sold out.

26
tor-ip-rotation-python-example
tor-ip-rotation-python-example baatout Python

An example of Tor IP rotation in Python

26
cambridge
cambridge mhwgoo Python

Terminal version of the Cambridge Dictionary by default. Also supports the Merriam-Webster Dictionary.

26
USPTO-PatFT-Web-Crawler
USPTO-PatFT-Web-Crawler mattwang44 Python

Crawler for fetching information of US Patents and PDF bulk download

26
douyin-sdk
douyin-sdk Video-Hub Python

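Contact via WeChat (1764328791); Douyin SDK, Douyin data, Douyin live-stream data, Douyin live-stream API, Douyin video API, Douyin crawler, Douyin watermark removal, Douyin video download, Douyin video parsing, Douyin live-stream monitoring, ...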

26
PySitemap
PySitemap Cartman720 Python

Simple sitemap generator with Python 3

26
FTPSearcher
FTPSearcher Sunlight-Rim Python

Asynchronous file scanner and downloader for FTP servers. Also takes IP ranges.

26
froxy
froxy matheusfelipeog Python

Hide your IP with free proxies using Froxy 🔄

26
n46-crawler
n46-crawler janelin612 JavaScript

Nogizaka46 Blog Crawler - backup tool for the blogs of graduated Nogizaka46 members

26
get-site-urls
get-site-urls alex-page JavaScript

🔗 Get all of the URLs from a website.

26
wallstreetcnScrapy
wallstreetcnScrapy jianzhichun Python

A Scrapy crawler for wallstreetcn and finance.sina: crawls Sina Finance, Tonghuashun Finance, and Wallstreetcn

25
tucan-tools
tucan-tools tucanlib Python

Nomen est omen. It exports tucan grades/vv etc.

25
zhihu-crawler
zhihu-crawler pithyone PHP

Lightweight Zhihu crawler supporting questions, collections, and this month's hottest

25
wind-bell
wind-bell yishuifengxiao Java

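Wind-bell (风铃虫) is a lightweight crawler tool, as sensitive as a wind chime and as nimble as a spider; it senses the slightest stir and easily fetches content from the internet. It is a spider program that is relatively friendly to target servers...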

25
ProxyCrawler
ProxyCrawler WeihanLi C#

Proxy crawler service that crawls proxy IPs and stores them in Redis; topshelf + Quartz.Net + redis

25
CrawlerDetectBundle
CrawlerDetectBundle nicolasmure PHP

A Symfony bundle for the Crawler-Detect library (detects bots/crawlers/spiders via the user agent)

25
CyberCrowl
CyberCrowl tnmch Python

CyberCrowl is a Python web path scanner tool

25
Real_Time_Social_Media_Mining
Real_Time_Social_Media_Mining stormsinbrewing HTML

DevOps pipeline for Real Time Social/Web Mining

25
crazyDhtSpider
crazyDhtSpider ixiaofeng PHP

A PHP DHT crawler based on Swoole, with remarkably high efficiency

25
MedicalKG
MedicalKG yeeeqichen Python

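Hands-on medical knowledge graph construction: crawls Baidu Baike data, stores structured triples in MongoDB, and uses neo4j to build and visualize the knowledge graph; Medical Knowledge Graph; Crawler;...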

25
novelsave_sources
novelsave_sources m-haisham Python

A collection of webnovel sources offering varying amounts of scraping capability.

25
social-media-archiver
social-media-archiver Combo819 TypeScript

A Node.js template that can be implemented to archive posts from any social media platform.

25
marmot
marmot hunterhug Go

💐Marmot A Golang HTTP Download

25
Techweekly
Techweekly xiongwilee JavaScript

Highly configurable tool for pushing tech weekly newsletters by email

24
realestate-scraper
realestate-scraper pauloromeira Python

A scraper that gathers data from real estate ads

24
PaperCrawler
PaperCrawler JustJokerX Python

Crawler used to crawl papers

24
AndroidValidatorCrawler
AndroidValidatorCrawler AliAzaz Kotlin

Kotlin validator library that can inspect any type of form; provides multiple validation functions, including clearing views

24
Pinterest-Crawler
Pinterest-Crawler SajjadAemmi Python

Downloads HD images from Pinterest by keyword

24
dorker
dorker 0xdln1 Python

Better Google Dorking with Dorker.

24
kontests
kontests AliOsm Ruby

Competitive programming contests schedule

24
convertible-bond-crawler
convertible-bond-crawler jackluson HTML

Convertible bond data and strategy analysis from Ningwen (formerly Futou) and Jisilu

24