Most popular scraping repositories and open source projects

diffbot-php-client

[Deprecated - Maintenance mode - use APIs directly please!] The offici...

20   53   53  

trex

youtube & tiktok analysis + youchoose recommendation custmizer. backen...

15   53   53  

garlic

πŸ§„πŸ§› protect your website from being scraped by bots.

0   53   53  

Ecole-Directe-Plus

A better EcoleDirecte (unaffiliated): more pleasant, functional, and i...

25   53   53  

scraping-reviews-from-googlemaps

This is a simple script with python to scrap Google Maps reviews and r...

18   52   52  

dart-scraper

ν•œκ΅­ κΈˆμœ΅κ°λ…μ›μ—μ„œ μš΄μ˜ν•˜λŠ” λ‹€νŠΈ(Dart) μ‹œμŠ€ν…œμ„ μ΄μš©ν•œ κΈ°μ—… μž¬λ¬΄μ œν‘œ...

22   52   52  

hext

Domain-specific language for extracting structured data from HTML docu...

3   52   52  

tiktok-trending-data-api

Scraping the TikTok Discovery Data API every 1 hour using Github Actio...

7   52   52  

scrapers

scrapers for building your own image databases

7   51   51  

freenom-auto-renew-domains

A scraper built with puppeteer that auto renew free domains on Freenom...

20   51   51  

aniyoi-api

REST API Anime Subtitle Indonesia | Streaming Anime Sub Indo

12   51   51  

CaseHarvester

AWS-based application for scraping the Maryland Judiciary Case Search

10   51   51  

imslp

🎼 The clean and modern way of accessing IMSLP data and scores program...

6   50   50  

instagram-without-api

A simple PHP code to get unlimited instagram public pictures by every...

14   49   49  

local-api-client-python

Official Python library for interacting with Kameleo Client

6   48   48  

configs

Public, free to use, repository with diggers configs for scraping / ex...

16   48   48  

beautifulsoup-tutorial

:sparkles: :ramen: Scrape webpage metadata using BeautifulSoup.

17   47   47  

News_Summary

Dataset and scripts for scraping the news articles from popular source...

28   47   47  

AngleParse

HTML parsing and processing tool for PowerShell.

6   47   47  

Scraping-Dynamic-JavaScript-Ajax-Websites-With-BeautifulSoup

A guide on how to scrape JavaScript rendered websites with Python and...

9   47   47  

thecrowler

A Content Discovery and Development Platform. Empowering Cybersecurity...

9   47   47  

wajik-anime-api

REST API streaming dan download Anime subtitle Indonesia | sub Indo

35   47   47  

scrape-google-python

In this tutorial, we showcase how to scrape public Google data with Py...

0   47   47  

CraigslistScraper

Simple webscraper for Craigslist.

19   46   46  

react-node-web-scraper

Final Year project, scraping data of e-commerce stores and display in...

23   46   46  

socials

πŸ‘¨β€πŸ‘©β€πŸ‘¦ Social account detection and extraction in Python, e.g. for c...

9   46   46  

jason-the-miner

⛏ A versatile Web scraper for Node.js

11   45   45  

image-collector

Download images from Google Image Search

23   45   45  

local-api-client-typescript

Official JavaScript/TypeScript library for interacting with Kameleo Cl...

3   45   45  

xdsl-exporter

xDSL Prometheus Exporter

3   45   45  

Outlook-account-creator

Python tool that automatically create outlook account with auto-captch...

8   45   45  

spidercreator

Automated web scraping spider generation using Browser Use and LLMs. S...

3   45   45  

getter

Get and put users (scraping) to the target group/channel efficiently,...

22   44   44  

jimov_api

This project is an open-source API for retrieving multimedia content s...

18   44   44  

greenlight

A Golang based Undetected Web Automation Framework

4   44   44  

python

Repo Python

1   44   44  

sniffagrammers

Node.js and PHP files to automatically downloading pictures from insta...

3   44   44  

scrapegraph-sdk

πŸ•·οΈ Official Scrapegraph API SDK: Effortlessly extract content from any...

5   44   44  

oversmash

Overwatch API library for player details and career stats

7   44   44  

go-ps4

Search your favorite PS4 games from Playstation Store using the Comman...

6   44   44  

RARBG-scraper

With Selenium headless browsing and CAPTCHA solving

11   44   44  

activesoup

A headless pure-python browser for the web

5   43   43  

scrape-github-trending

Tutorial for web scraping / crawling with Node.js.

8   43   43  

bluebird

Unofficial Python client for Twitter

14   43   43  

torchestrator

Spin up Tor containers and then proxy HTTP requests via these Tor inst...

8   43   43  

info-bot

πŸ€– A Versatile Telegram Bot

14   43   43  

how-to-scrape-google-trends

Learn step-by-step how to scrape Google Trends data and make a result...

0   43   43  

firecrawl-quickstarts

A collection of cookbooks to help developers get started quickly with...

3   43   43  

scaling-to-distributed-crawling

Repository for the Mastering Web Scraping in Python: Scaling to Distri...

9   42   42  

youtube-comment-scraper

This script will dump youtube video comments to a CSV from youtube vid...

15   42   42