Most popular scraping repositories and open source projects

social-media-profiles-regexs lorey Python

:card_index: Extract social media profiles and more with regular expressions

652 71 23

comic-dl Xonshiz Python

Comic-dl is a command line tool to download manga and comics from various comic and manga sites. Supported sites : readcomiconline.to, mangafox.me, co...

649 72 649

pricewise adrianhajdin TypeScript

Dive into web scraping and build a Next.js 13 eCommerce price tracker within a single video that teaches you data scraping, cron jobs, sending emails,...

641 186 641

reverse-api-engineer kalil0321 Python

Claude engineer that captures traffic, writes documentation and automatically generates API clients. Reverse engineer APIs!

641 59 641

docker-selenium-lambda umihico Dockerfile

The simplest demo of chrome automation by python and selenium in AWS Lambda

621 141 9

LinkedInDumper l4rm4nd Python

Python 3 script to dump/scrape/extract company employees from LinkedIn API

602 59 9

newcrawler speed JavaScript

Free Web Scraping Tool with Java

587 112 587

PHPScraper spekulatius PHP

A universal web-util for PHP.

586 76 586

n8n-nodes-puppeteer drudge TypeScript

n8n node for browser automation using Puppeteer

583 101 4

juriscraper freelawproject HTML

An API to scrape American court websites for metadata.

568 152 568

reddit-universal-scraper ksanjeev284 Python

Universal Reddit Scraper - Works on any Subreddit or User

568 87 6

spidermon scrapinghub Python

Scrapy Extension for monitoring spiders execution.

561 102 68

Ominis-OSINT AnonCatalyst Python

This Python application is an OSINT (Open Source Intelligence) tool called "Ominis OSINT - Web Hunter." It performs online information gathering by qu...

560 58 560

jekyll programminghistorian HTML

Jekyll-based static site for The Programming Historian

546 225 41

facebook_data_analyzer Lackoftactics Ruby

Analyze facebook copy of your data with ruby language. Download zip file from facebook and get info about friends ranking by message, vocabulary, con...

542 50 542

jikan-rest jikan-me PHP

The REST API for Jikan

538 289 9

scrape-linkedin-selenium austinoboyle HTML

`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.

526 167 526

quick-start-guide oxylabs

Python quick start guides to get the most out of Oxylabs' Web Scraper API free trial.

516 3 516

Musoq Puchaczov C#

SQL Runtime without any database

505 22 6

scrapple AlexMathew Python

A framework for creating semi-automatic web content extractors

503 41 21

quetre zyachel JavaScript

A libre front-end for Quora

499 36 7

gogoanime-api riimuru JavaScript

Anime Streaming, Discovery API made with Cheerio and Express. Uses data from Gogoanime

498 138 498

nickjs phantombuster JavaScript

Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)

496 49 5

MetaDetective franckferman Python

Unleash Metadata Intelligence with MetaDetective. Your Assistant Beyond Metagoofil.

495 58 6

search-engine-parser bisohns Python

Lightweight package to query popular search engines and scrape for result titles, links and descriptions

489 86 7

Kemono-Downloader Yuvi9587 Python

Kemono Downloader is a fast, powerful PyQt5 app for archiving content from a wide array of sites, including Kemono, Coomer, Bunkr, Erome, Saint2.su, n...

461 24 461

List-of-user-agents tamimibrahim17 Python

List of major web + mobile browser user agent strings. +1 Bonus script to scrape :)

461 219 461

ha-multiscrape danieldotnl Python

Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for eac...

441 20 8

libremdb zyachel TypeScript

A free & open source IMDb front-end.

428 36 9

tinking baptisteArno TypeScript

🧶 Extract data from any website without code, just clicks.

426 30 426

dude roniemartinez Python

dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators

426 19 1

scraperai scraperai HTML

ScraperAI is an open-source, AI-powered tool designed to simplify web scraping for users of all skill levels.

422 60 8

GoogleBard PawanOsman TypeScript

GoogleBard - A reverse engineered API for Google Bard chatbot for NodeJS

419 59 419

SpotAPI Aran404 Python

A python wrapper for the public & private Spotify API

415 37 415

lambdasoup aantron OCaml

Functional HTML scraping and rewriting with CSS in OCaml

408 35 10

Torrent-Api-py Ryuk-me Python

An Unofficial API for 1337x, Piratebay, Nyaasi, Torlock, Torrent Galaxy, Zooqle, Kickass, Bitsearch, MagnetDL,Libgen, YTS, Limetorrent, TorrentFunk, G...

401 245 401