Most popular scraping repositories and open source projects

puppeteer-boiler

🛢 A batteries included boilerplate for puppeteer-extra. Automate all t...

2   17   17  

alibaba_scraper

Alibaba scraper with using of rotating proxies and headless Chrome fro...

3   17   17  

go-scrapy

Web crawling and scraping framework for Golang

2   16   16  

Shaman.Dokan.Warc

Mounts WARC files on Windows

1   16   16  

pyReptile

web crawling & scraping framework for Python

7   16   16  

scrawler

Scala web crawling and scraping using fs2 streams

3   16   16  

monitor_web_page_changes

I thought it would be nice to get an email alert when a new job postin...

4   16   16  

gochanges

**[ARCHIVED]** website changes tracker 🔍

3   16   16  

quora-scraper

Quora Question Scraper - Find & Export relevant Questions 10x faster

2   16   16  

precios-claros

Precios Claros (http://preciosclaros.gob.ar) scraper / data downloader

4   16   16  

locust

Distributed web data discovery and collection framework built for serv...

1   16   16  

autobet

Soccer outcome modeling and algorithmic betting system

7   16   16  

socialblade-com-api

Unofficial APIs for socialblade.com website.

7   16   16  

Discord-Qr-Code-Token

Generate a QR code of connection and introduce it automatically on a b...

6   16   16  

telegram_members_scrapper

Python Script to scrape members from a selected Telegram group.

10   16   16  

Zeiver

A Scraper, Downloader, & Recorder for static open directories.

1   16   16  

Impressionist

Impressionist is a JavaScript library that allows you to scrape data i...

2   16   16  

SocialNetworkScraper

Web scraping is simply the process of using a social media web scraper...

3   16   16  

Redbubble-automatisation-bot

Upload automatisation for RedBubble

2   16   16  

zenrows-node-sdk

SDK to access ZenRows API directly from Node.js. We handle proxies rot...

3   16   16  

crawlbase-php

A lightweight, dependency free PHP class that acts as wrapper for Craw...

1   16   16  

Bootleg_Macro

A simple tool-kit written in python for sourcing and displaying macroe...

2   16   16  

CBBpy

A Python-based web scraper for NCAA basketball.

3   16   16  

israeli-supermarket-scarpers

A python package with client to scrape the israeli supermarkets data

4   16   16  

elata-vsm-system-4

AI-powered intelligence platform aggregating breakthroughs in neurosci...

2   16   16  

ensembledata-python

Python library to scrape social media data via the EnsembleData API.

2   16   16  

streamlit-selenium-chrome

Scrape the web with Selenium and Chrome on Streamlit's Community Cloud

81   16   16  

langchain-scrapegraph

ScrapeGraph client langchain integration

2   16   16  

abx-spec-behaviors

🧩 Proposal to allow user scripts like "expand comments", "hide popups...

0   16   16  

google-images-scraper

Google images scrapper in full resolution with multi threading speed.

3   16   16  

tradingview-script-downloader

This Python script automates the extraction of PineScript codes from t...

1   16   16  

amazon-data-scraper

A Python script that uses Selenium and BeautifulSoup to scrape data fr...

4   15   15  

scrapingai

Build web scraping agents using AI to auto-extract the data from websi...

2   15   15  

Exploring-the-Landscape-of-the-Egyptian-Software-Market

A Data-driven Approach. Our story begins with a quest for knowledge. T...

1   15   15  

Chrome-extension-web-scraping-in-bc-

Chrome extension(Web Scraping) in BC

0   15   15  

re-employment-kraken

re-employment-kraken scrapes (job) sites, remembers what it saw and no...

1   15   15  

spidey

Robust web spider for NodeJS

5   15   15  

autoscout24_scraping

Python-based web scraping and data analysis tool designed to collect v...

12   15   15  

undetectedselenium

Java implementation of python library undetected-chromedriver and sele...

3   15   15  

gunaydin

Your good mornings ☀️

5   15   15  

turtle

Instagram Photo Downloader

7   15   15  

spb-unofficial-wrapper

Unofficial NodeJS wrapper for the ScrapingBee API

0   15   15  

baking-lyrics

Random lyrics generator.

1   15   15  

internet-affordability

🌍 Did you know that internet costs >20% of the average income in some...

0   15   15  

schedule-tweet

Schedules tweets using TweetDeck

1   15   15  

javawebscrapinghandbook_code

15   15   15  

reddit-top-posts-scrapy

Scrape top posts from list of subreddits and insert into MongoDB

3   15   15  

proxycrawl-ruby

ProxyCrawl API ruby gem for scraping and crawling

0   15   15  

web-scraping

Code samples of web scraping using Java.

8   15   15  

VITask

VITask is a Dynamic API server for VTOP with Moodle Integration. This...

6   15   15