Most popular scraping repositories and open source projects

emec-api

API Python para consulta na base de dados oficial do e-MEC

9   20   20  

reason-rust-scraper

🦀 Scraping & crawling websites using Rust, and ReasonML

1   19   19  

chesf

CHeSF is the Chrome Headless Scraping Framework, a very very alpha cod...

2   19   19  

scraper-supermercados

Pequeño programa para realizar scraping de precios de supermercados de...

5   19   19  

dust

Archive web pages with all relevant assets or save as a single file HT...

0   19   19  

proxycrawl-php

ProxyCrawl PHP library for scraping and crawling websites

5   19   19  

google-search-results-python

Scrape and parse Google search results in Python

7   19   19  

MotoGP-API

MotoGP Api: Library that reads the results of the MotoGP, 500cc, Moto2...

1   19   19  

Google-Web-Scraper

This Python code scrapes Google search results then applies sentiment...

8   19   19  

ptt-crawler

ptt-crawler is a web crawler module designed to scarpe data from Ptt.

8   19   19  

html2rss-configs

📇 A growing collection of html2rss feed configs. Generate configs with...

7   19   19  

board-game-scraper

Board game data scraper

4   19   19  

NFT-Dataset

Includes data about over 250 NFT Collections

3   19   19  

path-finder-rl

Method For Establishing Database For Global Value Chain For Parts Proc...

15   19   19  

timetable-grabber-sit

Timetable Grabber - SIT is a tool that allows you to grab and export y...

2   19   19  

sg-food-ml

This script is used to scrap images from the Internet to classify 5 c...

1   18   18  

scrapher

A web scraper for PHP to easily extract data from web pages

13   18   18  

crawler

Web Crawler created with Node.js and Puppeteer

1   18   18  

instagram_explorer

:camera: An app to scrap instagram posts and analyze data.

5   18   18  

instatag

Extract Instagram Users from tags (Public , Without API and Login)

3   18   18  

go-scrapy

Web crawling and scraping framework for Golang

4   18   18  

hass-multiscrape

Home Assistant custom component for scraping multiple values (from a s...

4   18   18  

document-dl

Command line program to download documents from web portals

3   18   18  

master-to-pythonista

A list of awesome beginners-friendly projects.

19   18   18  

froxy

Hide your IP with free proxies using Froxy 🔄

1   18   18  

zoominfo_scraper

Zoominfo scraper with using of rotating proxies and headless Chrome fr...

6   18   18  

htmltab

Command-line utility to convert HTML tables into CSV files

2   18   18  

udemyscraper

A Udemy Course Scraper built with bs4 and selenium, that fetches udemy...

10   18   18  

scrapeops-scrapy-sdk

Scrapy extension that gives you all the scraping monitoring, alerting...

3   18   18  

raiplay-dl

The most advanced raiplay.it downloader

3   18   18  

GoogleBard

A reverse engineered API for Google Bard chatbot for NodeJS

1   18   18  

scrapy-fieldstats

A Scrapy extension to log items coverage when the spider shuts down

4   17   17  

nintendo-games-ratings

Dataset and visualizations of Nintendo Games and ratings, scraped from...

5   17   17  

Shaman.Dokan.Warc

Mounts WARC files on Windows

1   17   17  

precios-claros

Precios Claros (http://preciosclaros.gob.ar) scraper / data downloader

4   17   17  

jwscraper

A python library for scraping videos from JW Player

6   17   17  

kenya-news-scrapper

It scrapes various kenyan news sites and returns top news from each an...

7   17   17  

Instagram-Network_scraping_and_analysis

Python script to scrape Instagram network

3   17   17  

tiktok-trending-data

Scraping the TikTok discovery web API every 15 minutes using Github Ac...

2   17   17  

Justdial-Scraper

JustDial Scraper to scrap all the requested data which includes their...

14   17   17  

papercut

Papercut is a scraping/crawling library for Node.js built on top of JS...

1   17   17  

AnimeDl

⚡️An API for downloading or streaming your favorite anime.

1   17   17  

noscrape

obfuscate text via node to make scraping your content really difficult...

5   17   17  

public-roadmap

Public roadmap for SerpApi, LLC (https://serpapi.com)

0   17   17  

WebHere

HTML scraping for Objective-C.

6   16   16  

gunaydin

Your good mornings ☀️

5   16   16  

pyReptile

web crawling & scraping framework for Python

7   16   16  

web-clipper

Easily download the main content of a web page in html, markdown, and/...

0   16   16  

scrapyd-mongodb

Library designed to replace the SQLite backend by a MongoDB backend on...

9   16   16  

ScrapeBot

A Selenium-driven tool for automated website interaction and scraping.

5   16   16