Most popular scraping repositories and open source projects

automation-samples

Using clicknium to automate platforms like Linkedin, twitter, Slack, Y...

7   14   14  

apify-client-python

Apify API client for Python

3   14   14  

GSOC_Data_Extractor

A simple tool created to make life easier for the people applying for...

4   13   13  

minervaclient

A CLI client for Minerva course registration

4   13   13  

browse

browse is a declerative programming language for web scraping, automat...

5   13   13  

internet-affordability

🌍 Dataset that shows the Internet affordability by country (a shocking...

0   13   13  

uruguayan_parliamentary_session_diary

Code for my blog post about text mining uruguayan Parliamentary sessio...

3   13   13  

quora-scraper

Quora Question Scraper - Find & Export relevant Questions 10x faster

2   13   13  

spotify-scraper

A Python Windows API for getting the artist, song, and album art for t...

3   13   13  

node-fetch-dom

Magic utility that extract javascript global variables from a remote h...

0   13   13  

javawebscrapinghandbook_code

14   13   13  

reddit-top-posts-scrapy

Scrape top posts from list of subreddits and insert into MongoDB

3   13   13  

facebook_events_scraper

Scrape Facebook page events(recurring and upcoming), and individual ev...

2   13   13  

autobet

Soccer outcome modeling and algorithmic betting system

7   13   13  

statscraper

A base library for building web scrapers for statistical data, and a h...

4   13   13  

socials-api

👨‍👩‍👧‍👦 (Rest) API to extract social media profiles from websites or s...

5   13   13  

RSI-Scraper

Web Scaper for RSI

3   13   13  

product-integrations

Code examples and general information

6   13   13  

Statball

Statball - Football soccer stats analyser from top 5 european leagues...

1   13   13  

scraper

Python web scrapers

6   13   13  

cli

Ferret CLI

4   13   13  

moviestills

A small CLI app to scrap high-quality movie snapshots from various web...

0   13   13  

Fetcher

A chrome extension which fetches your favourite feeds, so you don't ha...

1   13   13  

puppeteer-table-parser

Scrape and parse HTML tables with the Puppeteer table parser.

2   13   13  

headless-task-server

A headless browser task manager based on Hero (Chrome)

2   13   13  

scraping-berita

scraping berita dari beberapa portal berita indonesia

1   13   13  

code4rena-scraper

Scraping Code4rena contest audits reports for stats, fun (and profit ?...

1   13   13  

Instagram-Tools

Little php class for instagram.

4   13   13  

wtchr

Open-source privacy focused show downloader & tracker.

0   12   12  

Justdial-Scrapper

A 100% working Justdial scrapper, Just enter the url and it'll extract...

22   12   12  

covid19br-pub

Projeto de monitoramento de publicações oficiais relacionadas a COVID-...

0   12   12  

crawlerUtils

Utils for programming web crawler

3   12   12  

gathering_data

Примеры к книге "Сбор данных в Интернете на языке R".

4   12   12  

venom-tutorial

A tutorial based on your preferred open source focused crawler for the...

0   12   12  

acciotables

API to scrap data from dynamic webpages. (say tables on Sports Referen...

0   12   12  

php-scrape

A simple, easy to use, scalable scraping framework written in PHP

4   12   12  

web_scraper

An application designed to scrap the web and retrieve information from...

0   12   12  

PyCarGr

PyCarGr - Unofficial car.gr API

12   12   12  

github-trending

Command line tool for fetching GitHub trending repositories

2   12   12  

Reddit-Scraper

Reddit Scraper is a Python script that utilizies the PRAW Python libra...

1   12   12  

journal

A movie journal coupled with open IMDb data, and a Flask web-app for e...

0   12   12  

worker

Containerized Ferret worker

7   12   12  

fastlinkcheck

Check local static links and online links fast and in parallel

6   12   12  

scrapy-twitter

Web scraper based on Scrapy to fetch tweets from a list of user accoun...

3   12   12  

WebnovelYoinker

Downloads converts webnovels to Epub or PDF. Supported websites: wuxia...

1   12   12  

Realestate

Contains a set of Python scripts for real estate data analysis, visual...

7   12   12  

ProductHunt-scraper

Producthunt.com famous website scraper script. Scrap all offers and sa...

7   12   12  

cloudflare-iuam-solver

CloudflareIuamSolver is the Java library for breaking through the Clou...

4   12   12  

PressScraper

A scraper application for crawling press releases of US government age...

4   12   12  

chromedl

Go library for scraping or downloading files bypassing Cloudflare prot...

2   12   12