Most popular scraping repositories and open source projects

data-engineering-challenge-th

Dockerizing a Python script for web scraping and consuming the scraped d...

2   15   15  

Earnings_Call_Analyzed_By_NLP

Earnings Call Sentiment Analysis. This repository includes my work on...

3   15   15  

prntscraper

An effective random image scraper for the website image hosting and sh...

2   14   14  

Sreality

Sreality, Scraping, Analysis, Python

7   14   14  

short-term-rentals-warehouse

Pipeline, warehouse, and visualization tools for investigating the imp...

1   14   14  

scraperlite

Scrape text and HTML based on CSS selectors and save contents to a SQL...

0   14   14  

Linkedin-Job-Postings-Visualization-and-Analysis-Python

This Python script scrapes up to the 100 most recent LinkedIn job postings...

1   14   14  

Investopedia-Bot

Pick the best stocks and automate Investopedia

10   14   14  

scholar-scrap

Extract relevant information of research papers, into a downloadable C...

3   14   14  

Web-Resource-Downloader

This is a Python script that downloads all resources (images, scripts,...

3   14   14  

LLM_InformationRetrieval

Extracting "structured" information that is embedded in natural langua...

2   14   14  

tradingview.com-scraper

A Python project that scrapes data from www.tradingview.com stores it...

2   14   14  

volleystats

🏐 Command-line tool to scrape volleyball statistics from Data Project...

0   14   14  

idealista_data_extraction

Data extraction from Idealista, using their API and web scraping (pyth...

8   14   14  

NewApkPure

Search and download applications from apkpure.com

2   14   14  

ToKillATweetingBird

A Twitter scraper to retrieve tweets and users from X (formerly Twitte...

4   14   14  

python-overwatch

A simple API for scraping Overwatch stats

5   14   14  

scotch-scraping-node

Simple app for scraping author profiles and tutorials from Scotch.io -...

2   14   14  

COVID-19-ANGOLA

An app for collecting data on COVID-19 in Angola.

1   14   14  

SeleniumSample

A set of samples about login and cookies with PhantomJS

11   14   14  

nightmareHeadlessTest

test project to execute nightmare in headless mode

2   14   14  

scavenger

Scrape and take screenshots of dynamic and static webpages

2   14   14  

browse

browse is a declarative programming language for web scraping, automat...

5   14   14  

uruguayan_parliamentary_session_diary

Code for my blog post about text mining Uruguayan parliamentary sessio...

3   14   14  

Temphael

A Tumblr-scraping text post bot

3   14   14  

-Competitive-Coding-Problem-Classifier-and-Recommender

Competitive Coding Problem Classifier and Problem Recommendation

6   14   14  

InstaScraper

A Simple Scraper for Instagram public accounts' E-mail addresses using...

5   14   14  

kirinuki-core

Kirinuki is a library that converts any HTML to JSON using CSS selector...

2   14   14  

worker

Containerized Ferret worker

7   14   14  

scrapy-twitter

Web scraper based on Scrapy to fetch tweets from a list of user accoun...

3   14   14  

BestCarDeal

💰 Scraping, Visualizing, and Analyzing 1,700,000 Entries of U...

5   14   14  

manifold

Manifold is a plug-and-play end-to-end real estate asset tracker, from...

0   14   14  

Faceapp-Gender-Swap-Detection

Detecting fake photos generated by FACEAPP gender swap feature using D...

2   14   14  

Dataset-Indian-Companies

Web Scraping "List of companies in India" from AmbitionBox Website usi...

8   14   14  

playstore-scraper

🏷️ A simple and fast way to get search results and more from Google Pl...

0   14   14  

GSOC_Data_Extractor

A simple tool created to make life easier for the people applying for...

4   13   13  

minervaclient

A CLI client for Minerva course registration

2   13   13  

subscene_scraper

Library to download subtitles from subscene.com

6   13   13  

Justdial-Scrapper

A 100% working Justdial scraper; just enter the URL and it'll extract...

22   13   13  

irasutoya

👩 Ruby library for いらすとや

2   13   13  

venom-tutorial

A tutorial based on your preferred open source focused crawler for the...

0   13   13  

spotify-scraper

A Python Windows API for getting the artist, song, and album art for t...

3   13   13  

acciotables

API to scrape data from dynamic webpages (say, tables on Sports Referen...

0   13   13  

facebook_events_scraper

Scrape Facebook page events (recurring and upcoming), and individual ev...

2   13   13  

Linkedin-Scraper

Selenium based LinkedIn profile data scraper

0   13   13  

statscraper

A base library for building web scrapers for statistical data, and a h...

4   13   13  

github-trending

Command line tool for fetching GitHub trending repositories

3   13   13  

Reddit-Scraper

Reddit Scraper is a Python script that utilizes the PRAW Python libra...

1   13   13  

FreshProxies

Fresh Proxies | Proxied Browser | HTTP/HTTPS/SOCKS4/SOCKS5

6   13   13  

nhkore

🇯🇵📰🗻 NHK News Web (Easy) word frequency (core list) scraper for Japa...

2   13   13