Topic

scraping

Repositories (1626)

schedule-tweet
schedule-tweet honzajavorek Python

Schedules tweets using TweetDeck

15
reddit-top-posts-scrapy
reddit-top-posts-scrapy alysivji Python

Scrape top posts from list of subreddits and insert into MongoDB

15
javawebscrapinghandbook_code
javawebscrapinghandbook_code ksahin Java
15
-Competitive-Coding-Problem-Classifier-and-Recommender
-Competitive-Coding-Problem-Classifier-and-Recommender ParasAvkirkar Python

Competitive Coding Problem Classifier and Problem Recommendation

14
Linkedin-Job-Postings-Visualization-and-Analysis-Python
Linkedin-Job-Postings-Visualization-and-Analysis-Python DmytroNorth Jupyter Notebook

This Python script scrapes up to 100 most recent Linkedin job postings of any job title and creates sentiment visualization in a form of a word cloud.

14
Investopedia-Bot
Investopedia-Bot bassel27 Python

Pick the best stocks and automate Investopedia

14
python-overwatch
python-overwatch alexbotello Python

A simple API for scraping Overwatch stats

14
scrapy-twitter
scrapy-twitter BurnzZ Python

Web scraper based on Scrapy to fetch tweets from a list of user accounts

14
scotch-scraping-node
scotch-scraping-node gladchinda JavaScript

Simple app for scraping author profiles and tutorials from Scotch.io - https://scotch.io.

14
COVID-19-ANGOLA
COVID-19-ANGOLA EmanuelJoseCandido PHP

Um app para colecta de dados sobre o COVID-19 em Angola.

14
SeleniumSample
SeleniumSample qwefgh90 Python

a set of samples about Login & Cookie with PhantomJS

14
worker
worker MontFerret Go

Containerized Ferret worker

14
short-term-rentals-warehouse
short-term-rentals-warehouse rsanjabi Python

Pipeline, warehouse, and visualization tools for investigating the impact of Airbnb short-term rentals on world cities.

14
Sreality
Sreality JirkaZelenka Jupyter Notebook

Sreality, Scraping, Analysis, Python

14
scholar-scrap
scholar-scrap nainiayoub Python

Extract relevant information of research papers, into a downloadable CSV file, from Google Scholar based on user input.

14
scraperlite
scraperlite danp Go

Scrape text and HTML based on CSS selectors and save contents to a SQLite database.

14
kirinuki-core
kirinuki-core rike422 TypeScript

Kirinuki is a library that convert any html to JSON using CSS selectors.

14
prntscraper
prntscraper ItzBlinkzy Python

An effective random image scraper for the website image hosting and sharing website (https://prnt.sc)

14
scavenger
scavenger pietrovismara JavaScript

Scrape and take screenshots of dynamic and static webpages

14
Faceapp-Gender-Swap-Detection
Faceapp-Gender-Swap-Detection moaaztaha Jupyter Notebook

Detecting fake photos generated by FACEAPP gender swap feature using Deep Learning.

14
Dataset-Indian-Companies
Dataset-Indian-Companies mratanusarkar Jupyter Notebook

Web Scraping "List of companies in India" from AmbitionBox Website using Python and Beautiful Soup

14
playstore-scraper
playstore-scraper LuanRT JavaScript

🏷️ A simple and fast way to get search results and more from Google Play Store.

14
manifold
manifold Guilherme-B Python

Manifold is a plug-and-play end-to-end real estate asset tracker, from web scraping to ETL (data warehouse) using Python, Go, Apache Airflow/Spark, AW...

14
BestCarDeal
BestCarDeal TheAhmadOsman HTML

:moneybag: Scraping, Visualizing, and Analyzing 1,700,000 Entries of Used Cars for Sale on Craigslist to Find The Best Car Deal :car:

14
Web-Resource-Downloader
Web-Resource-Downloader mehmetkahya0 Python

This is a Python script that downloads all resources (images, scripts, stylesheets, etc.) from a given website.

14
LLM_InformationRetrieval
LLM_InformationRetrieval yaminivibha Python

extracting "structured" information that is embedded in natural language text on the web using iterative set expansion, spanBERT, and openAI API

14
tradingview.com-scraper
tradingview.com-scraper svetlozardraganov Jupyter Notebook

A python project that scrapes data from www.tradingview.com stores it into database and visualize financial parameters like revenue, net income and et...

14
volleystats
volleystats claromes Python

🏐 Command-line tool to scrape volleyball statistics from Data Project Web Competition websites

14
idealista_data_extraction
idealista_data_extraction laurabarredaagusti Python

Data extraction from Idealista, using their API and web scraping (python). It contains both the datasets and the extraction files.

14
browse
browse windsorio JavaScript

browse is a declerative programming language for web scraping, automation and UI testing

14
NewApkPure
NewApkPure Alnyz Python

Search and download applications from apkpure.com

14
ToKillATweetingBird
ToKillATweetingBird zer0Percent Python

A Twitter scraper to retrieve tweets and users from X (formerly Twitter) without using the API.

14
uruguayan_parliamentary_session_diary
uruguayan_parliamentary_session_diary d4tagirl R

Code for my blog post about text mining uruguayan Parliamentary sessions 🇺🇾

14
amazon-wishless
amazon-wishless andre-st Python

The more and longer wishlists you have, the less you look for buying opportunities arising from price trends. This filters your Amazon lists by (used)...

14
Temphael
Temphael NoraCodes Python

A Tumblr-scraping text post bot

14
nightmareHeadlessTest
nightmareHeadlessTest gabrielperales JavaScript

test project to execute nightmare in headless mode

14
InstaScraper
InstaScraper supmanyu Python

A Simple Scraper for Instagram public accounts' E-mail addresses using Python and BeautifulSoup

14
cfr-iris-scraper
cfr-iris-scraper FlashWebIT CSS

Live train information: scrape-powered API with JSON endpoints for the Romanian national railway infrastructure company CFR S.A's realtime information...

13
subscene_scraper
subscene_scraper jodevsa JavaScript

Library to download subtitles from subscene.com

13
acciotables
acciotables npranav10

API to scrap data from dynamic webpages. (say tables on Sports Reference websites)

13
Fetcher
Fetcher neriymus JavaScript

A chrome extension which fetches your favourite feeds, so you don't have to.

13
React-YouTube-Comment-Section-Scraper
React-YouTube-Comment-Section-Scraper MikeM711 JavaScript

A full stack application that scrapes & filters YouTube comments using Google's Puppeteer, instead of using the YouTube API

13
api
api Yalies Python

👥 The best directory of Yale personnel, with a clean API to match. Used by 70% of undergrads!

13
Tap-News
Tap-News ZhekaiJin Python

Real Time News Scraping and Recommendation System

13
jjwxc-crawler
jjwxc-crawler dev-chenxing Python

基于Scrapy开发的晋江爬虫,根据书号下载小说非V章节,生成可编辑的Word文档 | A simple tool to scrape and download non-V chapters of any novel from jjwxc....

13
cms-downloader-refined
cms-downloader-refined aboueleyes Python

a tiny script that automates downloading files from guc cms website

13
price_compare_crawler
price_compare_crawler pritpalxyz Python

Product price comparison scrapy crawler

13
rust-scraping
rust-scraping gregstoll Rust

Examples of web scraping in Rust

13
dynamic-rendering-ultra-generation__vite-vue
dynamic-rendering-ultra-generation__vite-vue anhchangvt1994 TypeScript

Advanced SEO for Vite + VueJS Project

13
pro-penetration
pro-penetration fulldecent PHP

Penetration research against NSFW websites

13