Topic

scraping

Repositories (1766)

autopahe
autopahe haxsysgit Python

Automatically download all your favorite anime from AnimePahe

23
autoscout24_scraping
autoscout24_scraping lorenzoelia Python

Python-based web scraping and data analysis tool designed to collect vehicle listings from the Autoscout24 website.

23
yahoo-auction-alert-discord-bot
yahoo-auction-alert-discord-bot vlourme Python

Get a Discord alert when a new item is posted on Yahoo Auction or Mercari

23
BrowserProxy
BrowserProxy aioke JavaScript

Allows bypassing Cloudflare checks

23
code4rena-scraper
code4rena-scraper 0237h HTML

Scraping Code4rena contest audit reports for stats, fun (and profit?)

23
PrawWallpaperDownloader
PrawWallpaperDownloader nikolajlauridsen Python

Download images from Reddit

22
EinsteinBot
EinsteinBot DouglasTaylorSupportGroup Python

🤖 A Discord bot that allows you to access solutions to homework problems from Chegg.

22
agentql-integrations
agentql-integrations tinyfish-io Jupyter Notebook

AgentQL's integrations with workflow automation tools and AI agent frameworks let you extract structured data from web pages using queries or natural...

22
deepstate-map-data
deepstate-map-data cyterat Jupyter Notebook

DeepState Map | Occupied | GeoJSON Multipolygon | Daily update

22
document-dl
document-dl heeplr Python

Command line program to download documents from web portals

22
price_tracker
price_tracker lucafluri Dart

Price Tracker app for Android and iOS. Built with Flutter

22
qddate
qddate ivbeg Python

Quick and dirty date parsing Python library to parse HTML dates really fast

22
exoskeleton
exoskeleton RuedigerVoigt Python

A Python framework to build polite, but tenacious crawlers / scrapers with a MariaDB backend

22
scrapman
scrapman danielnieto JavaScript

Retrieve real HTML code (with JavaScript executed) from a URL, ultra fast, with support for loading multiple pages in parallel

22
puppeteer-table-parser
puppeteer-table-parser Tomas2D TypeScript

Scrape and parse HTML tables with the Puppeteer table parser.

22
imdb-api
imdb-api Scrip7 Go

[🚧 WIP] Cross-platform microservice to scrape the IMDb website.

22
google-images-scraper
google-images-scraper pruthvik-sheth Python

Google Images scraper that fetches full-resolution images, sped up with multithreading.

22
eCommerce-Scraping-API
eCommerce-Scraping-API Smartproxy PHP

eCommerce Scraping API code examples for Python, PHP and Node.js

22
mlscraper-rust
mlscraper-rust hilbigan HTML

Scrape structured data from HTML documents automatically

22
product-integrations
product-integrations oxylabs PHP

Code examples and general information

22
github-languages
github-languages alex-benoit Ruby

Tiny little Ruby on Rails website that crawls through your public GitHub repos to find out what your favourite languages are.

22
Champ
Champ umangahuja1 Python

A Telegram bot built with Python that serves some basic functions like weather, music charts, cricket scores and much more.

22
Justdial-Scraper
Justdial-Scraper yatin94 Python

JustDial scraper that scrapes all the requested business data, including name, address, email address and phone number.

21
memes-api
memes-api pr0gramista Python

API for scraping common 🇵🇱 meme sites

21
moviestills
moviestills kinoute Go

A small CLI app to scrape high-quality movie snapshots from various websites.

21
CMScrape
CMScrape DrankRock Python

A simple python CardMarket price scraper.

21
DeCryPt
DeCryPt WTSTiNy Python

Scrapes: [HTTP, HTTPS, SOCKS4, and SOCKS5] || "Bypasses" and scrapes paid proxies

21
Fuelprices_DK
Fuelprices_DK J-Lindvig Python

Scraping of 5 types of fuel ⛽ from 8 different fuel companies in Denmark 🇩🇰.

21
instagram-scraping-fish
instagram-scraping-fish mateuszbuda Jupyter Notebook

A tutorial for scraping Instagram profile information and posts using Scraping Fish API: https://scrapingfish.com

21
dzone-refcardz-downloader
dzone-refcardz-downloader luckylittle Go

Downloads all refcardz from https://dzone.com/refcardz

21
Bootleg_Macro
Bootleg_Macro HelloThereMatey Jupyter Notebook

A simple toolkit written in Python for sourcing and displaying macroeconomic and financial data.

21
ensembledata-python
ensembledata-python EnsembleData Python

Python library to scrape social media data via the EnsembleData API.

21
72m-domains-dataset
72m-domains-dataset digitalcortex

Dataset with unique registered domains extracted from Common Crawl's columnar index (cc-index).

21
py
py supadata-ai Python

Official Python SDK for the Supadata API.

21
ig-dm-reels-autodownload
ig-dm-reels-autodownload kelvinthh Python

A Python script that auto downloads all reels sent to your Instagram DM

21
cdp-proxy-interceptor
cdp-proxy-interceptor zackiles TypeScript

Transparent man-in-the-middle (MitM) proxy for the Chrome DevTools Protocol (CDP). Intercept, modify, inject, and filter messages and events between a...

21
Article-Web-Scraping
Article-Web-Scraping KalyanM45 Jupyter Notebook

This Python script is designed to scrape articles from The Guardian's technology section using their API. It fetches article data, extracts the titles...

21
proxycrawl-php
proxycrawl-php crawlbase PHP

ProxyCrawl PHP library for scraping and crawling websites

21
jwscraper
jwscraper morpheusthewhite Python

A python library for scraping videos from JW Player

21
bard-unofficial-api
bard-unofficial-api AdamSEY JavaScript

Google's Bard ChatBot Unofficial NodeJS API

21
Justdail-scrapper
Justdail-scrapper harsh4870 Python

A 100% working Justdial scraper: just enter the URL and it'll extract business info from it

21
Elektra-Auto-Checkout
Elektra-Auto-Checkout Johnw7789 Go

Toolkit to assist in stock checking and checkout automation of various retail sites.

21
node-github-trend
node-github-trend rhysd TypeScript

Node.js library for scraping GitHub trending repositories.

21
fb-github-trends
fb-github-trends bumbeishvili JavaScript

Automatically post GitHub trends on a Facebook page

21
crawling-framework
crawling-framework tokenmill Java

Easily crawl news portals or blog sites using Storm Crawler.

21
searchenginepy
searchenginepy codewithnick Python

Search engine for Python (query and scrape search engines)

21
web-ctf-help
web-ctf-help xnomas Python

Collection of scripts to help with web-based CTFs.

21
darwin
darwin cgrimal JavaScript

Project to generate the page http://clementgrimal.fr/darwin/

21
xplore
xplore zTgx Rust

Xplore is a scraper for Twitter/X, written in Rust, that works without using the API.

20
Auto-Proxy-Fetcher
Auto-Proxy-Fetcher VolkanSah Python

Automatically fetch and update proxy lists from multiple sources every 6 hours using GitHub Actions

20