Topic

scraping

Repositories (1626)

torrent-tracker-scraper
torrent-tracker-scraper project-mk-ultra Python

A UDP torrent tracker scraper library written in Python 3

54
diffbot-php-client
diffbot-php-client Swader PHP

[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library

53
garlic
garlic velocitatem JavaScript

๐Ÿง„๐Ÿง› protect your website from being scraped by bots.

53
Ecole-Directe-Plus
Ecole-Directe-Plus Magic-Fishes JavaScript

A better EcoleDirecte (unaffiliated): more pleasant, functional, and improved experience.

53
trex
trex tracking-exposed HTML

youtube & tiktok analysis + youchoose recommendation custmizer. backend, extensions, and tooling

53
dart-scraper
dart-scraper josw123 Vue

ํ•œ๊ตญ ๊ธˆ์œต๊ฐ๋…์›์—์„œ ์šด์˜ํ•˜๋Š” ๋‹คํŠธ(Dart) ์‹œ์Šคํ…œ์„ ์ด์šฉํ•œ ๊ธฐ์—… ์žฌ๋ฌด์ œํ‘œ ์ถ”์ถœ ํ”„๋กœ๊ทธ๋žจ

52
hext
hext html-extract C++

Domain-specific language for extracting structured data from HTML documents

52
scraping-reviews-from-googlemaps
scraping-reviews-from-googlemaps MajideND Python

This is a simple script with python to scrap Google Maps reviews and ratings.

52
tiktok-trending-data-api
tiktok-trending-data-api ogohogo JavaScript

Scraping the TikTok Discovery Data API every 1 hour using Github Actions to view changes

52
scrapers
scrapers montoyamoraga Python

scrapers for building your own image databases

51
greenlight
greenlight bosniankicks Go

A Golang based Undetected Web Automation Framework

51
CaseHarvester
CaseHarvester dismantl Python

AWS-based application for scraping the Maryland Judiciary Case Search

51
freenom-auto-renew-domains
freenom-auto-renew-domains Sorok-Dva TypeScript

A scraper built with puppeteer that auto renew free domains on Freenom and send discord message using bot

51
aniyoi-api
aniyoi-api miukyo TypeScript

REST API Anime Subtitle Indonesia | Streaming Anime Sub Indo

51
imslp
imslp jlumbroso Python

๐ŸŽผ The clean and modern way of accessing IMSLP data and scores programmatically. ๐ŸŽถ

50
thecrowler
thecrowler pzaino Go

A Content Discovery and Development Platform. Empowering Cybersecurity, AI, Marketing, and Finance professionals and researchers to discover, analyze,...

49
instagram-without-api
instagram-without-api orsifrancesco PHP

A simple PHP code to get unlimited instagram public pictures by every user without api, without credentials.

49
configs
configs Diggernaut

Public, free to use, repository with diggers configs for scraping / extracting data from various e-commerce websites and online stores

48
local-api-client-python
local-api-client-python kameleo-io Python

Official Python library for interacting with Kameleo Client

48
socials
socials lorey Python

๐Ÿ‘จโ€๐Ÿ‘ฉโ€๐Ÿ‘ฆ Social account detection and extraction in Python, e.g. for crawling/scraping.

47
beautifulsoup-tutorial
beautifulsoup-tutorial hackersandslackers Python

:sparkles: :ramen: Scrape webpage metadata using BeautifulSoup.

47
News_Summary
News_Summary sunnysai12345 Jupyter Notebook

Dataset and scripts for scraping the news articles from popular sources along with the summary of the article.

47
AngleParse
AngleParse kamome283 C#

HTML parsing and processing tool for PowerShell.

47
wajik-anime-api
wajik-anime-api wajik45 TypeScript

REST API streaming dan download Anime subtitle Indonesia | sub Indo

47
Scraping-Dynamic-JavaScript-Ajax-Websites-With-BeautifulSoup
Scraping-Dynamic-JavaScript-Ajax-Websites-With-BeautifulSoup oxylabs Python

A guide on how to scrape JavaScript rendered websites with Python and BeautifulSoup.

47
scrape-google-python
scrape-google-python oxylabs

In this tutorial, we showcase how to scrape public Google data with Python and Oxylabs API.

47
scaling-to-distributed-crawling
scaling-to-distributed-crawling ZenRows HTML

Repository for the Mastering Web Scraping in Python: Scaling to Distributed Crawling blogpost with the final code.

46
CraigslistScraper
CraigslistScraper ryanirl Python

Simple webscraper for Craigslist.

46
react-node-web-scraper
react-node-web-scraper codegratia JavaScript

Final Year project, scraping data of e-commerce stores and display in ReactJS app.

46
jason-the-miner
jason-the-miner mawrkus JavaScript

โ› A versatile Web scraper for Node.js

45
image-collector
image-collector x-sk217 Python

Download images from Google Image Search

45
Outlook-account-creator
Outlook-account-creator Skuxblan Python

Python tool that automatically create outlook account with auto-captcha

45
local-api-client-typescript
local-api-client-typescript kameleo-io TypeScript

Official JavaScript/TypeScript library for interacting with Kameleo Client

45
oversmash
oversmash filp TypeScript

Overwatch API library for player details and career stats

44
scrapegraph-sdk
scrapegraph-sdk ScrapeGraphAI Jupyter Notebook

๐Ÿ•ท๏ธ Official Scrapegraph API SDK: Effortlessly extract content from any website. AI-powered. ๐Ÿค– Hassle-free web scraping made simple.

44
go-ps4
go-ps4 lucasepe Go

Search your favorite PS4 games from Playstation Store using the Command Line

44
RARBG-scraper
RARBG-scraper evyatarmeged Python

With Selenium headless browsing and CAPTCHA solving

44
bluebird
bluebird labteral Python

Unofficial Python client for Twitter

44
xdsl-exporter
xdsl-exporter Dentrax Go

xDSL Prometheus Exporter

44
sniffagrammers
sniffagrammers orsifrancesco JavaScript

Node.js and PHP files to automatically downloading pictures from instagram by https://orsi.me/sniffagram

44
getter
getter kastaid Python

Get and put users (scraping) to the target group/channel efficiently, correctly and safety.

44
python
python joaopauloaramuni Python

Repo Python

44
jimov_api
jimov_api koikiss-dev TypeScript

This project is an open-source API for retrieving multimedia content such as anime, movies and series, news, and manga in both Spanish and English.

44
firecrawl-quickstarts
firecrawl-quickstarts alexfazio Jupyter Notebook

A collection of cookbooks to help developers get started quickly with the Firecrawl API.

43
activesoup
activesoup jelford Python

A headless pure-python browser for the web

43
scrape-github-trending
scrape-github-trending transitive-bullshit JavaScript

Tutorial for web scraping / crawling with Node.js.

43
torchestrator
torchestrator lspahija Kotlin

Spin up Tor containers and then proxy HTTP requests via these Tor instances

43
how-to-scrape-google-trends
how-to-scrape-google-trends oxylabs Python

Learn step-by-step how to scrape Google Trends data and make a result comparison using Python and Oxylabs SERP API. Extract keywords, their popularity...

43
info-bot
info-bot irevenko Python

๐Ÿค– A Versatile Telegram Bot

43
myanimelist-data-set-creator
myanimelist-data-set-creator debakarr Python

Collection of some simple python scripts to create https://myanimelist.net/ anime and user data set.

42