Topic

scraping

Repositories (1766)

flutter_notification_listener
flutter_notification_listener jiusanzhou Kotlin

Flutter plugin to listen for and interact with all incoming notifications for Android. 一个监听手机通知的插件。

47
UpworkScraper
UpworkScraper roperi Python

UpworkScraper allows you to scrape your best matches job postings from Upwork.

47
github-trending-cli
github-trending-cli psalias2006 Python

A simple CLI tool to browse GitHub's trending repositories from your terminal.

46
jimov_api
jimov_api koikiss-dev TypeScript

This project is an open-source API for retrieving multimedia content such as anime, movies and series, news, and manga in both Spanish and English.

46
info-bot
info-bot irevenko Python

🤖 A Versatile Telegram Bot

46
scaling-to-distributed-crawling
scaling-to-distributed-crawling ZenRows HTML

Repository for the Mastering Web Scraping in Python: Scaling to Distributed Crawling blogpost with the final code.

46
oversmash
oversmash filp TypeScript

Overwatch API library for player details and career stats

45
jason-the-miner
jason-the-miner mawrkus JavaScript

⛏ A versatile Web scraper for Node.js

45
go-ps4
go-ps4 lucasepe Go

Search your favorite PS4 games from Playstation Store using the Command Line

45
image-collector
image-collector x-sk217 Python

Download images from Google Image Search

45
sniffagrammers
sniffagrammers orsifrancesco JavaScript

Node.js and PHP files to automatically downloading pictures from instagram by https://orsi.me/sniffagram

45
local-api-client-typescript
local-api-client-typescript kameleo-io TypeScript

Official JavaScript/TypeScript library for interacting with Kameleo Client

45
async-pubmed-scraper
async-pubmed-scraper IliaZenkov Python

PubMed scraper for async search on a list of keywords and concurrent extraction of all found URLs, returning a DataFrame/CSV containing all article da...

45
bluebird
bluebird labteral Python

Unofficial Python client for Twitter

44
torchestrator
torchestrator lspahija Kotlin

Spin up Tor containers and then proxy HTTP requests via these Tor instances

44
Extracty
Extracty Mamdouh66 Python

Extract structured data from any unstructured web page

44
xdsl-exporter
xdsl-exporter Dentrax Go

xDSL Prometheus Exporter

44
fake-http-header
fake-http-header MichaelTatarski Python

A python package to generate random request fields for a http header.

44
Rotating-Proxies-With-Python
Rotating-Proxies-With-Python oxylabs Python

Learn about how to rotate proxies by using Python.

44
activesoup
activesoup jelford Python

A headless pure-python browser for the web

43
scrape-github-trending
scrape-github-trending transitive-bullshit JavaScript

Tutorial for web scraping / crawling with Node.js.

43
RARBG-scraper
RARBG-scraper evyatarmeged Python

With Selenium headless browsing and CAPTCHA solving

43
TikDown
TikDown xtekky Python

Fast TikTok NO Watermark Video Downloader (username or url)

43
scrapingant-client-python
scrapingant-client-python ScrapingAnt Python

ScrapingAnt API client for Python.

43
laravel-scrapingbee
laravel-scrapingbee ziming PHP

PHP Laravel Library for Scrapingbee Web Scraping API. AI querying supported. Also support Google, Walmart, Amazon, YouTube scraping

43
python-vistopia
python-vistopia chazeon Python

看理想 Python 客户端 / 下载器,下载看理想的音频和文稿

43
webmagician-ui
webmagician-ui Jkanon TypeScript

An admin UI project for a configurable web crawler platform

42
Architeuthis
Architeuthis simon987 Go

MITM HTTP(S) proxy with integrated load-balancing, rate-limiting and error handling. Built for automated web scraping.

42
shup
shup pystardust Shell

A POSIX shell script to parse HTML

42
TorScrapper
TorScrapper little-endian-0x01 Python

A Scraper made 100% in Python using BeautifulSoup and Tor. It can be used to scrape both normal and onion links. Happy Scraping :)

42
permaculture
permaculture jwnigel Python

Permaculture design app built on scraped plant databases. Drag-n-drop GUI with detailed design plan generator.

42
noscrape
noscrape schoenbergerb TypeScript

This repository is deprecated

42
goGetJS
goGetJS davemolk Go

a tool for extracting, searching, and saving JavaScript files (with optional headless browser)

42
chew
chew mmatongo Go

Chew is a Go library for processing various content types into markdown/plaintext.

42
emec-api
emec-api pavanad Python

API Python para consulta na base de dados oficial do e-MEC

42
gwaripper
gwaripper nilfoer Python

Tool for conveniently downloading audios from r/gonewildaudio and similar subreddits

42
movie-posters-convnet
movie-posters-convnet adrz Python

Unsupervised clustering of movie posters with features extracted from Convolutional Neural Network

41
myanimelist-data-set-creator
myanimelist-data-set-creator debakarr Python

Collection of some simple python scripts to create https://myanimelist.net/ anime and user data set.

41
html-table-to-json
html-table-to-json brandon93s JavaScript

Generate JSON representations of HTML tables

41
lc-webscraping
lc-webscraping carpentries-incubator Python

Introduction to web scraping

41
linux-monster
linux-monster harkerbyte Python

Ethical Facebook and Gmail bruteforce... Bonus, proxy rotation for every attack based on user preference setting and also a web scrapping tool

41
linkeBot
linkeBot fabiodeandrade HTML

🔎 um bot de Web Scraping para mostrar vagas do LinkedIn

41
njsparser
njsparser novitae HTML

🦩 A NextJS data parser, to scrape peacefully

41
dom-content-extraction
dom-content-extraction oiwn Rust

DOM Based Content Extraction via Text Density

41
nhasixapp
nhasixapp shirokun20 Dart

Unofficial NHentai mobile app with flutter and bloc

40
raiplay-dl
raiplay-dl wetcork Python

The most advanced raiplay.it downloader

40
scrapy-zyte-api
scrapy-zyte-api scrapy-plugins Python

Zyte API integration for Scrapy

40
hyper-sdk-playwright
hyper-sdk-playwright Hyper-Solutions TypeScript

Hyper Solutions SDK for Playwright - Bypass Akamai Bot Manager, Incapsula, Datadome and Kasada.

40
webtranspose
webtranspose mike-gee Python

Web scraping API for building AI applications.

40
OSINTLAB
OSINTLAB Purpl3-Dev Shell

This script automates the installation of 50 OSINT tools for reconnaissance and information gathering.

39