Web data extraction tool implemented as chrome extension
Facebook Group Members Extractor. Download Facebook group members in CSV.
🧰 A collection of automation tools for Instagram 📱| Written in Python 🐍 | Don't forget to ⭐ the repo !
Face recognition system using MTCNN, FACENET, SVM and FAST API to track participants of Big Brother Brasil in real time.
A playwright bot which is implemented to scrape linkedin and store advertisement data in a database and telegram channel
Jsoup Annotations POJO
A python utility for downloading Common Crawl data
Anime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Universal scraping tool, which allows you to extract data using multiple environments
Image Dataset Tool (idt) is a cli tool designed to make the otherwise repetitive and slow task of creating image datasets into a fast and intuitive pr...
Free Palestine. 📖 This tool is to download course from educative.io for offline usage. It uses your login credentials and download the course.
Node.js package to bypass CloudFlare's anti-bot JavaScript challenges
Transistor, a Python web scraping framework for intelligent use cases.
Simple scripts for Level UP your scraping Skills, and source code for Level UP playlist on Youtube
Grawler is a tool written in PHP which comes with a web interface that automates the task of using google dorks, scrapes the results, and stores them...
Experience for effectively fetching Facebook data by Querying Graph API with Account-based Token and Operating undetectable scraping Bots to extract C...
Perform Google Dork search with Dorkify
Linkedin Learning videos downloader
📚 Python tool to download book summaries and audio from Blinkist.com, and generate some pretty output
simple multi-level scraper json input/output for Cheerio
📜 Framework-agnostic API scraper to load items from any paginated JSON API into a Laravel lazy collection via async HTTP requests.
Spotify scraper
estela, an elastic web scraping cluster 🕸
The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de...
Your will to enroll in Udemy course is here, but the money isn't? Search no more! This python program searches for your desired course in more than [i...
Run Selenium with Python via Github Actions using Headless or Non-Headless browsers!
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library des...
Get structured JSON data from any page.
This script allows you to automate the creation of Gmail accounts using the Selenium automation framework with the Chrome WebDriver. It navigates thro...
Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with gene...
:spider: Google client for SERPS
自学入门 Python 优质中文资源索引,包含 书籍 / 文档 / 视频,适用于 爬虫 / Web / 数据分析 / 机器学习 方向
Pick the most common user-agents on the Internet 👻
Python scraper for Language Pods such as Japanesepod101.com :japanese_ogre: :japan: :sushi: Compatible with Japanese, Chinese, French, German, Italian...
Code relating to scraping public police data.
Extract data or evaluate value from HTML/XML documents using XPath
Generate dispersable airdrops from Twitter threads.
Powerful Telegram bot for web scraping and crawling. Fast, easy, and loved by thousands!
Scrape Algorithm Questions from leetcode and generate html and epub file
⬛⬜⬛ Command line tool to scrape crosswords from online solvers and save them as .puz files ⬛⬜⬛
API to get enormous amount of high resolution satellite images from satellites.pro quickly through multi-threading! create map your own map dataset. B...
Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extraction easy.
The Scripting Engine that Combines Speed, Safety, and Simplicity
Hear local historical markers as you travel on your road-trip. 100% Shared Compose UI, Kotlin native cross-platform codebase. Includes Cocoapods, Goog...
Sasori is a dynamic web crawler powered by Puppeteer, designed for lightning-fast endpoint discovery.
Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)
Learn how to send POST requests with cURL.
Distributed crawler, database and web frontend for public directories indexing
The Kemono and Coomer Downloader simplifies downloading posts from Kemono and Coomer websites, allowing users to download individual or multiple posts...
Use the MapReduce's Java interface to distributed crawle the data of Chinese universities and learn basic knowledge of hdfs.