Get clean data from tricky documents, powered by vision-language models ⚡
LinkedIn Automation Tool: Describe your product. Define your target market. The AI finds the leads for you.
A Modern Search Engine API for Anime, Movies/TVShows, Books, Light Novels, Manga, etc.
Scrape tweets, profiles, followers and following from Twitter/X, no API key needed. Python library with smart multi-account pooling, proxy support and...
Example end to end data engineering project.
🤖 Scrape data from HTML websites automatically by just providing examples
📰 Brazilian government gazettes, accessible to everyone.
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy...
AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements an...
An ergonomic Python HTTP Client with TLS fingerprint
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Lightweight library for scraping websites with LLMs
This Scrapy project uses Redis and Kafka to create a distributed on-demand scraping cluster.
Open Source Bulk Auto Gmail Creator Bot with Selenium & Seleniumwire (Python). Feel free to contact me with Django/Flask, ML, AI, GPT, Automation, S...
In this tutorial, we showcase how to scrape public Google data with Python and Oxylabs API.
HTTP(S)/SOCKS5 rotating residential proxies - code examples & general information.
Creating Scrapy scrapers via the Django admin interface
Watch everything from your terminal.
Tools for various online judges. Downloading sample cases, generating additional test cases, testing your code, and submitting it.
artoo.js - the client-side scraping companion.
📰 Newspaper4k, a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
🚀 An open source alternative to searx which provides a modern-looking ✨, lightning-fast ⚡, privacy respecting 🥸, se...
Crawly, a high-level web crawling & scraping framework for Elixir.
Automatically archive links to videos, images, and social media content from Google Sheets (and more).
🎭 Intelligent browser header & fingerprint generator
Your browser anime experience from the terminal
A completely revamped and redesigned fork, reimagined from scratch based on the original onlyfans-scraper
🧹 Python package for text cleaning
🚀 Web scraping for humans
Modular service framework to move and transform network packets
Scalable Python web scraping scripts for 40+ popular domains
Scrape the Instagram frontend. Inspired by twitter-scraper by @kennethreitz.
Generate Free Edu Mail(s) within minutes
📄 Python tool to turn Notion.so pages into lightweight, customizable static websites
A CLI toolset to generate table of contents for PDF files automatically.
Simple but useful Python web scraping tutorial code.
DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with minimal code...
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of...
[Unmaintained] A simple and clean video/music/image downloader 👾
✂️ High-performance, multi-threaded image scraper
🥫 The simple, fast, and modern web scraping library
Lookyloo is a web interface that allows users to capture a website page and then display a tree of domains that call each other.
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON.
😈📚 A curated library of research papers and presentations for counter-detection and web privacy enthusiasts.
Google Search Results via SERP API: pip-installable Python package
The web scraper that's nearly impossible to block - now called @ulixee/hero
Extract structured data from websites. Website scraping.
Python script for Google Dorking
Python package for scraping real estate property data
Comic-dl is a command-line tool to download manga and comics from various comic and manga sites. Supported sites: readcomiconline.to, mangafox.me, co...