A high effective golang library for parsing big-sized sitemaps and avoiding high memory usage. The sitemap parser was written on golang without extern...
Python for scraping and processing tennis match data from the ATP Tour website.
CobWeb is a Python library for web scraping. The library consists of two classes: Spider and Scraper.
Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Cachin...
Cleaning tool for web scraped text
Enhanced LinkedIn Job Search Chrome Extension
TV Series is a tool that scrapes Episode Synopsis' of popular TV Series' from websites like Wikipedia / IMDb and show in one place with a user-friendl...
Proxy-like server that will show you the DOM of a page after JS runs
Extract social media links and account names from websites.
Production-ready web scraping in a single function call. Built on Crawlee.
Scraps all the open chats, and their last n messages, and saves them in a csv file
📊 Python tool to scrape real-time information about ETFs from the web and mixing them together by proportionally distributing their assets allocation
Cloudflare Turnstile solver & bypass — Python, real Chrome browser, no paid APIs. Local HTTP API service included. Auto-solves invisible and managed (...
A drop-in replacement for puppeteer patched with rebrowser-patches. It allows to pass modern automation detection tests.
Generic REST API for scraping websites. Drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source!
🔎Search LinkedIn profile by email address📧
Instagram & TikTok automation via real Android devices. Likes, follows, DMs, scraping. No API abuse. Built with Python, uiautomator2 & ADB.
Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of the box.
:soccer: Free API with results from national soccer competitions
This class can retrieve search results from Google.
A Python based web scraping api built with fastapi to get manga contents.
Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the...
Sneakpeek is a framework that helps to quickly and conviniently develop scrapers. It’s the best choice for scrapers that have some specific complex sc...
Twitter bot powering @arichduvet
Collection of scraping recipes to get metadata about what is being streamed on webradios
Simple and friendly Bot for Instagram, using Selenium and Scrapy with Python.
Node library for scraping manga sites
The fast, most optimal, and correct HTML & XML parsing library for Python written in Rust.
Scrape the hotel reviews of a whole city on TripAdvisor
Threat hunting tool for scraping latest scrapes from Pastebin
A short introduction to scraping with Python with given steps and an example scraper script.
Zoominfo scraper with using of rotating proxies and headless Chrome from ScrapingAnt
A collection of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages.
Main API Flight Git Repository
Image scraping library for creating deep learning datasets
Buy limited edition sneakers
Go library for scraping or downloading files bypassing Cloudflare protection and browser checks
✉️ Use the power of browser-use to contact any person or organization... by any means necessary
Solve the Geetest slider captcha with Puppeteer
Provides a virtual web browser (a.k.a. "headless browser") appearing as a node.
Self hosted AI workflow for scraping Instagram Reels (audio and description). Extracting, summarising and categorising, then storing all relevant info...
Build web scraping agents using AI to auto-extract the data from websites, capture screenshot, generate pdf from URL and web crawling with Agenty
A drop-in replacement for puppeteer-core patched with rebrowser-patches. It allows to pass modern automation detection tests.
🚀 OFFICIAL STARTER TEMPLATE FOR BOTASAURUS SCRAPING FRAMEWORK 🤖
MultiThreaded Application to Scrape Working Web Proxies
A headless browser task/job queue & runner based on Hero (Chrome)
An unofficial Python API wrapper for firstcycling.com
🎙️ TED Talks web scraper
A python package with client to scrape the israeli supermarkets data
Web Scraping Framework