Most popular crawling repositories and open source projects

whiskey-web-scraper annaelizabeth2019 Python

My first web scraper! I used this program to get some whiskey info.

1 0 1

cali-api-youtube-search-lambda-layer team-myadvent Python

AWS Lambda service layer by Youtube data selenium crawling

1 0 1

URLer bambeero1 Python

Web crawler using Playwright. It extracts URLs from a given website and saves them in either JSON or TXT format. It includes options to skip crawling...

1 0 1

radiograph-diagnosis-quiz-crawler gihuncho Jupyter Notebook

Simple data crawler for some radiograph diagnosis quizzes

1 0 1

webcrawl ls-saurabh Python

Webcrawl is a Python web crawler that recursively follows links from a starting URL to extract and print unique HTTP links. Using 'requests and 'Beaut...

1 0 1

LLM-Data-Pipeline simidzija Python

Complete pipeline for obtaining LLM training data at scale

1 0 1

crawlquest leewr9 Python

Smart crawling request utility for Python.

1 0 1

NewsCrawler BelisAliosmanova Java

News crawler

1 0 1

Retrieve_Market Mahdi-mghs Jupyter Notebook

Retrieve an old market

1 0 1

free-games-alerts alejandrov44 TypeScript

🔔 Quick and easy way to get notified from all kind of new free games available from different platforms to claim.

1 0 1

general_email_crawler saycc1982 Python

powerful scripts allow you search all email address under you desired URL with different options, 功能强大的网站邮箱爬虫

1 0 1

browse-anything-quickstart mehdi149 Python

🤖 Ready-to-run Python examples for Browse Anything API. Automate web scraping, price monitoring, multi-step workflows, QA testing, and lead generatio...

1 0 1

employee-api OzoneAnim JavaScript

🏢 Manage employee data efficiently with this RESTful API featuring full CRUD operations using Node.js, Express.js, and Azure SQL Database.

1 0 1

scrapfly-scrapers-scrappey-wrapper pim97 Python

A wrapper for the more cheaper alternative - scrappey - for scraping 40+ sites

1 0 1

GoSpider aryanranderiya Go

A high-performance, concurrent web crawler in Go that extracts URLs, downloads content, and converts tens of thousands of web pages to Markdown in min...

1 1 1

colly rossriserose Go

Elegant Scraper and Crawler Framework for Golang

1 0 1

cv-spider-v5-console-final orassayag C#

A .NET console application that searches multiple search engines for email addresses, validates them, and stores them in a SQL Server database. Built...

1 0 1

firecrawl capt-marbles Python

Web scraping and crawling with Firecrawl API - markdown conversion, screenshots, structured data extraction

1 0 1

sageo-cli Coastal-Programs Go

Open-source SEO CLI — crawl, audit, SERP analysis, backlinks, and keyword research from the command line

1 4 1

spa-crawler hu553in Python

A CLI-friendly crawler that can optionally authenticate, crawl a website, and mirror pages and static assets into a local directory so the result can...

1 0 1

mlops-classification arman-aminian Jupyter Notebook

1 0 1

GitHub-Release-Searcher benni-ben HTML

A GitHub release searcher that searches for repositories with certain file types in the releases. Made in HTML and JS.

1 0 1

kurmanjiscraping cikay Python

Scrape Kurdish Kurmanji pages

1 0 1

raysearch radiata-labs Python

An open-source meta-search engine for AI.

1 0 1

WebProbe KaiavN Rust

An open source, rust-based tool for local load tests and checking all interactive elements, to make sure that a user won't encounter an issue

1 0 1

GolDigger ygp4ph Go

Un crawler web récursif, rapide et efficace

1 0 1

bt-dht J4GL Python

A bittorrent dht scraper

1 0 1

Crawlify lordpaoloo TypeScript

Crawlify is an efficient web scraping tool designed to help developers, researchers, and businesses extract, analyze, and automate data collection fro...

1 0 1

EMail-Miner-Pro khdxsohee JavaScript

EMail Miner Pro is designed specifically for professionals scraping data from search engines like Google, ensuring that generic emails (e.g., Gmail, Y...

1 0 1

arbeitsagentur-germany-job-details-scraper Redbalistic

🔍 Extract job details from Germany’s employment portal and convert them into structured datasets for efficient analysis of the job market.

1 0 1

AI-CONTROL tentaclequing

Multi-Level Approach to Managing AI Crawler Behaviour and Content Protection for the IAB Workshop on AI-CONTROL 2024/25

1 0 1

silkworm RustedBytes Rust

Async-first web scraping framework

1 0 1

dot-net-spiders orassayag C#

A collection of ASP.NET web crawlers for extracting email addresses from online sources including job sites, APIs, public pages. Built in 2012–2016, t...

1 0 1

Extracto nishal21 Python

AI-powered web scraper. Give it a URL and tell it what data you want — it handles the rest.

1 0 1

Web-Scraping-Crash-Course M-Taghizadeh Python

A complete 2-hour training on Web Scraping with Python, featuring practical projects using Camoufox, Playwright, and Scrapy. Includes real-world crawl...

1 0 1

Scrapling Fabianusromarioyeuyanan123 Python

Simplify web scraping by extracting data from modern websites with an easy-to-use Python library designed for efficiency and clarity.

1 0 1

starwars-intro-css3 firestar300 HTML

Original trilogy, prelogy and postlogy introductions of Star Wars in CSS3

1 1 1

web-vuln-scanner AniketBansod Python

Lightweight Python web scanner with BFS crawling, form analysis, multi-threaded requests, and automated tests for XSS, SQLi, and missing security head...

1 0 1