Topic

crawling

Repositories (1350)

whiskey-web-scraper
whiskey-web-scraper annaelizabeth2019 Python

My first web scraper! I used this program to get some whiskey info.

1
cali-api-youtube-search-lambda-layer
cali-api-youtube-search-lambda-layer team-myadvent Python

AWS Lambda service layer by Youtube data selenium crawling

1
URLer
URLer bambeero1 Python

Web crawler using Playwright. It extracts URLs from a given website and saves them in either JSON or TXT format. It includes options to skip crawling...

1
radiograph-diagnosis-quiz-crawler
radiograph-diagnosis-quiz-crawler gihuncho Jupyter Notebook

Simple data crawler for some radiograph diagnosis quizzes

1
webcrawl
webcrawl ls-saurabh Python

Webcrawl is a Python web crawler that recursively follows links from a starting URL to extract and print unique HTTP links. Using 'requests and 'Beaut...

1
LLM-Data-Pipeline
LLM-Data-Pipeline simidzija Python

Complete pipeline for obtaining LLM training data at scale

1
crawlquest
crawlquest leewr9 Python

Smart crawling request utility for Python.

1
NewsCrawler
NewsCrawler BelisAliosmanova Java

News crawler

1
Retrieve_Market
Retrieve_Market Mahdi-mghs Jupyter Notebook

Retrieve an old market

1
free-games-alerts
free-games-alerts alejandrov44 TypeScript

🔔 Quick and easy way to get notified from all kind of new free games available from different platforms to claim.

1
general_email_crawler
general_email_crawler saycc1982 Python

powerful scripts allow you search all email address under you desired URL with different options, 功能强大的网站邮箱爬虫

1
browse-anything-quickstart
browse-anything-quickstart mehdi149 Python

🤖 Ready-to-run Python examples for Browse Anything API. Automate web scraping, price monitoring, multi-step workflows, QA testing, and lead generatio...

1
employee-api
employee-api OzoneAnim JavaScript

🏢 Manage employee data efficiently with this RESTful API featuring full CRUD operations using Node.js, Express.js, and Azure SQL Database.

1
scrapfly-scrapers-scrappey-wrapper
scrapfly-scrapers-scrappey-wrapper pim97 Python

A wrapper for the more cheaper alternative - scrappey - for scraping 40+ sites

1
GoSpider
GoSpider aryanranderiya Go

A high-performance, concurrent web crawler in Go that extracts URLs, downloads content, and converts tens of thousands of web pages to Markdown in min...

1
colly
colly rossriserose Go

Elegant Scraper and Crawler Framework for Golang

1
cv-spider-v5-console-final
cv-spider-v5-console-final orassayag C#

A .NET console application that searches multiple search engines for email addresses, validates them, and stores them in a SQL Server database. Built...

1
firecrawl
firecrawl capt-marbles Python

Web scraping and crawling with Firecrawl API - markdown conversion, screenshots, structured data extraction

1
sageo-cli
sageo-cli Coastal-Programs Go

Open-source SEO CLI — crawl, audit, SERP analysis, backlinks, and keyword research from the command line

1
spa-crawler
spa-crawler hu553in Python

A CLI-friendly crawler that can optionally authenticate, crawl a website, and mirror pages and static assets into a local directory so the result can...

1
mlops-classification
mlops-classification arman-aminian Jupyter Notebook
1
GitHub-Release-Searcher
GitHub-Release-Searcher benni-ben HTML

A GitHub release searcher that searches for repositories with certain file types in the releases. Made in HTML and JS.

1
kurmanjiscraping
kurmanjiscraping cikay Python

Scrape Kurdish Kurmanji pages

1
raysearch
raysearch radiata-labs Python

An open-source meta-search engine for AI.

1
WebProbe
WebProbe KaiavN Rust

An open source, rust-based tool for local load tests and checking all interactive elements, to make sure that a user won't encounter an issue

1
GolDigger
GolDigger ygp4ph Go

Un crawler web récursif, rapide et efficace

1
bt-dht
bt-dht J4GL Python

A bittorrent dht scraper

1
Crawlify
Crawlify lordpaoloo TypeScript

Crawlify is an efficient web scraping tool designed to help developers, researchers, and businesses extract, analyze, and automate data collection fro...

1
EMail-Miner-Pro
EMail-Miner-Pro khdxsohee JavaScript

EMail Miner Pro is designed specifically for professionals scraping data from search engines like Google, ensuring that generic emails (e.g., Gmail, Y...

1
arbeitsagentur-germany-job-details-scraper
arbeitsagentur-germany-job-details-scraper Redbalistic

🔍 Extract job details from Germany’s employment portal and convert them into structured datasets for efficient analysis of the job market.

1
AI-CONTROL
AI-CONTROL tentaclequing

Multi-Level Approach to Managing AI Crawler Behaviour and Content Protection for the IAB Workshop on AI-CONTROL 2024/25

1
silkworm
silkworm RustedBytes Rust

Async-first web scraping framework

1
dot-net-spiders
dot-net-spiders orassayag C#

A collection of ASP.NET web crawlers for extracting email addresses from online sources including job sites, APIs, public pages. Built in 2012–2016, t...

1
Extracto
Extracto nishal21 Python

AI-powered web scraper. Give it a URL and tell it what data you want — it handles the rest.

1
Web-Scraping-Crash-Course
Web-Scraping-Crash-Course M-Taghizadeh Python

A complete 2-hour training on Web Scraping with Python, featuring practical projects using Camoufox, Playwright, and Scrapy. Includes real-world crawl...

1
Scrapling
Scrapling Fabianusromarioyeuyanan123 Python

Simplify web scraping by extracting data from modern websites with an easy-to-use Python library designed for efficiency and clarity.

1
starwars-intro-css3
starwars-intro-css3 firestar300 HTML

Original trilogy, prelogy and postlogy introductions of Star Wars in CSS3

1
web-vuln-scanner
web-vuln-scanner AniketBansod Python

Lightweight Python web scanner with BFS crawling, form analysis, multi-threaded requests, and automated tests for XSS, SQLi, and missing security head...

1
awesome-web-crawler
awesome-web-crawler NickG1978 HTML

🕷️ Discover and use popular web crawlers across various programming languages to efficiently extract data from the web.

1
easy-php-crawler
easy-php-crawler Lexxtor PHP

Simple yet flexible URL crawler.

0
pyproxyroulette
pyproxyroulette Tortuginator Python

Random Proxy Wrapper for Python Requests

0
textminingproject
textminingproject jjimini98 Jupyter Notebook

textmining project in Github

0
ecommerce-scrapy-crawler
ecommerce-scrapy-crawler Harwindersandhu Python
0
shopCrawler
shopCrawler omoniyi289 PHP

under development

0
clojure-crawling
clojure-crawling u4bi-sev Clojure

clojure-crawling

0
proxy
proxy kleicht Go

Very basic proxy

0
image_crawling
image_crawling hiMinju Python

Crawling many images in google

0
c4k
c4k oxgl Kotlin

Kotlin Web Crawler library

0
crawling-system
crawling-system SlowpokeStudio Java

crawling system

0
jf-crawler
jf-crawler shashwatx Python

Crawls job advertisements from a popular spanish site using bs4

0