Most popular crawling repositories and open source projects

minigun-requests umihico Python

Web scraping API to outsource tons of GET & xpath to cloud computing

7 0 7

awesome-scraping ScrapeRouter Python

The definitive list of the latest libraries, tools, APIs and providers for web scraping. The only daily-updated collection of web scraping resources.

7 2 7

dotlas_odyssey dotlas Jupyter Notebook

⛵️ A take-home assignment for the full-time Data Engineering position at Dotlas

7 2 7

DBpia_crawler chanhee-kang Python

국내 논문 서지정보 사이트 DBpia 크롤링 프로그램

7 3 7

Advanced-proxy-Scraper FuckingToasters

Advanced Proxy Scraper Crawler fetcher

7 1 7

clausea lvndry Python

Transform complex legal documents into clear, actionable insights with AI-powered analysis.

7 0 7

LicencePlateScraper Momotoculteur Python

Système automatique pour constituer un dataset de plaque d'immatriculation de voiture par scraping et crawling

7 3 7

jsonld-extract capturr TypeScript

A damn simple tool to extract json-ld metadata from webpage using jquery like api (jQuery, Cheerio, CashDom ...).

7 0 7

leetcode-summary-crawler HUANGXUANKUN Python

A leetcode crawler built with selenium and requests. Generate a revision guide for your coding interview

6 0 6

ScienceProject Kimdonghyeon7645 Python

🔭🌦 과학프로젝트, 날씨에 학교를 더하다 (with Django)

6 1 6

proxypool franklingu Python

A proxy poll: get free and high quality proxies

6 4 6

chrome-php helloiamlukas PHP

A PHP Wrapper for Chrome Headless. Get the DOM of any webpage.

6 8 6

crawling-scraping-scripts soccer-it JavaScript

Collection of brazilian soccer data crawling/scraping scripts.

6 0 6

Scrapy Decodo Python

Scrapy proxy authentication example for Decodo

6 0 6

scrap-superloto erseler Python

A web scrapping project to fetch all lottery winning numbers, date, prizes etc.

6 1 6

actual-deeplearning suites Jupyter Notebook

파이썬을 이용한 머신러닝, 딥러닝 실전개발 입문

6 14 6

craw-BadanPusatStatistik RomySaputraSihananda Python

craw-BadanPusatStatistik adalah program untuk mengambil data dari website Badan Pusat Statistik Indonesia.

6 1 6

langchain-advertools eliasdabbas Jupyter Notebook

LangChain integration for advertools

6 0 6

SLR-Tools maurice-schleussinger Python

Python scripts to perform a systematic literature review for Google Scholar and others

6 2 6

woocommerce-scraper vanquan805 PHP

The best scraping solution for WooCommerce

6 1 6

Booklify.me LillySchramm TypeScript

Booklify.me is an open-source platform for keeping track of everything in your bookshelf.

6 1 6

Web-Crawler 0MeMo07 Python

Web Crawler with Python

6 0 6

craw-Pinterest RomySaputraSihananda Python

melakukan web scraping dan mengambil gambar berdasarkan keyword pencarian pinterest.

6 1 6

Scrapy-Middleware Decodo Python

Scrapy Middleware for proxy authentication with Decodo

6 1 6

Slic sw-song Python

Single line image classifier

6 3 6

Puppeteer Decodo JavaScript

Puppeteer proxy authentication example for Decodo

6 3 6

namu-soup anteater333 JavaScript

숲Soup - 나무위키 인기 검색어 크롤러

6 0 6

GitHub_Crawling_TextMining_Project park1997 Jupyter Notebook

Data collection and processing for intelligent technology ecosystem analysis

6 2 6

quotes-crawler dori-dev Python

Quotes crawler using scrapy and python.

6 0 6

telegramBot_instaDP codenashwan PHP

A simple BOT Telegram to downloading Instagram profiles photo

6 0 6

Cloud_Player_V2 amirhoseinsb Python

You can use the cloudplayer tool to listen to the music of the singer you want without going to a specific website and at a very high speed.

6 0 6

bot-detector lula73 Shell

🚫 IP list to block bots, scrapers, AI crawlers & malicious traffic. +17.000 IP Updated every week.

6 0 6

imperium-crawl SadikinAraf TypeScript

Extract, crawl, and scrape web data efficiently with a powerful open-source CLI tool requiring no API keys and minimal setup.

6 0 6

Playwright Decodo JavaScript

Playwright proxy authentication & scraping example for Decodo

6 0 6

Crawling-Data-From-Tokopedia aqilwahid Jupyter Notebook

Repositori ini berisi proyek web scraping atau crawling data dari situs Tokopedia. Proyek ini bertujuan untuk mengumpulkan informasi produk seperti na...

6 2 6

knu-lms-scheduler HyeokjaeLee JavaScript

:mortar_board: 공주대학교 온라인 강의 시스템 편의성 향상 프로그램

6 1 6

crawl-agoda mandes95 Python

6 3 6

PlatformsCrawler eric2788 Go

多平台爬蟲 + 模塊化管理，用於搜集資料並經 redis pubsub 發送

6 2 6

everytime-timetable-crawling wwlee94 Python

에브리타임 수업 강좌 시간표 크롤링

5 2 5

node-crawling-framework JimmyLaurent JavaScript

✨ NodeJs crawling & scraping framework heavily inspired by Scrapy

5 1 5

bm25-ranking-php sumairz PHP

Ranked the reuter's document using bm25 ranking algorithm.

5 3 5

vba-crawler bokhua Visual Basic

VBA web crawler using http GET/POST

5 3 5

GooglePlayDatabaseMirror BaseMax PHP

Repository of designing a crawler script to update a mirror database from Google Play on PHP.

5 0 5

crwlr busterc JavaScript

🕷a minimal puppeteer crawler api

5 0 5

Migale cth-latest C#

Migale was born out of a need to extract data quickly and with a very low development cost. This package is not intended to replace complete and struc...

5 0 5

SiteMapperChromeExtension MatthewMariner JavaScript

Discover and navigate website structure with smart sitemap detection, visual tree view, and export tools for SEO audits

5 0 5

tider ZLotusRain Python

A fast, simple, extensible and powerful framework for web crawling.

5 0 5

kafka-ES-DataPrakiraanCuaca RomySaputraSihananda Python

Simulasi transmisi data hasil crawling dari DataPrakiraanCuaca menggunakan Python, Kafka, dan Elasticsearch.

5 0 5

proxycrawl-java crawlbase Java

ProxyCrawl Java library for scraping and crawling

5 0 5

EPhoto360 LordDeveloper PHP

Create text effects online , Effects online for free, photo frames, make face photo montages, custom greeting cards, add vintage filters, turn photos...

5 4 5

crawling

Repositories (1350)