Topic

crawling

Repositories (1230)

telegramBot_instaDP
telegramBot_instaDP codenashwan PHP

A simple BOT Telegram to downloading Instagram profiles photo

6
Cloud_Player_V2
Cloud_Player_V2 amirhoseinsb Python

You can use the cloudplayer tool to listen to the music of the singer you want without going to a specific website and at a very high speed.

6
knu-lms-scheduler
knu-lms-scheduler HyeokjaeLee JavaScript

:mortar_board: 공주대학교 온라인 강의 시스템 편의성 향상 프로그램

6
Advanced-proxy-Scraper
Advanced-proxy-Scraper FuckingToasters

Advanced Proxy Scraper Crawler fetcher

6
crawl-agoda
crawl-agoda mandes95 Python
6
Formosan-languages
Formosan-languages howard-haowen HTML

台灣南島語-華語句庫資料集(Dataset of Formosan-Mandarin sentence pairs)

6
everytime-timetable-crawling
everytime-timetable-crawling wwlee94 Python

에브리타임 수업 강좌 시간표 크롤링

5
Scrapy
Scrapy Decodo Python

Scrapy proxy authentication example for Decodo

5
scrap-superloto
scrap-superloto erseler Python

A web scrapping project to fetch all lottery winning numbers, date, prizes etc.

5
crawler-webpage
crawler-webpage hatttruong Python

crawling data from vnexpress.net for my subjects at school

5
bm25-ranking-php
bm25-ranking-php sumairz PHP

Ranked the reuter's document using bm25 ranking algorithm.

5
ScrapySub
ScrapySub ENGRZULQARNAIN Python

ScrapySub is a Python library designed to recursively scrape website content, including subpages. It fetches the visible text from web pages and store...

5
gumbo-parser-cpp
gumbo-parser-cpp cschanaj C++

C++ Library to Extract Information from the Google Gumbo HTML Parse Tree

5
Learning-By-Crawling
Learning-By-Crawling PSigfridsson Haskell

Riot Games API crawler and a machine learning project. Created in Haskell.

5
Shopee-Crawler
Shopee-Crawler minhlong TypeScript

Crawl data from the shopee.vn

5
craw-BadanPusatStatistik
craw-BadanPusatStatistik RomySaputraSihananda Python

craw-BadanPusatStatistik adalah program untuk mengambil data dari website Badan Pusat Statistik Indonesia.

5
Spider
Spider enigma522 Python

This asynchronous web crawler is designed for reconnaissance tasks. It crawls a specified URL up to a defined depth, extracting useful information

5
sitemapr
sitemapr alphaprime-dev Python

sitemapr is a library that generates sitemaps for SPA websites by reading site structures defined in declarative configuration.

5
order-metrics-data-automation
order-metrics-data-automation ssaadh Ruby

OrderMetrics.io Automation for data from there to Google Sheets (spreadsheets). Mainly used for e-commerce Shopify, Facebook advertising, Google Adwor...

5
GooglePlayDatabaseMirror
GooglePlayDatabaseMirror BaseMax PHP

Repository of designing a crawler script to update a mirror database from Google Play on PHP.

5
crwlr
crwlr busterc JavaScript

🕷a minimal puppeteer crawler api

5
CountriesSearchEngine
CountriesSearchEngine anshul1004 Python

A search engine built to retrieve geographical information of any country.

5
woocommerce-scraper
woocommerce-scraper vanquan805 PHP

The best scraping solution for WooCommerce

5
Puppeteer
Puppeteer Decodo JavaScript

Puppeteer proxy authentication example for Decodo

5
namu-soup
namu-soup anteater333 JavaScript

숲Soup - 나무위키 인기 검색어 크롤러

5
estela-entrypoint
estela-entrypoint bitmakerla Python

estela entrypoint for job runner 🕸

5
craw-Pinterest
craw-Pinterest RomySaputraSihananda Python

melakukan web scraping dan mengambil gambar berdasarkan keyword pencarian pinterest.

5
Migale
Migale Velka-DEV C#

Migale was born out of a need to extract data quickly and with a very low development cost. This package is not intended to replace complete and struc...

5
Fundamentus_scraping
Fundamentus_scraping GuilhermeUchoa Python

Crawler do site Fundamentus.com com o uso do framework scrapy, tanto da aba detalhada como a de resumo. (Todas as infomações)

5
tider
tider ZLotusRain Python

A fast, simple, extensible and powerful framework for web crawling.

5
proxycrawl-java
proxycrawl-java crawlbase Java

ProxyCrawl Java library for scraping and crawling

5
naver_webtoon
naver_webtoon hey411 Python

딥러닝과 머신러닝을 활용한 독자 반응 기반 웹툰 데뷔작 성공 예측 모델

5
webArchive
webArchive mrrfv JavaScript

Crawls websites and saves found URLs to a file.

5
bot-safe-agents
bot-safe-agents ivan-sincek Python

A library for fetching a list of bot-safe user agents.

5
Text_mining
Text_mining Jimin980921 Jupyter Notebook

텍스트마이닝을 이용한 소비자분석 _네이버쇼핑 리뷰크롤링

5
Scraping-IMDB
Scraping-IMDB RaedAddala Jupyter Notebook

This Python script extracts comprehensive movie data from IMDB, focusing on top-grossing movies from 1920 to 2025. The scraper collects detailed infor...

5
FALL
FALL DevanshRaghav75 Python

A automated penetration testing tool

5
amazon_luwak_coffee_scraper
amazon_luwak_coffee_scraper omar-elmaria Python

This repo contains a Python-based web crawler that scrapes data on Luwak coffee products from amazon.de. It is designed to surpass Amazon's anti-bot m...

5
coronaflight-hkg
coronaflight-hkg poyea JavaScript

😷 Crawler and history manager for dangerous, coronavirus-infected flights to Hong Kong (VHHH)

5
Instagram-image-downloader
Instagram-image-downloader KAispread Java

💟 Instagram Image Downloader

5
PlatformsCrawler
PlatformsCrawler eric2788 Go

多平台爬蟲 + 模塊化管理,用於搜集資料並經 redis pubsub 發送

5
FirstSelenium
FirstSelenium BaseMax Python

Some sample codes for using selenium in Python just for fun.

4
node-crawling-framework
node-crawling-framework JimmyLaurent JavaScript

✨ NodeJs crawling & scraping framework heavily inspired by Scrapy

4
simple-crawler
simple-crawler hseghetti Shell

Simple crawler using apache nutch and elasticsearch

4
bonobo-selenium
bonobo-selenium python-bonobo Python

PRE-ALPHA - Write web crawlers using Bonobo

4
Krawler
Krawler YektaDev Kotlin

A configurable HTML Crawler written in Kotlin (JVM), powered by Coroutines, Kotlin Serialization (JSON), Ktor Client, Exposed, and SQLite.

4
rag-backend
rag-backend thevladdo HTML

Retrieval-Augmented Generation server with Pinecone and OpenAI

4
mindfactory_crawling
mindfactory_crawling RobMcH Python

A Python 3 Crawler for Mindfactory.de

4
EPhoto360
EPhoto360 LordDeveloper PHP

Create text effects online , Effects online for free, photo frames, make face photo montages, custom greeting cards, add vintage filters, turn photos...

4
buscando-meu-carro
buscando-meu-carro FelipeGaleao Jupyter Notebook

O buscando-meu-carro é um repositório que contém um projeto Python que utiliza técnicas de scrapping para criar um Data Warehouse (DW) contendo inform...

4