Topic

scraping

Repositories (1626)

Django-Search-Engine
Django-Search-Engine SandyUndefined Python

Django Search Engine, Scrape links and text from different search engine as mentioned in Readme and Display it.

12
aws-outages
aws-outages outages

Track AWS outages via Git History

12
scrapingant-client-js
scrapingant-client-js ScrapingAnt JavaScript

ScrapingAnt API client for JavaScript / Node.js.

12
cloudflare-iuam-solver
cloudflare-iuam-solver ninja-beans Java

CloudflareIuamSolver is the Java library for breaking through the Cloudflare's "I am Under Attack Mode"

12
nautiljon-scraper
nautiljon-scraper barthofu JavaScript

🍙 An unofficial scraping tool for https://nautiljon.com, a french anime and manga data website

12
FAQs-automatic-scraper
FAQs-automatic-scraper Mogady Python

Automatic Scraping project for extracting FAQ and Help center articles

12
pyjpboatrace
pyjpboatrace hmasdev Python

pyjpboatrace :speedboat: provides you with useful tools for data analysis and auto-betting for boatrace.

12
SuperStarInfoFetch
SuperStarInfoFetch brainiac19 Python

超星学习通 收件箱、任务、章节、讨论、作业、考试、资料、错题集、学习记录 获取,以及资料批量下载(学生端)

12
game-watch
game-watch Agreon TypeScript

Overview of game release dates, prices and news

12
Cutlery
Cutlery Gingerbreadfork Python

Python Script for Copywriters to Gather Data from Competing Content and Find Keyword Overlap

12
TD-Spider
TD-Spider LandWhale2 Go

Via Text Density Simple Web Crawler With Go

12
puppeteer-copy-content
puppeteer-copy-content Aziz-AXG JavaScript

A basic web script for copying contents

12
spido
spido yazan-zoghbi TypeScript

Web crawler/spider for node.js & nest.js server

12
google-scholar-scraper
google-scholar-scraper sheikhartin Jupyter Notebook

Export the articles and reports from Google Scholar easily! (The spider structure is inspired by the Scrapy framework)

12
ReicheltAPI
ReicheltAPI jkreucher Python

A very simple Python web scraping module for Reichelt Elektronik

12
Sanfoundry-To-PDF
Sanfoundry-To-PDF AhmedMohamedAbdelaty JavaScript

A Chrome extension designed to help students and educators easily convert multiple-choice questions from sanfoundry.com into PDF format, facilitating...

12
omitplastic
omitplastic gavinmgrant TypeScript

A full-stack e-commerce affiliate app focusing on reducing plastic consumption that uses Next.js and Prisma.

12
webtric
webtric destilabs Shell

Universal Python script to scrape many typical websites

12
web-scraping-lvl1
web-scraping-lvl1 FirasKahlaoui Jupyter Notebook

The "web-scraping-lvl1" project is a beginner-level exercise in web scraping using Python's Beautiful Soup library and Requests module.

12
LeadGenerationAPI
LeadGenerationAPI Ambitious-Concepts-Labs Python

This API is built for lead generation and provides users with a streamlined platform for capturing and tracking leads, which integrates with existing...

12
scrapetoon
scrapetoon RoloEdits Rust

A tool for scraping information from Webtoons.

12
Web2LLM
Web2LLM yamasammy Python

An advanced Python tool for extracting data from websites, cleaning the content, and converting it to high-quality Markdown for optimal use by LLM sys...

12
rcdb-api
rcdb-api fabianrguez TypeScript

🎢RCDB non official Rest API created by scrape RCDB website.

12
webtap
webtap webtap-ai

AI web scraping python library for efficient and reliable web scraping.

12
price_tracker
price_tracker emredkyc TypeScript

Dive into web scraping and build a Next.js 14 eCommerce price tracker within one project that teaches you data scraping, cron jobs, sending emails, de...

12
facebook-event-aggregator
facebook-event-aggregator Denperidge HTML

Scrape all upcoming events from specific FB pages, export them to a static website & .ics files, publish it automatically to Git(Hub Pages).

12
zenrows-python-sdk
zenrows-python-sdk ZenRows Python

SDK to access ZenRows API directly from Python. We handle proxies rotation, headless browsers and CAPTCHAs for you.

12
YN_Exchange
YN_Exchange YashaNajafi Python

Crypto python directory (Open source)

12
covid19br-pub
covid19br-pub fgrehm Ruby

Projeto de monitoramento de publicações oficiais relacionadas a COVID-19 no Brasil.

11
gathering_data
gathering_data dkhramov R

Примеры к книге "Сбор данных в Интернете на языке R".

11
Predicting-Trump
Predicting-Trump mm909 Python

Predicting Trump's Tweets With A Character Level Recurrent Neural Network - Generating author and task specific text with a LSTM RNN.

11
hll-algorithm-sample
hll-algorithm-sample gvolpe Scala

HLL Algorithm and Web Scraping sample

11
scraper-engine
scraper-engine lukluk JavaScript

Async Scraper Framework Based Nodejs

11
web-scraper
web-scraper mdibaiee Python

simple web scraping: extract texts and links as CSV, and save images of multiple websites

11
journal
journal raphaelberly Python

A movie journal coupled with open IMDb data, and a Flask web-app for easy movie insertion.

11
BK
BK wb-08 Python

Finds a fork in the tennis matches betting marathonbet and plusminus

11
vidpy
vidpy preeteshjain Python

A Python based customizable script for scraping links to videos hosted on any website. Based on Scrapy and BeautifulSoup.

11
scrapy-proxycrawl-middleware
scrapy-proxycrawl-middleware crawlbase Python

Scrapy middleware interface to scrape using ProxyCrawl proxy service

11
Advanced_PHP_Scrapping
Advanced_PHP_Scrapping FavyTeam PHP

Enhanment Scrapping API for six hotel booking website from Expedia.com, Booking.com, Bookhotelbeds.com. Hotels.com, Bestday.com, despegar.com

11
camp-collective
camp-collective the-eater Python

An incomplete bandcamp python toolset (mainly downloads your collection)

11
Nepali-news-portal-kbd
Nepali-news-portal-kbd hemanta212 Python

Online webapp that scrapes news from different new portals of Nepal and worldwide. Hosted at heroku.

11
India-Trade-Data
India-Trade-Data lakshyaag Jupyter Notebook

A web scraper written in Python to gather trade data for India across commodities and countries

11
music-festivals
music-festivals BBC-Data-Unit Python

Festivals dominated by male acts, study shows, as Glastonbury begins

11
TED-Scraper
TED-Scraper The-Gupta Jupyter Notebook

Complete Web Scraping of TED.com for Metadata, Transcript, Audio, Video, Images using Parallel Programming

11
get-dream
get-dream Tiger-0512 Python

予約困難な東京ディズニーリゾートの予約を自動化

11
OneFreeMonth
OneFreeMonth CamTosh Python

Get free month on Netflix, OCS... using Selenium and Python

11
Scraping-Jumia-Ecommerce
Scraping-Jumia-Ecommerce krizten Python

Using the Scrapy framework to scrape data consisting of name, brand, rating, price, product URL and image URLs of laptops on Jumia e-commerce (https:/...

11
Saffron
Saffron UniStudents HTML

A fairly intuitive & powerful framework that enables you to collect & save articles and news from all over the web.

11
Price-Monitoring-Tool
Price-Monitoring-Tool akshaysharmajs Python

Tool for retrieving cheapest price for any product comparing across e-commerce platforms like Amazon, Flipkart, Snapdeal, Ebay

11
t0m
t0m MattMoony Python

Tellonym scraper and information gathering tool. Might be useful for getting background-info on a person, etc. 🔍

11