Most popular scraping repositories and open source projects

scraper

Webscraping by using electron

3   4   4  

ScrapeDroid

An effective android library that can be used for web scraping by supp...

1   4   4  

criticalsyncing

Get the different prospective on news

1   4   4  

photo-dataset-scraper

Tools to create a labelled dataset of photos for deep learning image c...

1   4   4  

AISearchImage

Intelligent Search Engine Of Services Image-Based

0   4   4  

html-parser

Xpath, regex and CSS Selector parser

1   4   4  

news-lens

Filtering Political Bias in American News Media

1   4   4  

naruto-name-generator

Foi treinada uma rede neural recorrente (RNN) em um conjunto de dados...

0   4   4  

srcget

Download latest *stable* source

1   4   4  

data-scraper

❤️ The data scraper for big data

5   4   4  

mojok_archive

Website archiving project for Mojok.co website, based on scrapping.

1   4   4  

python-sentimentbrexit-analysis

A project about sentiment analysis and graph & network analysis made w...

1   4   4  

wpoke

python library tool to gather wordpress information

0   4   4  

bitsky-builder

Build BitSky Desktop Application, Web Application, and Docker images

0   4   4  

YTscheduledVideos2Ical

Extract your scheduled videos publish date to a ical file so that it a...

0   4   4  

MP-Transportation-Analysis

Program to analyze MP Transportation data and comparing

1   4   4  

lycos

✨All the goodies you'll ever need to scrape the web (NodeJs / Browse...

1   4   4  

node-itunes-rss

Overengineered iTunes rss feed lister https://rss.itunes.apple.com/

0   4   4  

samehadaku.sh

link graber for samehadaku.tv

1   4   4  

Job-Scraper-Bot

幫朋友做好玩的Telegram機器人,已部署到Heroku

1   4   4  

scraping

Resources on scraping

2   4   4  

youcos

:rocket: A simple Python package for scraping YouTube videos and comme...

2   4   4  

Soup-for-Pharo

3   4   4  

spider_project

Template of scraping project using Grab scraping framework

2   4   4  

scielo-scraping

Download all files from SciELO

1   4   4  

git-screened

Automating Github Repository Assessment.

1   4   4  

piculet

Extract data from XML or HTML documents using XPath.

0   4   4  

Scrapy-Middleware

Scrapy Middleware for proxy authentication with Smartproxy

1   4   4  

cuphic

Transform or scrape Hiccup with a declarative DSL.

0   4   4  

OLX-Crawler

This repository contains code of olx crawler to extract public phone n...

1   4   4  

order-metrics-data-automation

OrderMetrics.io Automation for data from there to Google Sheets (sprea...

2   4   4  

comicthread

Get random comics from multiple sources (XKCD, Commitstrip, ...)

0   4   4  

yahoo-login

Gobble up an authenticated Yahoo session by mechanizing the login proc...

0   4   4  

scrape-x

Simple .NET library that provides generic web scraping abilities using...

2   4   4  

site-pages-graph

Site pages into graph for some SEO-analysis.

1   4   4  

chrome-vision

Advanced cross-platform web automation with a convenient Go API

0   4   4  

MappingYoutube

Generate a visual map of youtube by using channel relations.

1   4   4  

hltb-scraper

A web spider that crawls HowLongToBeat to extract game and completion...

0   4   4  

search_it

Asyncio package for scraping major Search Engines, supports pagination...

7   4   4  

vsco-js

VSCO image downloader

0   4   4  

mnemonic-maker-backend

A web scraping application that generates acrostic mnemonic devices by...

1   4   4  

classify

Classify is an efficient tool for extraction of structured field seque...

0   4   4  

searchpro

gives top 5 results of google,yahoo,bing

0   4   4  

gradcafestats

Visualize/filter results from TheGradCafe.com

0   4   4  

Python-Regex

Regular expressions using re Python module

23   4   4  

web-scraping-with-playwright

🎭 Playwright allows for reliable end-to-end testing for web applicati...

0   4   4  

create-proxy

HTTP / HTTPS Proxy

1   4   4  

duckduckgo-api

An unofficial Duckduckgo.com API with performance and simplicity in mi...

0   4   4  

BuzzPlan

A scheduling app for Georgia Tech with some advanced tools

0   4   4  

biblescholar

Tools for multi-translation Bible search over voice apis (currently al...

1   4   4