:spider: This is an ES6 adaptation of the original PHP library CrawlerDetect. This library will help you detect bots/crawlers/spiders via the useragen...
LFITester is a Python3 program that automates the detection and exploitation of Local File Inclusion (LFI) vulnerabilities on a server.
Dynamically configurable crawler
Verify that a request is from Google crawlers using Google's DNS verification steps
Amazon price tracker
Selenium automation test framework
Pagser is a simple, extensible, configurable library for parsing and deserializing HTML pages into structs, based on goquery and struct tags, for Golang crawlers
web scraping extension
This repo is part of a blog series on several web scraping projects where we will explore scraping techniques to crawl data from simple websites to we...
Backup your friends' Instagram Stories forever and get to keep them even after 24 hours.
Simple Weibo Scraper
Proxy List Scraper
This script scrapes the HTML from different web pages to get the information from the video (XVideos, PornHub, RedTube) and you can use it in your own...
A package to get a list of user agents based on filters such as operating system, software name, etc.
A novel-crawling program written in Python
Discover hidden deepweb pages
a puppeteer walker 🕷 🕸
NTU CEIBA data download tool
Tumblr parsing website
Powerful web scraping framework for Crystal
Crawl sites for RSS, Atom, and JSON feeds.
Some classic web crawler projects.
Some Scrapy and web.py examples
xSMTP 🦟 Lightning-fast, multithreaded SMTP scanner targeting open-relay and unsecured servers in multiple network ranges.
The LAW next generation crawler.
Crawls and organizes high-quality "web security" articles from sites such as Freebuf, Anquanke (安全客), Xianzhi (先知), and Knownsec (知道创宇)
Scrape Learning (ctrip)
High-speed, highly customizable Tumblr download tool.
fetchman is a simple, easy-to-use crawler framework
Just a crawler based on tg-cli for Telegram. Now deprecated; please use telegram-export.
A simplified, directed, customizable website crawler
Your preferred open source focused crawler for the deep web.
A collection of Python tools, scripts and utilities to make your life easier.
Ultra-fast asynchronous coroutine-based Python crawler
A Node.js-based fund data crawler; the crawled data is stored on GitHub at @nullpointer/fund-data.
A quick-start Python multi-threaded crawler
python crawler spider
A crawler for automated functional testing of a web application
Search through all your personal data efficiently like web search.
Crawlzone is a fast asynchronous internet crawling framework for PHP.
Privacy Web Search Engine (not meta, own crawler)
Instagram user's photos and videos downloader. Download all media files from any username. Working 2022!
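Practice project: Comment of Interest, e-commerce review text data mining (crawler + opinion extraction + sentence-level and opinion-level sentiment analysis)
Golang crawler that scrapes the used-car product database of Autohome (汽车之家)
An IP proxy pool implemented in Golang. Technologies involved: go gorm proxy proxypool ip crawler mysql viper cobra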
When you solve a problem on Baekjoon Online Judge, it automatically commits and pushes to the remote repository.
robots.txt file parsing and checking for R
Darkweb Crawler Project
Python Crawler
Tiktok (Musically) PHP scraper