browsertrix-crawler

webrecorder

Run a high-fidelity browser-based web archiving crawler in a single Docker container

Topics: crawling, crawler

1k Stars

137 Forks

1k Watchers

TypeScript Language

agpl-3.0 License

100 SrcLog Score

Cost to Build: $3.77M

Market Value: $17.53M

Growth over time (chart of stars, forks, and watchers; 6 data points, 2025-07-23 to 2026-04-24)

How to clone browsertrix-crawler

Clone via HTTPS

git clone https://github.com/webrecorder/browsertrix-crawler.git

Clone via SSH

git clone git@github.com:webrecorder/browsertrix-crawler.git
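
The two clone forms differ only in the URL scheme, so a small helper can choose between them. The following is a minimal sketch, not part of the project: the function name `clone_url` is hypothetical, and the `--depth 1` (shallow clone) option is an assumption about what a first-time user might want. The command is printed rather than executed so the sketch can be tried without network access.

```shell
#!/bin/sh
# clone_url is a hypothetical helper (not part of browsertrix-crawler).
# It prints the GitHub clone URL for a given protocol, owner, and repository.
clone_url() {
  protocol="$1"
  owner="$2"
  repo="$3"
  case "$protocol" in
    https) printf 'https://github.com/%s/%s.git\n' "$owner" "$repo" ;;
    ssh)   printf 'git@github.com:%s/%s.git\n' "$owner" "$repo" ;;
    *)     echo "unknown protocol: $protocol" >&2; return 1 ;;
  esac
}

# Print the shallow-clone command instead of running it.
# --depth 1 fetches only the latest commit (an assumed preference).
echo "git clone --depth 1 $(clone_url https webrecorder browsertrix-crawler)"
```

To perform the clone, run the printed command directly, or replace `https` with `ssh` if an SSH key is registered with GitHub.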

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the browsertrix-crawler issue tracker: https://github.com/webrecorder/browsertrix-crawler/issues

Similar to browsertrix-crawler

scrapy pyspider webmagic newspaper colly pholcus node-crawler proxy_pool lux scrapy-redis headless-chrome-crawler awesome-crawler toapi haipproxy WechatSogou Photon arachni gocrawl gain scylla DotnetSpider dom-crawler go_spider gecco Python wombat lightcrawler PSpider SwiftLinkPreview ProxyBroker