ClawBench

Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM judge. Top score 33.3%.

benchmark

View on GitHub Website

93 Stars

8 Forks

93 Watchers

Python Language

apache-2.0 License

100 SrcLog Score

Cost to Build

$28.5K

Market Value

$81.5K

How is this calculated?

Growth over time

2 data points · 2026-04-18 → 2026-04-26

Stars Forks Watchers

💬

How do you feel about this project?

Ask AI about ClawBench

Question copied to clipboard

What is the reacher-z/ClawBench GitHub project? Description: "Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM judge. Top score 33.3%.". Written in Python. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone ClawBench

Clone via HTTPS

git clone https://github.com/reacher-z/ClawBench.git

Clone via SSH

[email protected]:reacher-z/ClawBench.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the ClawBench issue tracker:

Open GitHub Issues

Similar to ClawBench

netdata fashion-mnist FrameworkBenchmarks BenchmarkDotNet jmeter awesome-semantic-segmentation sysbench hyperfine tsung benchmark_results across web-frameworks php-framework-benchmark jsperf.com go-web-framework-benchmark huststore phoronix-test-suite Attabench ann-benchmarks sbt-jmh caffenet-benchmark chillout IocPerformance prophiler TBCF NBench sympact awesome-http-benchmark BlurTestAndroid pytest-benchmark