Most popular benchmark repositories and open source projects

http-benchmark-tornado

基于Python Tornado的高性能http性能测试工具。Java Netty版: https://gith...

48   81   81  

trajectopy

Trajectopy - Trajectory Evaluation in Python

4   81   81  

llm-benchmark

LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。

14   81   81  

tasty-bench

Featherlight benchmark framework, drop-in replacement for criterion an...

13   81   81  

WritingBench

WritingBench: A Comprehensive Benchmark for Generative Writing

9   81   81  

vllm-safety-benchmark

[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are...

4   81   81  

WildScenes

[IJRR2024] The official repository for the WildScenes: A Benchmark for...

4   80   80  

ASR_benchmark

Program to benchmark various speech recognition APIs

18   80   80  

benchinit

Benchmark the init cost of Go packages

3   80   80  

MDBenchmark

Quickly generate, start and analyze benchmarks for molecular dynamics...

17   80   80  

perforator

Record "perf" performance metrics for individual functions/regions of...

5   80   80  

router-benchmark

Benchmark of the most commonly used http routers

17   79   79  

MedAgentBench

MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medica...

17   79   79  

vector-db-benchmark

Framework for benchmarking fully-managed vector databases

19   79   79  

sugar-crepe

[NeurIPS 2023] A faithful benchmark for vision-language compositionali...

9   79   79  

gocannon

:boom: Performance-focused HTTP load testing tool written in Go

8   78   78  

contender

run highly configurable benchmarks for EVM-based execution nodes over...

27   78   78  

Uniaa

Unified Multi-modal IAA Baseline and Benchmark

5   78   78  

Mercury

Code Efficiency Benchmark

9   78   78  

PointCloudMatters

[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Diffe...

3   78   78  

LearnedSort

Learned Sort: a model-enhanced sorting algorithm

12   78   78  

php-di-container-benchmarks

Benchmark for some popular PHP Dependency Injection Containers.

26   77   77  

gobench

A benchmark framework based on Golang

15   77   77  

OpenRCA

[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of...

7   77   77  

arewefastyet

Automated Benchmarking System for Vitess

57   76   76  

benchable

Write benchmarks without the hassle.

1   75   75  

sightglass

A benchmark suite and tool to compare different implementations of the...

36   75   75  

RWKU

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language...

6   75   75  

lhbench

Lakehouse storage system benchmark

10   74   74  

Elysium

[ECCV 2024] Elysium: Exploring Object-level Perception in Videos via M...

4   74   74  

TLCBench

Benchmark scripts for TVM

28   74   74  

ipc_benchmark

IPC benchmark on Linux

55   74   74  

the-cpp-abstraction-penalty

Modern C++ benchmarking

1   73   73  

indonlg

The first-ever vast natural language generation benchmark for Indonesi...

14   73   73  

rsb

a http server benchmark tool written in rust 🦀

4   73   73  

apebench

[Neurips 2024] A benchmark suite for autoregressive neural emulation o...

1   73   73  

evoeval

EvoEval: Evolving Coding Benchmarks via LLM

8   73   73  

go-interface-values

When storing a value in a Go interface allocates memory on the heap.

7   72   72  

MedXpertQA

[ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning an...

0   72   72  

One-shot-Human-Parsing

[AAAI 2021] (oral) Progressive One-shot Human Parsing, [TPAMI 2023] En...

8   72   72  

web-components-benchmark

Web Components benchmark for a various Web Components technologies

19   72   72  

ncnn-benchmark

The benchmark of ncnn that is a high-performance neural network infere...

19   72   72  

scalajs-benchmark

Benchmarks: write in Scala or JS, run in your browser. Live demo:

7   72   72  

go-cache-benchmark

Cache benchmark for Golang

14   72   72  

CMI

[IJCAI-2021] Contrastive Model Inversion for Data-Free Knowledge Disti...

17   72   72  

logbench

Golang logging library benchmarks

15   71   71  

BenchmarkFcns

A Python and MATLAB implementation of mathematical test functions for...

14   71   71  

LruClockCache

A low-latency LRU approximation cache in C++ using CLOCK second-chance...

6   71   71  

DotNet-Collections-Benchmark

🚀 A comprehensive performance comparison benchmark between different...

7   71   71  

TaskMeAnything

[NeurIPS 2024] A task generation and model evaluation system for mult...

3   71   71