Topic

benchmark

Repositories (1623)

dnstrace
dnstrace redsift Go

Command-line DNS benchmark

82
MedAgentBench
MedAgentBench stanfordmlgroup Python

MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents

82
logs-benchmark
logs-benchmark SigNoz Shell

Logs performance benchmark repo: Comparing Elastic, Loki and SigNoz

82
ruby-performance-tools
ruby-performance-tools JuanitoFatas

List of Ruby Tools for doing Performance.

82
php-orm-benchmark
php-orm-benchmark sergeyklay PHP

The benchmark to compare performance of PHP ORM solutions.

81
llm-benchmark
llm-benchmark lework Python

LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。

81
tasty-bench
tasty-bench Bodigrim Haskell

Featherlight benchmark framework, drop-in replacement for criterion and gauge.

81
WritingBench
WritingBench X-PLUG Python

WritingBench: A Comprehensive Benchmark for Generative Writing

81
vllm-safety-benchmark
vllm-safety-benchmark UCSC-VLAA Python

[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"

81
http-benchmark-tornado
http-benchmark-tornado junneyang Python

基于Python Tornado的高性能http性能测试工具。Java Netty版: https://github.com/junneyang/http-benchmark-netty 。

81
benchinit
benchinit mvdan Go

Benchmark the init cost of Go packages

81
perforator
perforator zyedidia Go

Record "perf" performance metrics for individual functions/regions of an ELF binary.

81
LearnedSort
LearnedSort anikristo C++

Learned Sort: a model-enhanced sorting algorithm

81
WildScenes
WildScenes csiro-robotics Python

[IJRR2024] The official repository for the WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Natural Environments

80
ASR_benchmark
ASR_benchmark Franck-Dernoncourt Python

Program to benchmark various speech recognition APIs

80
indonlg
indonlg IndoNLP Python

The first-ever vast natural language generation benchmark for Indonesian, Sundanese, and Javanese. We provide multiple downstream tasks, pre-trained I...

80
MDBenchmark
MDBenchmark bio-phys Python

Quickly generate, start and analyze benchmarks for molecular dynamics simulations.

80
gobench
gobench gobench-io HTML

A benchmark framework based on Golang

80
DotNet-Collections-Benchmark
DotNet-Collections-Benchmark mjebrahimi C#

🚀 A comprehensive performance comparison benchmark between different .NET collections.

79
router-benchmark
router-benchmark delvedor JavaScript

Benchmark of the most commonly used http routers

79
vector-db-benchmark
vector-db-benchmark myscale Python

Framework for benchmarking fully-managed vector databases

79
sugar-crepe
sugar-crepe RAIVNLab Python

[NeurIPS 2023] A faithful benchmark for vision-language compositionality

79
gocannon
gocannon kffl Go

:boom: Performance-focused HTTP load testing tool written in Go

78
contender
contender flashbots Rust

run highly configurable benchmarks for EVM-based execution nodes over JSON-RPC

78
Uniaa
Uniaa KwaiVGI Python

Unified Multi-modal IAA Baseline and Benchmark

78
PointCloudMatters
PointCloudMatters HaoyiZhu Python

[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning

78
Mercury
Mercury Elfsong Jupyter Notebook

Code Efficiency Benchmark

78
php-di-container-benchmarks
php-di-container-benchmarks kocsismate PHP

Benchmark for some popular PHP Dependency Injection Containers.

77
OpenRCA
OpenRCA microsoft Python

[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?

77
arewefastyet
arewefastyet vitessio Go

Automated Benchmarking System for Vitess

76
benchable
benchable MatheusRich Ruby

Write benchmarks without the hassle.

75
sightglass
sightglass bytecodealliance C

A benchmark suite and tool to compare different implementations of the same primitives.

75
RWKU
RWKU jinzhuoran Python

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024

75
ipc_benchmark
ipc_benchmark detailyang Python

IPC benchmark on Linux

74
TLCBench
TLCBench tlc-pack Python

Benchmark scripts for TVM

74
Elysium
Elysium Hon-Wong Python

[ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM

74
lhbench
lhbench lhbench Scala

Lakehouse storage system benchmark

74
rsb
rsb gamelife1314 Rust

a http server benchmark tool written in rust 🦀

73
the-cpp-abstraction-penalty
the-cpp-abstraction-penalty germandiagogomez C++

Modern C++ benchmarking

73
evoeval
evoeval evo-eval Python

EvoEval: Evolving Coding Benchmarks via LLM

73
apebench
apebench tum-pbs Python

[Neurips 2024] A benchmark suite for autoregressive neural emulation of PDEs. (≥46 PDEs in 1D, 2D, 3D; Differentiable Physics; Unrolled Training; Roll...

73
CMI
CMI zju-vipa Python

[IJCAI-2021] Contrastive Model Inversion for Data-Free Knowledge Distillation

72
scalajs-benchmark
scalajs-benchmark japgolly Scala

Benchmarks: write in Scala or JS, run in your browser. Live demo:

72
BenchmarkFcns
BenchmarkFcns mazhar-ansari-ardeh C++

A Python and MATLAB implementation of mathematical test functions for benchmarking optimization algorithms.

72
ncnn-benchmark
ncnn-benchmark BUG1989 CMake

The benchmark of ncnn that is a high-performance neural network inference framework optimized for the mobile platform

72
MedXpertQA
MedXpertQA TsinghuaC3I Python

[ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

72
go-interface-values
go-interface-values akutz Go

When storing a value in a Go interface allocates memory on the heap.

72
One-shot-Human-Parsing
One-shot-Human-Parsing Charleshhy Python

[AAAI 2021] (oral) Progressive One-shot Human Parsing, [TPAMI 2023] End-to-end One-shot Human Parsing

72
go-cache-benchmark
go-cache-benchmark vmihailenco Go

Cache benchmark for Golang

72
web-components-benchmark
web-components-benchmark vogloblinsky JavaScript

Web Components benchmark for a various Web Components technologies

72