Topic

benchmark

Repositories (1623)

dnstrace
dnstrace redsift Go

Command-line DNS benchmark

82
ruby-performance-tools
ruby-performance-tools JuanitoFatas

List of Ruby Tools for doing Performance.

82
MedAgentBench
MedAgentBench stanfordmlgroup Python

MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents

82
logs-benchmark
logs-benchmark SigNoz Shell

Logs performance benchmark repo: Comparing Elastic, Loki and SigNoz

82
LearnedSort
LearnedSort anikristo C++

Learned Sort: a model-enhanced sorting algorithm

81
php-orm-benchmark
php-orm-benchmark sergeyklay PHP

The benchmark to compare performance of PHP ORM solutions.

81
llm-benchmark
llm-benchmark lework Python

LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。

81
tasty-bench
tasty-bench Bodigrim Haskell

Featherlight benchmark framework, drop-in replacement for criterion and gauge.

81
WritingBench
WritingBench X-PLUG Python

WritingBench: A Comprehensive Benchmark for Generative Writing

81
vllm-safety-benchmark
vllm-safety-benchmark UCSC-VLAA Python

[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"

81
perforator
perforator zyedidia Go

Record "perf" performance metrics for individual functions/regions of an ELF binary.

81
benchinit
benchinit mvdan Go

Benchmark the init cost of Go packages

81
http-benchmark-tornado
http-benchmark-tornado junneyang Python

基于Python Tornado的高性能http性能测试工具。Java Netty版: https://github.com/junneyang/http-benchmark-netty 。

81
gobench
gobench gobench-io HTML

A benchmark framework based on Golang

80
indonlg
indonlg IndoNLP Python

The first-ever vast natural language generation benchmark for Indonesian, Sundanese, and Javanese. We provide multiple downstream tasks, pre-trained I...

80
MDBenchmark
MDBenchmark bio-phys Python

Quickly generate, start and analyze benchmarks for molecular dynamics simulations.

80
WildScenes
WildScenes csiro-robotics Python

[IJRR2024] The official repository for the WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Natural Environments

80
ASR_benchmark
ASR_benchmark Franck-Dernoncourt Python

Program to benchmark various speech recognition APIs

80
vector-db-benchmark
vector-db-benchmark myscale Python

Framework for benchmarking fully-managed vector databases

79
sugar-crepe
sugar-crepe RAIVNLab Python

[NeurIPS 2023] A faithful benchmark for vision-language compositionality

79
DotNet-Collections-Benchmark
DotNet-Collections-Benchmark mjebrahimi C#

🚀 A comprehensive performance comparison benchmark between different .NET collections.

79
router-benchmark
router-benchmark delvedor JavaScript

Benchmark of the most commonly used http routers

79
gocannon
gocannon kffl Go

:boom: Performance-focused HTTP load testing tool written in Go

78
contender
contender flashbots Rust

run highly configurable benchmarks for EVM-based execution nodes over JSON-RPC

78
Uniaa
Uniaa KwaiVGI Python

Unified Multi-modal IAA Baseline and Benchmark

78
Mercury
Mercury Elfsong Jupyter Notebook

Code Efficiency Benchmark

78
PointCloudMatters
PointCloudMatters HaoyiZhu Python

[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning

78
OpenRCA
OpenRCA microsoft Python

[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?

77
php-di-container-benchmarks
php-di-container-benchmarks kocsismate PHP

Benchmark for some popular PHP Dependency Injection Containers.

77
arewefastyet
arewefastyet vitessio Go

Automated Benchmarking System for Vitess

76
RWKU
RWKU jinzhuoran Python

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024

75
sightglass
sightglass bytecodealliance C

A benchmark suite and tool to compare different implementations of the same primitives.

75
benchable
benchable MatheusRich Ruby

Write benchmarks without the hassle.

75
Elysium
Elysium Hon-Wong Python

[ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM

74
lhbench
lhbench lhbench Scala

Lakehouse storage system benchmark

74
TLCBench
TLCBench tlc-pack Python

Benchmark scripts for TVM

74
ipc_benchmark
ipc_benchmark detailyang Python

IPC benchmark on Linux

74
apebench
apebench tum-pbs Python

[Neurips 2024] A benchmark suite for autoregressive neural emulation of PDEs. (≥46 PDEs in 1D, 2D, 3D; Differentiable Physics; Unrolled Training; Roll...

73
rsb
rsb gamelife1314 Rust

a http server benchmark tool written in rust 🦀

73
evoeval
evoeval evo-eval Python

EvoEval: Evolving Coding Benchmarks via LLM

73
the-cpp-abstraction-penalty
the-cpp-abstraction-penalty germandiagogomez C++

Modern C++ benchmarking

73
ncnn-benchmark
ncnn-benchmark BUG1989 CMake

The benchmark of ncnn that is a high-performance neural network inference framework optimized for the mobile platform

72
CMI
CMI zju-vipa Python

[IJCAI-2021] Contrastive Model Inversion for Data-Free Knowledge Distillation

72
scalajs-benchmark
scalajs-benchmark japgolly Scala

Benchmarks: write in Scala or JS, run in your browser. Live demo:

72
go-interface-values
go-interface-values akutz Go

When storing a value in a Go interface allocates memory on the heap.

72
go-cache-benchmark
go-cache-benchmark vmihailenco Go

Cache benchmark for Golang

72
MedXpertQA
MedXpertQA TsinghuaC3I Python

[ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

72
One-shot-Human-Parsing
One-shot-Human-Parsing Charleshhy Python

[AAAI 2021] (oral) Progressive One-shot Human Parsing, [TPAMI 2023] End-to-end One-shot Human Parsing

72
web-components-benchmark
web-components-benchmark vogloblinsky JavaScript

Web Components benchmark for a various Web Components technologies

72
BenchmarkFcns
BenchmarkFcns mazhar-ansari-ardeh C++

A Python and MATLAB implementation of mathematical test functions for benchmarking optimization algorithms.

72