Most popular benchmark repositories and open source projects

benchmarking-fft project-gemmi C++

choosing FFT library...

150 11 150

segment houbb Java

The jieba-analysis tool for java.（基于结巴分词词库实现的更加灵活优雅易用，高性能的 java 分词实现。支持词性标注。）

150 28 150

NAS-Benchmark antoyang Python

[ICLR 2020] NAS evaluation is frustratingly hard

149 24 149

RHEL9-CIS ansible-lockdown YAML

Automated CIS Benchmark Compliance Remediation for RHEL 9 with Ansible

149 104 149

mqperf softwaremill Scala

148 38 148

TurtleBench mazzzystar Jupyter Notebook

TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.

148 9 148

c2clat rigtorp C++

A tool to measure CPU core to core latency

148 24 148

BALROG balrog-ai Python

Benchmarking Agentic LLM and VLM Reasoning On Games

147 28 147

serverless-faas-workbench ddps-lab Python

FunctionBench : A Suite of Workloads for Serverless Cloud Function Service

147 49 147

HPOBench automl Python

Collection of hyperparameter optimization benchmark problems

147 36 147

bucketbench estesp Go

Go-based framework for running benchmarks against Docker, containerd, runc, or any CRI-compliant runtime

146 38 146

compiler-benchmark nordlow Python

Benchmarks compilation speeds of different combinations of languages and compilers.

146 18 146

MMTrustEval thu-ml Python

A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)

146 10 146

math-parser-benchmark-project ArashPartow C++

C++ Mathematical Expression Parser Benchmark

145 29 145

bsuccinct-rs beling Rust

Rust libraries and programs focused on succinct data structures

144 10 144

ossf-cve-benchmark ossf-cve-benchmark TypeScript

The OpenSSF CVE Benchmark consists of code and metadata for over 200 real life CVEs, as well as tooling to analyze the vulnerable codebases using a va...

144 38 144

ecs andygeiss Go

Build your own Game-Engine based on the Entity Component System concept in Golang.

144 11 144

VPR-methods-evaluation gmberton Python

Easily download and evaluate pre-trained Visual Place Recognition methods. Code built for the ICCV 2023 paper "EigenPlaces: Training Viewpoint Robust...

144 16 144

php-orm-benchmark kenjis PHP

PHP ORM Benchmark

143 14 143

ClassEval FudanSELab Python

Benchmark ClassEval for class-level code generation.

143 15 143

jsbench-me psiho

jsbench.me - JavaScript performance benchmarking playground

143 2 143

video-quality-metrics CrypticSignal Python

Uses FFmpeg to benchmark video encoders to compare VMAF, SSIM and PSNR with different encoder settings.

143 21 143

MMToM-QA chuanyangjin Python

[🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering

143 18 143

memory-maze jurgisp Python

Evaluating long-term memory of reinforcement learning algorithms

142 16 142

benchmarks lmdbjava Java

Benchmark of open source, embedded, memory-mapped, key-value stores available from Java (JMH)

141 22 141

iai-callgrind iai-callgrind Rust

High-precision and consistent benchmarking framework/harness for Rust

141 16 141

plf_nanotimer mattreecebentley C++

A simple C++ 03/11/etc timer class for ~microsecond-precision cross-platform benchmarking. The implementation is as limited and as simple as possible...

141 14 141

TCPDBench alan-turing-institute

The Turing Change Point Detection Benchmark: An Extensive Benchmark Evaluation of Change Point Detection Algorithms on real-world data

141 31 141

web-bench bytedance JavaScript

Web-Bench is a benchmark designed to evaluate the performance of LLMs in actual Web development.

139 11 139

service-mesh-benchmark kinvolk Shell

139 35 139

deepchange PengBoXiangShang

ICCV 2023, project page of the paper "DeepChange: A Long-term Person Re-identification Benchmark"

139 5 139

Windows-2019-CIS ansible-lockdown YAML

Automated CIS Benchmark Compliance Remediation for Windows Server 2019 with Ansible

139 76 139

goku jcaromiq Rust

Goku is an HTTP load testing application written in Rust

139 5 139

TeaStore DescartesResearch Java

A micro-service reference test application for model extraction, cloud management, energy efficiency, power prediction, single- and multi-tier auto-sc...

138 163 138

VPR-datasets-downloader gmberton Python

Automatic download VPR datasets in a standard format

138 18 138

wake-word-benchmark Picovoice Python

wake word engine benchmark framework

137 28 137

golang-benchmarks SimonWaldherr Go

Go(lang) benchmarks - (measure the speed of golang)

136 18 136

arewefastyet mozilla JavaScript

NOT MAINTAINED ANYMORE! New project is located on https://github.com/mozilla-frontend-infra/js-perf-dashboard -- AreWeFastYet is a set of tools used f...

135 50 135