Most popular benchmark repositories and open source projects

cpp-serialization-benchmark

Comparison of C++ Serialization Libraries for Graph Data

3   35   35  

MaskedFaceRepresentation

Masked face recognition focuses on identifying people using their faci...

6   35   35  

ConBench

[NeurIPS'24] Official implementation of paper "Unveiling the Tapestry...

2   35   35  

sceneflow_from_blender

Get 3D motion vectors / scene flow directly from Blender

4   35   35  

GenoArmory

GenoArmory: A Unified Evaluation Framework for Adversarial Attacks on...

1   35   35  

EMB

EvoMaster Benchmark (EMB): a set of web/enterprise applications for ex...

20   35   35  

raytriangle-test

Ray-triangle intersection performance tests in various languages

5   35   35  

ViHOS

Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans D...

10   35   35  

hdrhistogram-swift

Swift port of HdrHistogram

8   35   35  

SparkDataset

Instant search for and access to many datasets in PySpark.

8   34   34  

lua-vs-vimscript

A simple benchmark comparing Lua performance to Vimscript (because no...

1   34   34  

LREBench

[EMNLP 2022 Findings] Towards Realistic Low-resource Relation Extracti...

1   34   34  

MACSum

Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable...

3   34   34  

iohk-monitoring-framework

This framework provides logging, benchmarking and monitoring.

15   34   34  

wasm-render

Software 3D renderer & rasteriser written in WASM/C & TypeScript to te...

5   34   34  

goku

goku is an HTTP load testing application written in Rust

3   34   34  

benchbox

🧀 The Benchmark Testing Box

11   34   34  

MileBench

This repo contains evaluation code for the paper "MileBench: Benchmark...

2   34   34  

TypeEvalPy

A Micro-benchmarking Framework for Python Type Inference Tools

2   34   34  

benchmark-privesc-linux

A comprehensive local Linux Privilege-Escalation Benchmark

6   34   34  

Mess-benchmark

A Multiplatform benchmark designed to provide holistic, detailed and c...

6   34   34  

indivi_collection

A collection of std-like containers written in C++11. Features fast un...

2   34   34  

redis-benchmarks-specification

The Redis benchmarks specification describes the cross-language/tools...

13   34   34  

DafnyBench

DafnyBench: A Benchmark for Formal Software Verification

5   34   34  

tiny_qa_benchmark_pp

Tiny QA Benchmark++ a micro-benchmark suite (52-item gold + on-demand...

0   34   34  

MSAD

[VLDB 2023] Model Selection for Anomaly Detection in Time Series

6   34   34  

zapbench

The Zebrafish Activity Prediction Benchmark measures progress on the p...

6   34   34  

rvv-bench-results

A collection of RISC-V Vector (RVV) benchmarks to help developers writ...

8   33   33  

imread_benchmark

I/O benchmark for different image processing Python libraries.

4   33   33  

BeHonest

BeHonest: Benchmarking Honesty in Large Language Models

0   33   33  

critdd

Critical difference diagrams with Python and TikZ

3   33   33  

WfCommons

WfCommons: A Framework for Enabling Scientific Workflow Research and D...

13   33   33  

cmdbench

Quick and easy resource usage monitoring and benchmarking for any comm...

6   33   33  

videocube-toolkit

The official Python toolkit for running experiments and evaluate perfo...

6   33   33  

criterion-table

Generate markdown comparison tables from `cargo-criterion` JSON output

3   33   33  

CIS-Settings

CIS settings bootstrapper for Mac

5   33   33  

rex

Pleasures for Web in Golang

3   33   33  

saca-bench

Collection of Suffix Array Construction Algorithms (SACAs)

4   33   33  

cpp2lua-buindings-battle

Lua <-> C++ bindings libraries benchmark

1   32   32  

rapidash

🔥 Collection of useful JavaScript snippets with automated benchmarks

11   32   32  

swords

The Stanford Word Substitution (Swords) Benchmark

6   32   32  

gwvault

ansible-vault CLI reimplemented in Go

10   32   32  

powerqe

A unified framework of quality enhancement approaches for compressed...

1   32   32  

divergent

LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique w...

1   32   32  

auto-pen-bench

This repo contains the code of the penetration test benchmark for Gen...

5   32   32  

VCR

Official Repo for the paper: VCR: Visual Caption Restoration. Check ar...

2   32   32  

llm-compressive

Longitudinal Evaluation of LLMs via Data Compression

0   32   32  

wasm-score

A benchmark for standalone WebAssembly

5   32   32  

Cotempqa

Code and data for "Living in the Moment: Can Large Language Models Gra...

1   32   32  

ECS.CSharp.Benchmark-common-use-cases

C# ECS Benchmarks

4   32   32