Most popular benchmark repositories and open source projects

TaskMeAnything

[NeurIPS 2024] A task generation and model evaluation system for mult...

3   71   71  

tsnkit

A scheduling and benchmark toolkit for Time-Sensitive Networking in Py...

21   70   70  

crypto-bench

Benchmarks for crypto libraries (in Rust, or with Rust bindings)

11   70   70  

rpc-bench

RPC Benchmark of gRPC, Aeron and KryoNet

12   70   70  

Turbo-Histogram

Fastest Histogram Construction

7   70   70  

http-benchmarks

Benchmarks for common embedded Java and Kotlin web frameworks

10   69   69  

SRsurvey

A Deep Journey into Super-resolution: A Survey, ACM Computing Surveys

7   69   69  

golang-docker-cache

Improved docker Golang module dependency cache for faster builds.

3   69   69  

dataracebench

Data race benchmark suite for evaluating OpenMP correctness tools aime...

29   69   69  

js-diff-benchmark

Simple benchmark for testing your DOM diffing algorithm.

7   69   69  

food-recognition-benchmark-starter-kit

This repository is the main Food Recognition Benchmark template and St...

42   69   69  

scrolls

The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Ove...

9   69   69  

godnsbench

Simple DNS bench util that supports encrypted protocols.

2   69   69  

S-Eval

S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety...

3   69   69  

raid

RAID is the largest and most challenging benchmark for AI-generated te...

22   69   69  

YAIB

🧪Yet Another ICU Benchmark: a holistic framework for the standardizat...

20   69   69  

pdf-extraction-agenda

Overview of pipelines related to PDF to Markdown document processing.

0   69   69  

browserating

Compare performance of macOS browsers based on Speedometer 3.1

1   69   69  

GMAI-MMBench

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards...

3   68   68  

tum-traffic-dataset-dev-kit

TUM Traffic Dataset Development Kit

7   68   68  

hash-bench

Java Hashing, CRC and Checksum Benchmark (JMH)

11   68   68  

database

A digital representation of Sikh Bani and other Panthic texts with a p...

30   68   68  

benchdiff

2   68   68  

python-pytest-harvest

Store data created during your `pytest` tests execution, and retrieve...

9   68   68  

physics-benchmarking-neurips2021

Repo for "Physion: Evaluating Physical Prediction from Vision in Human...

4   68   68  

icor-codon-optimization

RNN-based Codon Optimization Tool. Publication: https://doi.org/10.118...

13   68   68  

IMLCGui

Intel Memory Latency Checker GUI

6   68   68  

FinTSB

FinTSB: A Comprehensive and Practical Benchmark for Financial Time Ser...

10   67   67  

Awsome-Multi-modal-based-PHM

Awsome-Multi-modal-based PHM (基于多模态的故障诊断和预测)

2   67   67  

foundational_fsod

This repository contains the implementation for the paper "Revisiting...

5   67   67  

llm-benchmark

A list of LLM benchmark frameworks.

7   67   67  

TFM

Tyler's Frame Machine is a simple, free, educational, and portable too...

4   67   67  

pdf-text-extraction-benchmark

A project about benchmarking and evaluating existing PDF extraction to...

11   67   67  

smart-beta-portfolio-optimization

Built a smart beta portfolio and compared it to a benchmark index by c...

27   67   67  

ben

Your benchmark assistant, written in Go.

1   66   66  

Okutama-Action

Okutama-Action: An Aerial View Video Dataset for Concurrent Human Acti...

7   66   66  

criterion-compare-action

⚡️📊 Compare the performance of Rust project branches

26   66   66  

GraphOmni

Enable Comprehensive LLM Evaluation on Graph Reasoning

2   66   66  

MEGA-Bench

This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluat...

5   66   66  

Impossible-Videos

ICML 2025 - Impossible Videos

6   66   66  

MultiMedEval

A Python tool to evaluate the performance of VLM on the medical domain...

3   66   66  

php-version-benchmarks

Official PHP benchmark suite

5   65   65  

MEDFAIR

[ICLR 2023 spotlight] MEDFAIR: Benchmarking Fairness for Medical Imagi...

13   65   65  

Revisiting-PLMs

Exploring Evolution-aware & free protein language models as protein fu...

10   64   64  

PythonProjectTemplate

Python project template with unit-tests, documentation, ci-testing and...

55   64   64  

ReDe

A Redis dehydrator module

11   64   64  

php-simple-benchmark-script

Очень простой скрипт тестирования быстродействия PHP | Very simple scr...

27   64   64  

MultiCorrupt

[IV2024] MultiCorrupt: A benchmark for robust multi-modal 3D object de...

6   64   64  

ESBench

Modern JavaScript benchmarking tool.

1   64   64  

CMExam

A Chinese National Medical Licensing Examination dataset and large lan...

8   64   64