Most popular benchmark repositories and open source projects

scalajs-benchmark japgolly Scala

Benchmarks: write in Scala or JS, run in your browser.

72 8 72

pdf-text-extraction-benchmark ckorzen TeX

A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF documents, es...

71 11 71

icor-codon-optimization Lattice-Automation Python

RNN-based Codon Optimization Tool. Publication: https://doi.org/10.1186/s12859-023-05246-8

71 15 71

sharkbench sharkbench Rust

Benchmarking programming languages and web frameworks.

71 22 71

benchpark llnl Python

An open collaborative repository for reproducible specifications of HPC benchmarks and cross site benchmarking environments

71 45 71

KoMT-Bench LG-AI-EXAONE Python

Official repository for KoMT-Bench built by LG AI Research

71 3 71

logbench rs Go

Golang logging library benchmarks

71 15 71

database shabados TypeScript

A digital representation of Sikh Bani and other Panthic texts with a public logbook of sangat-sourced corrections.

70 32 70

smart-beta-portfolio-optimization sanjeevai HTML

Built a smart beta portfolio and compared it to a benchmark index by calculating the tracking error. Built a portfolio using quadratic programming to...

70 28 70

zapbench google-research Python

The Zebrafish Activity Prediction Benchmark measures progress on the problem of predicting cellular-resolution neural activity throughout an entire ve...

70 13 70

rust-storage-bench marvin-j97 Rust

Benchmarking Rust key-value storage engines

70 12 70

BirdSet DBD-research-group Jupyter Notebook

A benchmark dataset collection for bird sound classification

70 22 70

generalization lechmazur

Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a small set of ex...

70 2 70

SRsurvey saeed-anwar

A Deep Journey into Super-resolution: A Survey, ACM Computing Surveys

70 7 70

crypto-bench briansmith Rust

Benchmarks for crypto libraries (in Rust, or with Rust bindings)

70 11 70

food-recognition-benchmark-starter-kit AIcrowd Jupyter Notebook

This repository is the main Food Recognition Benchmark template and Starter kit. Clone the repository to compete now!

69 42 69

scrolls tau-nlp Python

The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".

69 9 69

quic_vs_tcp Shenggan Python

A Survey and Benchmark of QUIC

69 9 69

physical-ai-bench SHI-Labs Python

[CVPR 2026 Oral] PAI-Bench: A Comprehensive Benchmark for Physical AI

69 4 69

MAVBench harvard-edge Python

Simulator + benchmark suite for Micro Aerial Vehicle design.

69 31 69

js-diff-benchmark luwes JavaScript

Simple benchmark for testing your DOM diffing algorithm.

69 7 69

PythonProjectTemplate franneck94 Python

Python project template with unit-tests, documentation, ci-testing and workflows.

69 57 69

ESBench ESBenchmark TypeScript

Modern JavaScript benchmarking tool.

69 2 69

TFM Tylemagne C#

Tyler's Frame Machine is a simple, free, educational, and portable tool for testing, benchmarking, comparison, and demonstration. TFM supports OpenGL,...

69 4 69

benchdiff WillAbides Go

68 2 68

criterion-compare-action boa-dev JavaScript

⚡️📊 Compare the performance of Rust project branches

68 30 68

CourtSI Visionary-Laboratory Python

Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports

68 0 68

All-Angles-Bench Chenyu-Wang567 Python

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs

68 3 68

ToMBench zhchen18 Python

ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.

68 7 68

golang-docker-cache montanaflynn Dockerfile

Improved docker Golang module dependency cache for faster builds.

68 3 68

hash-bench bp-alex Java

Java Hashing, CRC and Checksum Benchmark (JMH)

68 11 68

http-benchmarks orangy Kotlin

Benchmarks for common embedded Java and Kotlin web frameworks

68 10 68

php-version-benchmarks kocsismate Shell

Official PHP benchmark suite

67 5 67

DataGen HowieHwong Python

[ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models

67 4 67

conflictbank zhaochen0110 Python

Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and Benchmarks)

67 2 67

ChemBench shenwanxiang HTML

MoleculeNet benchmark dataset & MolMapNet dataset

66 18 66

GenExam OpenGVLab Python

GenExam: A Multidisciplinary Text-to-Image Exam

66 4 66

CALM The-FinAI Python

An LLM training and evaluation benchmark for credit scoring

66 12 66

planetarium BatsResearch Python

Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL

66 6 66

Filipino-Text-Benchmarks jcblaisecruz02 Python

Open-source benchmark datasets and pretrained transformer models in the Filipino language.

66 9 66

ben drish Go

Your benchmark assistant, written in Go.

66 1 66

KITAB-Bench mbzuai-oryx Python

[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

66 4 66

word-benchmarks vecto-ai

Benchmarks for intrinsic word embeddings evaluation.

66 24 66

untangle bmucsanyi Python

Large-scale uncertainty benchmark in deep learning.

65 7 65

syntherela martinjurkovic Python

A package for benchmarking synthetic relational data generation methods

65 1 65

GeoBench aim-uofa Python

A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.

65 2 65

AR-Touch erfansn Kotlin

🔮 Obtain the power of touchless interaction with display screens

65 7 65

NPB-CPP GMAP C++

The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures

65 26 65

ReDe TamarLabs C

A Redis dehydrator module

65 12 65

PedestrianActionBenchmark ykotseruba Python

Code and models for the WACV 2021 paper "Benchmark for evaluating pedestrian action prediction"

65 19 65

benchmark

Repositories (1763)