Topic

benchmark

Repositories (1763)

scalajs-benchmark
scalajs-benchmark japgolly Scala

Benchmarks: write in Scala or JS, run in your browser. Live demo:

72
pdf-text-extraction-benchmark
pdf-text-extraction-benchmark ckorzen TeX

A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF documents, es...

71
icor-codon-optimization
icor-codon-optimization Lattice-Automation Python

RNN-based Codon Optimization Tool. Publication: https://doi.org/10.1186/s12859-023-05246-8

71
sharkbench
sharkbench sharkbench Rust

Benchmarking programming languages and web frameworks.

71
benchpark
benchpark llnl Python

An open collaborative repository for reproducible specifications of HPC benchmarks and cross-site benchmarking environments

71
KoMT-Bench
KoMT-Bench LG-AI-EXAONE Python

Official repository for KoMT-Bench built by LG AI Research

71
logbench
logbench rs Go

Golang logging library benchmarks

71
database
database shabados TypeScript

A digital representation of Sikh Bani and other Panthic texts with a public logbook of sangat-sourced corrections.

70
smart-beta-portfolio-optimization
smart-beta-portfolio-optimization sanjeevai HTML

Built a smart beta portfolio and compared it to a benchmark index by calculating the tracking error. Built a portfolio using quadratic programming to...

70
zapbench
zapbench google-research Python

The Zebrafish Activity Prediction Benchmark measures progress on the problem of predicting cellular-resolution neural activity throughout an entire ve...

70
rust-storage-bench
rust-storage-bench marvin-j97 Rust

Benchmarking Rust key-value storage engines

70
BirdSet
BirdSet DBD-research-group Jupyter Notebook

A benchmark dataset collection for bird sound classification

70
generalization
generalization lechmazur

Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a small set of ex...

70
SRsurvey
SRsurvey saeed-anwar

A Deep Journey into Super-resolution: A Survey, ACM Computing Surveys

70
crypto-bench
crypto-bench briansmith Rust

Benchmarks for crypto libraries (in Rust, or with Rust bindings)

70
food-recognition-benchmark-starter-kit
food-recognition-benchmark-starter-kit AIcrowd Jupyter Notebook

This repository is the main Food Recognition Benchmark template and Starter kit. Clone the repository to compete now!

69
scrolls
scrolls tau-nlp Python

The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".

69
quic_vs_tcp
quic_vs_tcp Shenggan Python

A Survey and Benchmark of QUIC

69
physical-ai-bench
physical-ai-bench SHI-Labs Python

[CVPR 2026 Oral] PAI-Bench: A Comprehensive Benchmark for Physical AI

69
MAVBench
MAVBench harvard-edge Python

Simulator + benchmark suite for Micro Aerial Vehicle design.

69
js-diff-benchmark
js-diff-benchmark luwes JavaScript

Simple benchmark for testing your DOM diffing algorithm.

69
PythonProjectTemplate
PythonProjectTemplate franneck94 Python

Python project template with unit tests, documentation, CI testing, and workflows.

69
ESBench
ESBench ESBenchmark TypeScript

Modern JavaScript benchmarking tool.

69
TFM
TFM Tylemagne C#

Tyler's Frame Machine is a simple, free, educational, and portable tool for testing, benchmarking, comparison, and demonstration. TFM supports OpenGL,...

69
benchdiff
benchdiff WillAbides Go

68
criterion-compare-action
criterion-compare-action boa-dev JavaScript

⚡️📊 Compare the performance of Rust project branches

68
CourtSI
CourtSI Visionary-Laboratory Python

Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports

68
All-Angles-Bench
All-Angles-Bench Chenyu-Wang567 Python

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs

68
ToMBench
ToMBench zhchen18 Python

ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.

68
golang-docker-cache
golang-docker-cache montanaflynn Dockerfile

Improved docker Golang module dependency cache for faster builds.

68
hash-bench
hash-bench bp-alex Java

Java Hashing, CRC and Checksum Benchmark (JMH)

68
http-benchmarks
http-benchmarks orangy Kotlin

Benchmarks for common embedded Java and Kotlin web frameworks

68
php-version-benchmarks
php-version-benchmarks kocsismate Shell

Official PHP benchmark suite

67
DataGen
DataGen HowieHwong Python

[ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models

67
conflictbank
conflictbank zhaochen0110 Python

Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and Benchmarks)

67
ChemBench
ChemBench shenwanxiang HTML

MoleculeNet benchmark dataset & MolMapNet dataset

66
GenExam
GenExam OpenGVLab Python

GenExam: A Multidisciplinary Text-to-Image Exam

66
CALM
CALM The-FinAI Python

An LLM training and evaluation benchmark for credit scoring

66
planetarium
planetarium BatsResearch Python

Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL

66
Filipino-Text-Benchmarks
Filipino-Text-Benchmarks jcblaisecruz02 Python

Open-source benchmark datasets and pretrained transformer models in the Filipino language.

66
ben
ben drish Go

Your benchmark assistant, written in Go.

66
KITAB-Bench
KITAB-Bench mbzuai-oryx Python

[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

66
word-benchmarks
word-benchmarks vecto-ai

Benchmarks for intrinsic word embeddings evaluation.

66
untangle
untangle bmucsanyi Python

Large-scale uncertainty benchmark in deep learning.

65
syntherela
syntherela martinjurkovic Python

A package for benchmarking synthetic relational data generation methods

65
GeoBench
GeoBench aim-uofa Python

A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.

65
AR-Touch
AR-Touch erfansn Kotlin

🔮 Obtain the power of touchless interaction with display screens

65
NPB-CPP
NPB-CPP GMAP C++

The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures

65
ReDe
ReDe TamarLabs C

A Redis dehydrator module

65
PedestrianActionBenchmark
PedestrianActionBenchmark ykotseruba Python

Code and models for the WACV 2021 paper "Benchmark for evaluating pedestrian action prediction"

65