Benchmarks: write in Scala or JS, run in your browser. Live demo:
A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF documents, es...
RNN-based Codon Optimization Tool. Publication: https://doi.org/10.1186/s12859-023-05246-8
Benchmarking programming languages and web frameworks.
An open collaborative repository for reproducible specifications of HPC benchmarks and cross site benchmarking environments
Official repository for KoMT-Bench built by LG AI Research
Golang logging library benchmarks
A digital representation of Sikh Bani and other Panthic texts with a public logbook of sangat-sourced corrections.
Built a smart beta portfolio and compared it to a benchmark index by calculating the tracking error. Built a portfolio using quadratic programming to...
The Zebrafish Activity Prediction Benchmark measures progress on the problem of predicting cellular-resolution neural activity throughout an entire ve...
Benchmarking Rust key-value storage engines
A benchmark dataset collection for bird sound classification
Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a small set of ex...
A Deep Journey into Super-resolution: A Survey, ACM Computing Surveys
Benchmarks for crypto libraries (in Rust, or with Rust bindings)
This repository is the main Food Recognition Benchmark template and Starter kit. Clone the repository to compete now!
The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".
A Survey and Benchmark of QUIC
[CVPR 2026 Oral] PAI-Bench: A Comprehensive Benchmark for Physical AI
Simulator + benchmark suite for Micro Aerial Vehicle design.
Simple benchmark for testing your DOM diffing algorithm.
Python project template with unit-tests, documentation, ci-testing and workflows.
Modern JavaScript benchmarking tool.
Tyler's Frame Machine is a simple, free, educational, and portable tool for testing, benchmarking, comparison, and demonstration. TFM supports OpenGL,...
⚡️📊 Compare the performance of Rust project branches
Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.
Improved docker Golang module dependency cache for faster builds.
Java Hashing, CRC and Checksum Benchmark (JMH)
Benchmarks for common embedded Java and Kotlin web frameworks
Official PHP benchmark suite
[ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models
Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and Benchmarks)
MoleculeNet benchmark dataset & MolMapNet dataset
GenExam: A Multidisciplinary Text-to-Image Exam
A LLM training and evaluation benchmark for credit scoring
Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL
Open-source benchmark datasets and pretrained transformer models in the Filipino language.
Your benchmark assistant, written in Go.
[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
Benchmarks for intrinsic word embeddings evaluation.
Large-scale uncertainty benchmark in deep learning.
A package for benchmarking synthetic relational data generation methods
A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.
🔮 Obtain the power of touchless interaction with display screens
The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures
A Redis dehydrator module
Code and models for the WACV 2021 paper "Benchmark for evaluating pedestrian action prediction"