Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.
GAP Benchmark Suite
Inspect and Debug your Tauri applications in style 💃
Framework to evaluate peformance of ROS 2
Compression Benchmark
Gym Electric Motor (GEM): An OpenAI Gym Environment for Electric Motors
Benchmark your model on out-of-distribution datasets with carefully collected human comparison data (NeurIPS 2021 Oral)
RoNIN: Robust Neural Inertial Navigation in the Wild
AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility...
Benchmarking Legal Knowledge of Large Language Models
:twisted_rightwards_arrows: Convert small PNG images to SVG Tiny 1.2
STREAM, for lots of devices written in many programming models
Benchmarks for the Optimal Power Flow Problem
🛍 A real-world e-commerce dataset for session-based recommender systems research.
TPC-DS benchmark kit with some modifications/fixes
Framework of performance testing
Advanced benchmarks for +15 Go ORMs.
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
💥Performance testing tool (Go), It is also a GUI gRPC client.
Golang benchmarks used for optimizing code
Scientific machine learning (SciML) benchmarks, AI for science, and (differential) equation solvers. Covers Julia, Python (PyTorch, Jax), MATLAB, R
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper and tool list about applying neural networks to impute incomplete time...
Your defacto guide on monorepos, and in depth feature comparisons of tooling solutions.
Join Order Benchmark (JOB)
A lightweight, scalable, and general framework for visual question answering research
It's just a simple regex benchmark of different programming languages.
Framework for benchmarking vector search engines
Sysbench scripts to generate a tpcc-like workload for MySQL and PostgreSQL
Visually explore your JMH Benchmarks
Official repository for the ProteinGym benchmarks
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
Lanczos Network, Graph Neural Networks, Deep Graph Convolutional Networks, Deep Learning on Graph Structured Data, QM8 Quantum Chemistry Benchmark, IC...
A validation and profiling tool for AI infrastructure
A generic latency benchmarking library.
A node.js tool to benchmark APIs
A ground-truth fuzzing benchmark suite based on real programs with real bugs.
Comprehensive benchmarks of C++ maps
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
Turbo Base64 - Fastest Base64 SIMD:SSE/AVX2/AVX512/Neon/Altivec - Faster than memcpy!
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardw...
Open-Source Framework for Development, Simulation and Benchmarking of Behavior Planning Algorithms for Autonomous Driving
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora.
A simple command line tool to interact with hundreds of servers around the world.
Automated CIS Benchmark Compliance Remediation for RHEL 8 with Ansible
Distributed database benchmark tester
Official repository of the paper "HiFaceGAN: Face Renovation via Collaborative Suppression and Replenishment".
A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private conversations,...
Official code repository of < CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph >