VPS Fusion Monster Server Test GO Version Aiming to be the most comprehensive server testing project, implemented in Go with zero environment dependen...
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
A machine learning toolkit for log parsing [ICSE'19, DSN'16]
An objective comparison of multiple frameworks that allow us to "transform" our web apps to desktop applications.
benchmarks for implementation of servers which support 1 million connections
Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, netwo...
Tracking Any Point (TAP)
Efficient Retrieval Augmentation and Generation Framework
Benchmarking Knowledge Transfer in Lifelong Robot Learning
Reference implementations of MLPerf® training benchmarks
Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
Simple, fast, accurate single-header microbenchmarking functionality for C++11/14/17/20
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
Reference implementations of MLPerf® inference benchmarks
:bar_chart: Benchmark multiple object trackers (MOT) in Python
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
Rust Performance Profiler & Channels Monitoring Toolkit (TUI, MCP)
BEHAVIOR-1K: a platform for accelerating Embodied AI research. Join our Discord for support: https://discord.gg/bccR5vGFEx
pytest fixture for benchmarking code
C++20 μ(micro)/Unit Testing framework
Fast and simple benchmarking for Rust projects
Fast Compiler for C# Expression Trees and the lightweight LightExpression alternative. Diagnostic and code generation tools for the expressions.
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
SMAC: The StarCraft Multi-Agent Challenge
Latest Advances on System-2 Reasoning
[pip install medmnist] 18x Standardized Datasets for 2D and 3D Biomedical Image Classification
jsperf.com v2. https://github.com/h5bp/lazyweb-requests/issues/174
Microbenchmarking app for Swift with nice log-log plots
A better load generator for locust, written in golang.
GitHub Action for continuous benchmarking to keep performance
A compilation of Linux server benchmarking scripts.
Computational framework for reinforcement learning in traffic control
Application Performance Optimization Summary
LongBench v2 and LongBench (ACL 25'&24')
Tracking Benchmark for Correlation Filters
PDEBench: An Extensive Benchmark for Scientific Machine Learning
[NeurIPS '25] Knowledge Graph Generation from Any Text
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning
🚀 Fast prime number generator
Benchmark for vector databases.
τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains
lzbench is an in-memory benchmark of open-source compressors
NoSQL Redis and Memcache traffic generation and benchmarking tool.
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
PHP Framework Benchmark
Python Performance Benchmark Suite
Official Implement of "ADBench: Anomaly Detection Benchmark", NeurIPS 2022.
Airspeed Velocity: A simple Python benchmarking tool with web-based reporting