An objective comparison of multiple frameworks that allow us to "transform" our web apps to desktop applications.
Reference implementations of MLPerf™ training benchmarks
Tracking Any Point (TAP)
Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
Efficient Retrieval Augmentation and Generation Framework
Simple, fast, accurate single-header microbenchmarking functionality for C++11/14/17/20
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
:bar_chart: Benchmark multiple object trackers (MOT) in Python
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
Reference implementations of MLPerf™ inference benchmarks
C++20 μ(micro)/Unit Testing framework
pytest fixture for benchmarking code
jsperf.com v2. https://github.com/h5bp/lazyweb-requests/issues/174
Fast Compiler for C# Expression Trees and the lightweight LightExpression alternative. Diagnostic and code generation tools for the expressions.
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Microbenchmarking app for Swift with nice log-log plots
Fast and simple benchmarking for Rust projects
A better load generator for locust, written in golang.
SMAC: The StarCraft Multi-Agent Challenge
[pip install medmnist] 18x Standardized Datasets for 2D and 3D Biomedical Image Classification
A compilation of Linux server benchmarking scripts.
Application Performance Optimization Summary
Tracking Benchmark for Correlation Filters
Computational framework for reinforcement learning in traffic control
GitHub Action for continuous benchmarking to keep performance
Latest Advances on System-2 Reasoning
PHP Framework Benchmark
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
🚀 Fast prime number generator
NoSQL Redis and Memcache traffic generation and benchmarking tool.
ClickBench: a Benchmark For Analytical Databases
lzbench is an in-memory benchmark of open-source compressors
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
Official Implement of "ADBench: Anomaly Detection Benchmark", NeurIPS 2022.
Monocular Depth Estimation Toolbox based on MMSegmentation.
Python Performance Benchmark Suite
A High Performance HTTP Server for Ruby
Airspeed Velocity: A simple Python benchmarking tool with web-based reporting
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning
[CVPR 25 Highlight & ECCV Workshop 24 Best Paper] RoboTwin Dual-arm Robot Manipulation Simulation Platform
Various gRPC benchmarks
PDEBench: An Extensive Benchmark for Scientific Machine Learning
LongBench v2 and LongBench (ACL 2024)
Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models
VPS benchmark script — based on the popular bench.sh, plus CPU and ioping tests, and dual-stack IPv4 and v6 speedtests by default
AoE (AI on Edge,终端智能,边缘计算) 是一个终端侧AI集成运行时环境 (IRE),帮助开发者提升效率。
Performance comparison of .NET IoC containers
Measure Amazon S3's performance from any location.