Topic

benchmark

Repositories (1623)

web-to-desktop-framework-comparison
web-to-desktop-framework-comparison Elanis JavaScript

An objective comparison of multiple frameworks that allow us to "transform" our web apps to desktop applications.

1.7k
training
training mlcommons Python

Reference implementations of MLPerf™ training benchmarks

1.7k
tapnet
tapnet google-deepmind Jupyter Notebook

Tracking Any Point (TAP)

1.6k
evalplus
evalplus evalplus Python

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

1.6k
fastRAG
fastRAG IntelLabs Python

Efficient Retrieval Augmentation and Generation Framework

1.6k
nanobench
nanobench martinus C++

Simple, fast, accurate single-header microbenchmarking functionality for C++11/14/17/20

1.6k
LLM-eval-survey
LLM-eval-survey MLGroupJLU

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

1.5k
Awesome-LLM-Long-Context-Modeling
Awesome-LLM-Long-Context-Modeling Xnhyacinth

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1.5k
py-motmetrics
py-motmetrics cheind Python

:bar_chart: Benchmark multiple object trackers (MOT) in Python

1.5k
llm-colosseum
llm-colosseum OpenGenerativeAI Jupyter Notebook

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

1.4k
inference
inference mlcommons Python

Reference implementations of MLPerf™ inference benchmarks

1.4k
ut
ut boost-ext C++

C++20 μ(micro)/Unit Testing framework

1.4k
pytest-benchmark
pytest-benchmark ionelmc Python

pytest fixture for benchmarking code

1.4k
jsperf.com
jsperf.com jsperf JavaScript

jsperf.com v2. https://github.com/h5bp/lazyweb-requests/issues/174

1.3k
FastExpressionCompiler
FastExpressionCompiler dadhi C#

Fast Compiler for C# Expression Trees and the lightweight LightExpression alternative. Diagnostic and code generation tools for the expressions.

1.3k
SLM-Lab
SLM-Lab kengz Python

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

1.3k
Attabench
Attabench attaswift Swift

Microbenchmarking app for Swift with nice log-log plots

1.3k
divan
divan nvzqz Rust

Fast and simple benchmarking for Rust projects

1.2k
boomer
boomer myzhan Go

A better load generator for locust, written in golang.

1.2k
smac
smac oxwhirl Python

SMAC: The StarCraft Multi-Agent Challenge

1.2k
MedMNIST
MedMNIST MedMNIST Python

[pip install medmnist] 18x Standardized Datasets for 2D and 3D Biomedical Image Classification

1.2k
bench-scripts
bench-scripts haydenjames

A compilation of Linux server benchmarking scripts.

1.2k
appdocs
appdocs sjtuhjh

Application Performance Optimization Summary

1.2k
TBCF
TBCF Escheee Objective-C

Tracking Benchmark for Correlation Filters

1.1k
flow
flow flow-project Python

Computational framework for reinforcement learning in traffic control

1.1k
github-action-benchmark
github-action-benchmark benchmark-action TypeScript

GitHub Action for continuous benchmarking to keep performance

1.1k
Awesome-System2-Reasoning-LLM
Awesome-System2-Reasoning-LLM zzli2022 Python

Latest Advances on System-2 Reasoning

1.1k
php-framework-benchmark
php-framework-benchmark kenjis PHP

PHP Framework Benchmark

1k
VBench
VBench Vchitect Python

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

1k
primesieve
primesieve kimwalisch C++

🚀 Fast prime number generator

1k
memtier_benchmark
memtier_benchmark RedisLabs C++

NoSQL Redis and Memcache traffic generation and benchmarking tool.

1k
ClickBench
ClickBench ClickHouse HTML

ClickBench: a Benchmark For Analytical Databases

981
lzbench
lzbench inikep C

lzbench is an in-memory benchmark of open-source compressors

962
omnisafe
omnisafe PKU-Alignment Python

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

958
benchmark
benchmark pytorch Python

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

947
ADBench
ADBench Minqi824 Python

Official Implement of "ADBench: Anomaly Detection Benchmark", NeurIPS 2022.

940
Monocular-Depth-Estimation-Toolbox
Monocular-Depth-Estimation-Toolbox zhyever Python

Monocular Depth Estimation Toolbox based on MMSegmentation.

938
pyperformance
pyperformance python Python

Python Performance Benchmark Suite

937
agoo
agoo ohler55 C

A High Performance HTTP Server for Ruby

927
asv
asv airspeed-velocity Python

Airspeed Velocity: A simple Python benchmarking tool with web-based reporting

927
OpenSTL
OpenSTL chengtan9907 Python

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

925
RoboTwin
RoboTwin TianxingChen Python

[CVPR 25 Highlight & ECCV Workshop 24 Best Paper] RoboTwin Dual-arm Robot Manipulation Simulation Platform

924
grpc_bench
grpc_bench LesnyRumcajs Dockerfile

Various gRPC benchmarks

917
PDEBench
PDEBench pdebench Python

PDEBench: An Extensive Benchmark for Scientific Machine Learning

907
LongBench
LongBench THUDM Python

LongBench v2 and LongBench (ACL 2024)

892
moses
moses molecularsets Python

Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models

890
nench
nench n-st Shell

VPS benchmark script — based on the popular bench.sh, plus CPU and ioping tests, and dual-stack IPv4 and v6 speedtests by default

888
AoE
AoE didi C++

AoE (AI on Edge,终端智能,边缘计算) 是一个终端侧AI集成运行时环境 (IRE),帮助开发者提升效率。

887
IocPerformance
IocPerformance danielpalme C#

Performance comparison of .NET IoC containers

884
s3-benchmark
s3-benchmark dvassallo Go

Measure Amazon S3's performance from any location.

881