Most popular benchmark repositories and open source projects

Celero

C++ Benchmark Authoring Library/Framework

98   847   847  

mimic3-benchmarks

Python suite to construct benchmark machine learning datasets from the...

338   837   837  

huststore

High-performance Distributed Storage

174   831   831  

blazehttp

BlazeHTTP is a simple, easy-to-use tool for testing WAF protection effectiveness. BlazeHTTP stands as a...

95   823   823  

human-learn

Natural Intelligence is still a pretty good idea.

55   814   814  

ecs

VPS Fusion Monster server benchmarking project, Go version. VPS Fusion Monster Server Test GO Versi...

53   793   793  

sbt-jmh

"Trust no one, bench everything." - sbt plugin for JMH (Java Microbenc...

87   790   790  

CBLUE

[CBLUE1] Chinese medical information processing benchmark CBLUE: A Chinese Biomedical Language Unde...

132   787   787  

meta-dataset

A dataset of datasets for learning to learn from few examples

142   787   787  

typescript-runtime-type-benchmarks

📊 Benchmark Comparison of Packages with Runtime Validation and TypeSc...

82   777   777  

opencv_zoo

Model Zoo For OpenCV DNN and Benchmarks.

240   777   777  

WeatherBench

A benchmark dataset for data-driven weather forecasting

172   762   762  

caffenet-benchmark

Evaluation of the CNN design choices performance on ImageNet-2012.

152   742   742  

bencher

🐰 Bencher - Continuous Benchmarking

33   739   739  

r3f-perf

Easily monitor your ThreeJS performance.

31   720   720  

py-frameworks-bench

Another benchmark for some python frameworks

87   720   720  

Programming-Language-Benchmarks

Yet another implementation of computer language benchmarks game

149   719   719  

robustbench

RobustBench: a standardized adversarial robustness benchmark [NeurIPS...

100   714   714  

microservices-framework-benchmark

Raw benchmarks on throughput, latency and transfer of Hello World on p...

126   710   710  

nvbench

CUDA Kernel Benchmarking Library

87   706   706  

tape

Tasks Assessing Protein Embeddings (TAPE), a set of five biologically...

132   697   697  

HammerDB

HammerDB Database Load Testing and Benchmarking Tool

139   689   689  

ffi-overhead

Comparing the C FFI (foreign function interface) overhead on various p...

41   675   675  

caliper

A blockchain benchmark framework to measure performance of multiple b...

407   674   674  

PointTinyBenchmark

Point based and tiny object detection and localization code set of UC...

79   671   671  

http_bench

Go HTTP stress testing tool, supports single and distributed, http/...

31   653   653  

openmixup

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Lear...

59   651   651  

BenchmarkTools.jl

A benchmarking framework for the Julia language

101   643   643  

warp

S3 benchmarking tool

132   642   642  

datasets

A repository of pretty cool datasets that I collected for network scie...

84   633   633  

long-form-factuality

Benchmarking long-form factuality in large language models. Original c...

76   631   631  

CrossPlatformDiskTest

Windows, macOS and Android storage (HDD, SSD, RAM) speed testing/perfo...

41   626   626  

indonlu

The first-ever vast natural language processing benchmark for Indonesi...

204   612   612  

rspec-benchmark

Performance testing matchers for RSpec

21   609   609  

kotlinx-benchmark

Kotlin multiplatform benchmarking toolkit

43   605   605  

TextClassificationBenchmark

A Benchmark of Text Classification in PyTorch

137   602   602  

NIID-Bench

Federated Learning Benchmark - Federated Learning on Non-IID Data Silo...

122   598   598  

chillout

Reduce CPU usage by non-blocking async loop and psychologically speed...

19   596   596  

rl4co

A PyTorch library for all things Reinforcement Learning (RL) for Combi...

107   593   593  

TrustLLM

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

58   589   589  

LIBERO

Benchmarking Knowledge Transfer in Lifelong Robot Learning

101   582   582  

KLUE

📖 Korean NLU Benchmark

56   574   574  

DeeperForensics-1.0

[CVPR 2020] A Large-Scale Dataset for Real-World Face Forgery Detectio...

72   552   552  

completely-unscientific-benchmarks

Naive performance comparison of a few programming languages (JavaScrip...

69   550   550  

NBench

Performance benchmarking and testing framework for .NET applications :...

47   540   540  

rewrk

A more modern http framework benchmarker supporting HTTP/1 and HTTP/2...

41   539   539  

Awesome-LLM-Eval

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos,...

44   538   538  

SensatUrban

🔥Urban-scale point cloud dataset (CVPR 2021 & IJCV 2022)

57   535   535  

rpc-benchmark

Java RPC benchmark, inspired by https://www.techempower.com/benchmarks/

124   527   527  

CValues

Research on evaluating and aligning the values of Chinese large language models

20   519   519