Topic

benchmark

Repositories (1763)

rust-web-frameworks-benchmark
rust-web-frameworks-benchmark rousan Rust

A hello world benchmark for the available Rust Web Frameworks: hyper vs gotham vs actix-web vs warp vs rocket

112
SubpopBench
SubpopBench YyzHarry Python

[ICML 2023] Change is Hard: A Closer Look at Subpopulation Shift

112
Knowledge_distillation_via_TF2.0
Knowledge_distillation_via_TF2.0 sseung0703 Python

Code for recent knowledge distillation algorithms and benchmark results via the TF2.0 low-level API

112
flo
flo wkwan Rust

Fast Vulkan 3D Renderer Integrated with the Bevy Game Engine

111
sec-code-bench
sec-code-bench alibaba Python

SecCodeBench is a benchmark suite focusing on evaluating the security of code generated by large language models (LLMs).

111
ORBIT-Dataset
ORBIT-Dataset microsoft Python

The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mobile phone. Th...

111
benchyou
benchyou xelabs Go

benchyou is a benchmark tool for MySQL, real-time monitoring TPS and vmstat/iostat

111
text2image-benchmark
text2image-benchmark boomb0om Jupyter Notebook

Benchmark for generative image models

111
OmniBenchmark
OmniBenchmark ZhangYuanhan-AI Python

[ECCV2022] New benchmark for evaluating pre-trained models; new supervised contrastive learning framework.

110
kubernetes-iperf3
kubernetes-iperf3 Pharb Shell

Simple wrapper around iperf3 to measure network bandwidth from all nodes of a Kubernetes cluster

110
gpumembench
gpumembench ekondis C++

A GPU benchmark suite for assessing on-chip GPU memory bandwidth

110
ruby-bench
ruby-bench ruby Ruby

Set of benchmarks for the Ruby programming language

109
Mind2Web-2
Mind2Web-2 OSU-NLP-Group Python

[NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge

109
pytorch-benchmark
pytorch-benchmark LukasHedegaard Python

Easily benchmark PyTorch model FLOPs, latency, throughput, allocated GPU memory and energy consumption

109
mini-nbody
mini-nbody harrism C

A simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.

108
EvalNE
EvalNE Dru-Mara Python

Source code for EvalNE, a Python library for evaluating Network Embedding methods.

107
RMBench
RMBench RoboTwin-Platform Python

Memory-Dependent Manipulation Benchmark based on RoboTwin

107
kaggle-dogs-vs-cats-caffe
kaggle-dogs-vs-cats-caffe mrgloom Python

Kaggle dogs vs cats solution in Caffe

107
video_object_detection_paper
video_object_detection_paper junliang230

Video object detection papers and code, collected and updated

106
peaks-consolidation
peaks-consolidation hkpeaks Go

The Peaks Consolidation is equipped with state-of-the-art algorithms and data structures that support high-performance databending exercises. It speci...

106
VisualNews-Repository
VisualNews-Repository FuxiaoLiu Jupyter Notebook

[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning

106
dc-rl
dc-rl HewlettPackard HTML

SustainDC is a set of Python environments for Data Center simulation and control using Heterogeneous Multi Agent Reinforcement Learning. Includes cust...

105
benchmark-websocket
benchmark-websocket oatpp C++

Websocket Client and Server for benchmarks with Millions of concurrent connections.

104
solidity-benchmarks
solidity-benchmarks alephao Solidity

Benchmarks of popular contract implementations in solidity

104
deepmark
deepmark IngestAI PHP

Deepmark AI enables a unique testing environment for language models (LLM) assessment on task-specific metrics and on your own data so your GenAI-powe...

104
tastylib
tastylib chuyangliu C++

C++ implementations of data structures, algorithms, and system designs.

104
build-tools-performance
build-tools-performance rstackjs JavaScript

Benchmarks for bundlers and build tools, including Rspack, Rsbuild, webpack, Vite, Rolldown, esbuild, Parcel and Farm.

104
pplbench
pplbench facebookresearch Python

Evaluation Framework for Probabilistic Programming Languages

104
cec2017-py
cec2017-py tilleyd Python

Python module for CEC 2017 single objective optimization test function suite.

103
ansibench
ansibench nfinit C

A selection of ANSI C benchmarks and programs useful as benchmarks

103
vault-benchmark
vault-benchmark hashicorp Go

A tool for benchmarking usage of Vault.

103
holisticai
holisticai holistic-ai Jupyter Notebook

This is an open-source tool to assess and improve the trustworthiness of AI systems.

103
tsnkit
tsnkit ChuanyuXue Python

A scheduling and benchmark toolkit for Time-Sensitive Networking in Python

103
Awsome-Multi-modal-based-PHM
Awsome-Multi-modal-based-PHM CHAOZHAO-1

Awsome-Multi-modal-based PHM (multimodal fault diagnosis and prognosis)

103
XRAutomatedTests
XRAutomatedTests Unity-Technologies C#

XRAutomatedTests is where you can find functional, graphics, performance, and other types of automated tests for your XR Unity development.

103
annbench
annbench matsui528 Python

A lightweight benchmark for approximate nearest neighbor search

103
mqtt-mock
mqtt-mock daoshenzzg Go

An MQTT load-testing tool. Supports subscribe and publish load-test modes and simulating a configurable number of client connections.

102
ddio-bench
ddio-bench aliireza Makefile

Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks

102
zk-Harness
zk-Harness zkCollective Python

Benchmarking framework for general-purpose zero-knowledge proof languages and libraries

102
PAD
PAD EricLee0224 Python

[NeurIPS 2023] A Dataset and Benchmark for Pose-agnostic Anomaly Detection.

102
dm_nevis
dm_nevis google-deepmind Python

NEVIS'22: Benchmarking the next generation of never-ending learners

102
PPM
PPM ZHKKKe

A High-Quality Photography Portrait Matting Benchmark

102
hash-smith
hash-smith bluuewhale Java

Fast & memory efficient hash tables for Java

102
playwright-test
playwright-test hugomrdias JavaScript

Run unit tests with several test runners, or benchmarks, inside real browsers with Playwright and other JavaScript runtimes.

102
datacenter-speed-tests
datacenter-speed-tests jakejarvis Shell

⚡ Test speed and pings to all DigitalOcean, Linode, AWS, GCP, and Vultr regions

101
WebAssembly-benchmark
WebAssembly-benchmark takahirox HTML
99
cwe-bench-java
cwe-bench-java iris-sast Python

A manually vetted dataset for security vulnerability detection in Java projects

99
apebench
apebench tum-pbs Python

[NeurIPS 2024] A benchmark suite for autoregressive neural emulation of PDEs. (≥46 PDEs in 1D, 2D, 3D; Differentiable Physics; Unrolled Training; Roll...

99
mPLUG-HalOwl
mPLUG-HalOwl X-PLUG Python

mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating

99
mlip-arena
mlip-arena atomind-ai Jupyter Notebook

🌟 [NeurIPS '25 Spotlight] Fair and transparent benchmark of machine learning interatomic potentials (MLIPs), beyond basic error metrics https://openr...

98