A hello world benchmark for the available Rust Web Frameworks: hyper vs gotham vs actix-web vs warp vs rocket
[ICML 2023] Change is Hard: A Closer Look at Subpopulation Shift
The codes for recent knowledge distillation algorithms and benchmark results via TF2.0 low-level API
Fast Vulkan 3D Renderer Integrated with the Bevy Game Engine
SecCodeBench is a benchmark suite focusing on evaluating the security of code generated by large language models (LLMs).
The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mobile phone. Th...
benchyou is a benchmark tool for MySQL, real-time monitoring TPS and vmstat/iostat
Benchmark for generative image models
[ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.
Simple wrapper around iperf3 to measure network bandwidth from all nodes of a Kubernetes cluster
A GPU benchmark suite for assessing on-chip GPU memory bandwidth
Set of benchmarks for the Ruby programming language
[NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge
Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption
A simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.
Source code for EvalNE, a Python library for evaluating Network Embedding methods.
Memory-Dependent Manipulation Benchmark based on RoboTwin
Kaggle dogs vs cats solution in Caffe
update some video object detection papers (视频目标检测论文和代码整理)
The Peaks Consolidation is equipped with state-of-the-art algorithms and data structures that support high-performance databending exercises. It speci...
[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning
SustainDC is a set of Python environments for Data Center simulation and control using Heterogeneous Multi Agent Reinforcement Learning. Includes cust...
Websocket Client and Server for benchmarks with Millions of concurrent connections.
Benchmarks of popular contract implementations in solidity
Deepmark AI enables a unique testing environment for language models (LLM) assessment on task-specific metrics and on your own data so your GenAI-powe...
C++ implementations of data structures, algorithms, and system designs.
Benchmarks for bundlers and build tools, including Rspack, Rsbuild, webpack, Vite, Rolldown, esbuild, Parcel and Farm.
Evaluation Framework for Probabilistic Programming Languages
Python module for CEC 2017 single objective optimization test function suite.
A selection of ANSI C benchmarks and programs useful as benchmarks
A tool for benchmarking usage of Vault.
This is an open-source tool to assess and improve the trustworthiness of AI systems.
A scheduling and benchmark toolkit for Time-Sensitive Networking in Python
Awsome-Multi-modal-based PHM (基于多模态的故障诊断和预测)
XRAutomatedTests is where you can find functional, graphics, performance, and other types of automated tests for your XR Unity development.
A lightweight benchmark for approximate nearest neighbor search
mqtt压测工具。支持subscribe、publish压测方式,支持模拟客户端连接数。
Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks
Benchmarking framework for general purpose zero-knowledge proofs languages and libraries
[NeurlPS 2023] A Dataset and Benchmark for Pose-agnostic Anomaly Detection.
NEVIS'22: Benchmarking the next generation of never-ending learners
A High-Quality Photograpy Portrait Matting Benchmark
Fast & memory efficient hash tables for Java
Run unit tests with several test runners or benchmark inside real browsers with playwright and other Javascript runtimes.
⚡ Test speed and pings to all DigitalOcean, Linode, AWS, GCP, and Vultr regions
A manually vetted dataset for security vulnerability detection in Java projects
[Neurips 2024] A benchmark suite for autoregressive neural emulation of PDEs. (≥46 PDEs in 1D, 2D, 3D; Differentiable Physics; Unrolled Training; Roll...
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
🌟 [NeurIPS '25 Spotlight] Fair and transparent benchmark of machine learning interatomic potentials (MLIPs), beyond basic error metrics https://openr...