Command-line DNS benchmark
MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents
Logs performance benchmark repo: Comparing Elastic, Loki and SigNoz
List of Ruby Tools for doing Performance.
The benchmark to compare performance of PHP ORM solutions.
LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。
Featherlight benchmark framework, drop-in replacement for criterion and gauge.
WritingBench: A Comprehensive Benchmark for Generative Writing
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
基于Python Tornado的高性能http性能测试工具。Java Netty版: https://github.com/junneyang/http-benchmark-netty 。
Benchmark the init cost of Go packages
Record "perf" performance metrics for individual functions/regions of an ELF binary.
Learned Sort: a model-enhanced sorting algorithm
[IJRR2024] The official repository for the WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Natural Environments
Program to benchmark various speech recognition APIs
The first-ever vast natural language generation benchmark for Indonesian, Sundanese, and Javanese. We provide multiple downstream tasks, pre-trained I...
Quickly generate, start and analyze benchmarks for molecular dynamics simulations.
A benchmark framework based on Golang
🚀 A comprehensive performance comparison benchmark between different .NET collections.
Benchmark of the most commonly used http routers
Framework for benchmarking fully-managed vector databases
[NeurIPS 2023] A faithful benchmark for vision-language compositionality
:boom: Performance-focused HTTP load testing tool written in Go
run highly configurable benchmarks for EVM-based execution nodes over JSON-RPC
Unified Multi-modal IAA Baseline and Benchmark
[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
Code Efficiency Benchmark
Benchmark for some popular PHP Dependency Injection Containers.
[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?
Automated Benchmarking System for Vitess
Write benchmarks without the hassle.
A benchmark suite and tool to compare different implementations of the same primitives.
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
IPC benchmark on Linux
Benchmark scripts for TVM
[ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM
Lakehouse storage system benchmark
a http server benchmark tool written in rust 🦀
Modern C++ benchmarking
EvoEval: Evolving Coding Benchmarks via LLM
[Neurips 2024] A benchmark suite for autoregressive neural emulation of PDEs. (≥46 PDEs in 1D, 2D, 3D; Differentiable Physics; Unrolled Training; Roll...
[IJCAI-2021] Contrastive Model Inversion for Data-Free Knowledge Distillation
Benchmarks: write in Scala or JS, run in your browser. Live demo:
A Python and MATLAB implementation of mathematical test functions for benchmarking optimization algorithms.
The benchmark of ncnn that is a high-performance neural network inference framework optimized for the mobile platform
[ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
When storing a value in a Go interface allocates memory on the heap.
[AAAI 2021] (oral) Progressive One-shot Human Parsing, [TPAMI 2023] End-to-end One-shot Human Parsing
Cache benchmark for Golang
Web Components benchmark for a various Web Components technologies