The fastest path to AI-powered full stack observability, even for lean teams.
A command-line benchmarking tool
An MNIST-like fashion product database. Benchmark :point_down:
Powerful .NET library for benchmarking
:metal: awesome-semantic-segmentation
Ohayou (おはよう), an HTTP load generator inspired by rakyll/hey, with TUI animation.
A microbenchmark support library
Apache JMeter: an open-source load testing tool for analyzing and measuring the performance of a variety of services
Source for the TechEmpower Framework Benchmarks project
OpenMMLab Pose Estimation Toolbox and Benchmark.
Which is the fastest web framework?
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100...
Scriptable database and system performance benchmark
VPS Fusion Monster Server Test Script (融合怪): a VPS server benchmarking project. The Go version, which has no environment dependencies, is recommended...
YABS - a simple bash script to estimate Linux server performance using fio, iperf3, & Geekbench
An elegant PyTorch deep reinforcement learning library.
Benchmarks of approximate nearest neighbor libraries in Python
dperf: High-Performance Network Load Testing Tool Based on DPDK
Statistics-driven benchmarking library for Rust
Across the Great Wall we can reach every corner in the world
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
Kodezi Chronos is a debugging-first language model that achieves state-of-the-art results on SWE-bench Lite (80.33%) and 67% real-world fix accuracy,...
SWE-bench: Can Language Models Resolve Real-world Github Issues?
Chinese Language Understanding Evaluation Benchmark (中文语言理解测评基准): datasets, baselines, pre-trained models, corpus and leaderboard
A tiny boost library in C++11.
Python package for the evaluation of odometry and SLAM
A series of large language models developed by Baichuan Intelligent Technology
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Visual Tracking Paper List
HTTP(S) benchmark tools, testing/debugging, & REST API (RESTful)
XcodeBenchmark measures the compilation time of a large codebase on iMac, MacBook, and Mac Pro
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
Up to 100x faster strings for C, C++, CUDA, Python, Rust, Swift, JS, & Go, leveraging NEON, AVX2, AVX-512, SVE, GPGPU, & SWAR to accelerate search, ha...
MTEB: Massive Text Embedding Benchmark
The Phoronix Test Suite: open-source, cross-platform automated testing/benchmarking software.
Prime number projects in 100+ programming languages, to compare their speed and their programmers' cleverness
A 13B large language model developed by Baichuan Intelligent Technology
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
A unified evaluation framework for large language models
Common Java web vulnerabilities and secure code, based on Spring Boot and Spring Security
Tsung is a high-performance benchmark framework for various protocols including HTTP, XMPP, LDAP, etc.
CPU-X is free software that gathers information on the CPU, motherboard, and more
benchmark tooling that loves you ❤️
🔎 A simple, tiny and lightweight benchmarking library!
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
RoboTwin 2.0 Official Repo
Cista is a simple, high-performance, zero-copy C++ serialization & reflection library.
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
:zap: Go web framework benchmark