PIMeval simulator and PIMbench suite
[ACL 2026 Findings] Official code repo for the paper "LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark"
MIPHEI-ViT: Repository to train Image-to-Image H&E to Immunofluorescence models
VPS Speedtest for WordPress with 160 results: 🏆 UpCloud (raw memory and CPU benchmark)
Benchmarks for the RediSearch module
Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour under certain...
Linking Haxe to the role of a web server
Benchmarking Litestar vs other ASGI API framework
An benchmark for evaluating the capabilities of large vision-language models (LVLMs)
A Scandinavian Benchmark for sentence embeddings
A comprehensive local Linux Privilege-Escalation Benchmark
PQC-LEO is a comprehensive benchmarking and evaluation framework for Post-Quantum Cryptography (PQC), built for researchers. Automates the setup, test...
GameVerse: Can Vision-Language Models Learn from Video-based Reflection?
Pipeline to evaluate and validate the accuracy of variant calling methods in genomic research
Cloud WorkBench (CWB) is a web-based framework that is grounded on the notion of Infrastructure-as-Code (IaC) to foster simple definition, execution,...
A JMeter plugin supports load test gRPC
Marketing Mix Modeling Data Generator
Multi-dataset stance detection and robustness experiments
CoreMark 1.0 ported to WebAssembly
A micro-benchmark framework to use with cargo bench
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL
[NeurIPS25 D&B Spotlight] A tile-level histopathology image understanding benchmark
[Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions
ML4CO-Bench-101: Benchmark Machine Learning for Classic Combinatorial Problems on Graphs.
Benchmarks of Spring Boot REST service comparing Java 21 Virtual Threads (Project Loom) with WebFlux (Project Reactor).
MCPToolBench++ MCP Model Context Protocol Tool Use Benchmark on AI Agent and Model Tool Use Ability
Benchmark and self-optimize SDK/CLI/MCP guidance so every agent model can use your tool reliably.
ORM Benchmarking
Provides a set of nodes to enable an extendable design pattern for flows.
ROP Benchmark is a tool to compare ROP compilers
Fuzzle: Making a Puzzle for Fuzzers (ASE'22)
Track Go benchmark performance over time by storing results in InfluxDB
Crowd sourced benchmarking
Extended version of the Berkeley Segmentation Benchmark [1] used for evaluation in [2].
:hammer: Make the exact performance measurements of the public methods for public classes using this NuGet Package with fluent interface. Requires .Ne...
Performance Evaluation of SHA-256 using SHA New Instructions.
E-commerce search benchmark is the first end-to-end application benchmark for e-commerce search system with personalized recommendations.This work is...
Bash script for comparing NPM and Yarn performance
LLM 100k portfolio management benchmark
WFCommons: A Framework for Enabling Scientific Workflow Research and Development
基于Java Netty的HTTP客户端工具 & HTTP高性能测试工具。参数灵活定制、支持邮件报表等。Python Tornado版: https://github.com/junneyang/http-benchmark-torna...
A Benchmark of Real-world Image Dataset for Federated Learning
🗂 Graph Learning Indexer: a contributor-friendly and metadata-rich platform for graph learning benchmarks. Dataloading, Benchmarking, Tagging, and mor...
Image compression codecs benchmark inspired by Google's "Full Resolution Image Compression with Recurrent Neural Networks"
CentOS Bench for Security is a script that implements checks which follows the CIS CentOS Linux 7 Benchmark.
A R framework for pipeline benchmarking, with application to single-cell RNAseq
This repo contains a bunch of crude benchmark tests to test the performance of MySQL queries with UUIDs in various scenarios
:bar_chart: A visual chart for Java MicroBenchmark Harness.