choosing FFT library...
The jieba-analysis tool for java.(基于结巴分词词库实现的更加灵活优雅易用,高性能的 java 分词实现。支持词性标注。)
[ICLR 2020] NAS evaluation is frustratingly hard
Automated CIS Benchmark Compliance Remediation for RHEL 9 with Ansible
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.
A tool to measure CPU core to core latency
Benchmarking Agentic LLM and VLM Reasoning On Games
FunctionBench : A Suite of Workloads for Serverless Cloud Function Service
Collection of hyperparameter optimization benchmark problems
Go-based framework for running benchmarks against Docker, containerd, runc, or any CRI-compliant runtime
Benchmarks compilation speeds of different combinations of languages and compilers.
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
C++ Mathematical Expression Parser Benchmark
Rust libraries and programs focused on succinct data structures
The OpenSSF CVE Benchmark consists of code and metadata for over 200 real life CVEs, as well as tooling to analyze the vulnerable codebases using a va...
Build your own Game-Engine based on the Entity Component System concept in Golang.
Easily download and evaluate pre-trained Visual Place Recognition methods. Code built for the ICCV 2023 paper "EigenPlaces: Training Viewpoint Robust...
PHP ORM Benchmark
Benchmark ClassEval for class-level code generation.
jsbench.me - JavaScript performance benchmarking playground
Uses FFmpeg to benchmark video encoders to compare VMAF, SSIM and PSNR with different encoder settings.
[🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering
Evaluating long-term memory of reinforcement learning algorithms
Benchmark of open source, embedded, memory-mapped, key-value stores available from Java (JMH)
High-precision and consistent benchmarking framework/harness for Rust
A simple C++ 03/11/etc timer class for ~microsecond-precision cross-platform benchmarking. The implementation is as limited and as simple as possible...
The Turing Change Point Detection Benchmark: An Extensive Benchmark Evaluation of Change Point Detection Algorithms on real-world data
Web-Bench is a benchmark designed to evaluate the performance of LLMs in actual Web development.
ICCV 2023, project page of the paper "DeepChange: A Long-term Person Re-identification Benchmark"
Automated CIS Benchmark Compliance Remediation for Windows Server 2019 with Ansible
Goku is an HTTP load testing application written in Rust
A micro-service reference test application for model extraction, cloud management, energy efficiency, power prediction, single- and multi-tier auto-sc...
Automatic download VPR datasets in a standard format
wake word engine benchmark framework
Go(lang) benchmarks - (measure the speed of golang)
NOT MAINTAINED ANYMORE! New project is located on https://github.com/mozilla-frontend-infra/js-perf-dashboard -- AreWeFastYet is a set of tools used f...
⚖️ ORM benchmarking for Node.js applications written in TypeScript
This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.org/abs/2404.12...
Evaluation of API and performance of different actor libraries
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.
Helper tool for manual Go code optimization.
benchmark of golang GraphQL framework.
Templated hierarchical spatial trees designed for high-peformance.
A powerful Node.js benchmark library
DocILE: Document Information Localization and Extraction Benchmark
You can find the most recent KGQA benchmark numbers from publications here.
A library to make benchmarks from PHP frameworks.