Topic

benchmark

Repositories (1623)

benchmarking-fft
benchmarking-fft project-gemmi C++

choosing FFT library...

150
segment
segment houbb Java

The jieba-analysis tool for java.(基于结巴分词词库实现的更加灵活优雅易用,高性能的 java 分词实现。支持词性标注。)

150
NAS-Benchmark
NAS-Benchmark antoyang Python

[ICLR 2020] NAS evaluation is frustratingly hard

149
RHEL9-CIS
RHEL9-CIS ansible-lockdown YAML

Automated CIS Benchmark Compliance Remediation for RHEL 9 with Ansible

149
mqperf
mqperf softwaremill Scala
148
TurtleBench
TurtleBench mazzzystar Jupyter Notebook

TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.

148
c2clat
c2clat rigtorp C++

A tool to measure CPU core to core latency

148
BALROG
BALROG balrog-ai Python

Benchmarking Agentic LLM and VLM Reasoning On Games

147
serverless-faas-workbench
serverless-faas-workbench ddps-lab Python

FunctionBench : A Suite of Workloads for Serverless Cloud Function Service

147
HPOBench
HPOBench automl Python

Collection of hyperparameter optimization benchmark problems

147
bucketbench
bucketbench estesp Go

Go-based framework for running benchmarks against Docker, containerd, runc, or any CRI-compliant runtime

146
compiler-benchmark
compiler-benchmark nordlow Python

Benchmarks compilation speeds of different combinations of languages and compilers.

146
MMTrustEval
MMTrustEval thu-ml Python

A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)

146
math-parser-benchmark-project
math-parser-benchmark-project ArashPartow C++

C++ Mathematical Expression Parser Benchmark

145
bsuccinct-rs
bsuccinct-rs beling Rust

Rust libraries and programs focused on succinct data structures

144
ossf-cve-benchmark
ossf-cve-benchmark ossf-cve-benchmark TypeScript

The OpenSSF CVE Benchmark consists of code and metadata for over 200 real life CVEs, as well as tooling to analyze the vulnerable codebases using a va...

144
ecs
ecs andygeiss Go

Build your own Game-Engine based on the Entity Component System concept in Golang.

144
VPR-methods-evaluation
VPR-methods-evaluation gmberton Python

Easily download and evaluate pre-trained Visual Place Recognition methods. Code built for the ICCV 2023 paper "EigenPlaces: Training Viewpoint Robust...

144
php-orm-benchmark
php-orm-benchmark kenjis PHP

PHP ORM Benchmark

143
ClassEval
ClassEval FudanSELab Python

Benchmark ClassEval for class-level code generation.

143
jsbench-me
jsbench-me psiho

jsbench.me - JavaScript performance benchmarking playground

143
video-quality-metrics
video-quality-metrics CrypticSignal Python

Uses FFmpeg to benchmark video encoders to compare VMAF, SSIM and PSNR with different encoder settings.

143
MMToM-QA
MMToM-QA chuanyangjin Python

[🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering

143
memory-maze
memory-maze jurgisp Python

Evaluating long-term memory of reinforcement learning algorithms

142
benchmarks
benchmarks lmdbjava Java

Benchmark of open source, embedded, memory-mapped, key-value stores available from Java (JMH)

141
iai-callgrind
iai-callgrind iai-callgrind Rust

High-precision and consistent benchmarking framework/harness for Rust

141
plf_nanotimer
plf_nanotimer mattreecebentley C++

A simple C++ 03/11/etc timer class for ~microsecond-precision cross-platform benchmarking. The implementation is as limited and as simple as possible...

141
TCPDBench
TCPDBench alan-turing-institute

The Turing Change Point Detection Benchmark: An Extensive Benchmark Evaluation of Change Point Detection Algorithms on real-world data

141
web-bench
web-bench bytedance JavaScript

Web-Bench is a benchmark designed to evaluate the performance of LLMs in actual Web development.

139
service-mesh-benchmark
service-mesh-benchmark kinvolk Shell
139
deepchange
deepchange PengBoXiangShang

ICCV 2023, project page of the paper "DeepChange: A Long-term Person Re-identification Benchmark"

139
Windows-2019-CIS
Windows-2019-CIS ansible-lockdown YAML

Automated CIS Benchmark Compliance Remediation for Windows Server 2019 with Ansible

139
goku
goku jcaromiq Rust

Goku is an HTTP load testing application written in Rust

139
TeaStore
TeaStore DescartesResearch Java

A micro-service reference test application for model extraction, cloud management, energy efficiency, power prediction, single- and multi-tier auto-sc...

138
VPR-datasets-downloader
VPR-datasets-downloader gmberton Python

Automatic download VPR datasets in a standard format

138
wake-word-benchmark
wake-word-benchmark Picovoice Python

wake word engine benchmark framework

137
golang-benchmarks
golang-benchmarks SimonWaldherr Go

Go(lang) benchmarks - (measure the speed of golang)

136
arewefastyet
arewefastyet mozilla JavaScript

NOT MAINTAINED ANYMORE! New project is located on https://github.com/mozilla-frontend-infra/js-perf-dashboard -- AreWeFastYet is a set of tools used f...

135
typescript-orm-benchmark
typescript-orm-benchmark emanuelcasco TypeScript

⚖️ ORM benchmarking for Node.js applications written in TypeScript

135
BLINK_Benchmark
BLINK_Benchmark zeyofu Python

This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.org/abs/2404.12...

134
actors
actors plokhotnyuk Scala

Evaluation of API and performance of different actor libraries

133
BRIGHT
BRIGHT xlang-ai Python

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

133
Shot2Story
Shot2Story bytedance Python

A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.

132
go-perftuner
go-perftuner go-perf Go

Helper tool for manual Go code optimization.

132
golang-graphql-benchmark
golang-graphql-benchmark appleboy Go

benchmark of golang GraphQL framework.

130
THST
THST tuxalin C++

Templated hierarchical spatial trees designed for high-peformance.

129
bench-node
bench-node RafaelGSS JavaScript

A powerful Node.js benchmark library

129
docile
docile rossumai Python

DocILE: Document Information Localization and Extraction Benchmark

129
leaderboard
leaderboard KGQA Jupyter Notebook

You can find the most recent KGQA benchmark numbers from publications here.

128
PHP-Frameworks-Bench
PHP-Frameworks-Bench myaaghubi PHP

A library to make benchmarks from PHP frameworks.

128