Topic

benchmark

Repositories (1623)

open-stream-processing-benchmark
open-stream-processing-benchmark Klarrio Jupyter Notebook

This repository contains the code base for the Open Stream Processing Benchmark.

55
benchmark
benchmark cnlh Go

A simple benchmark testing tool implemented in Go, with a few small features

55
Science-T2I
Science-T2I Jialuo-Li Python

[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis

55
unity-netcode-benchmark
unity-netcode-benchmark StinkySteak C#

Unity Netcode/Network Benchmark Comparison. Fusion, Fishnet, Mirror, Mirage, Netick, NGO

55
DSBench
DSBench LiqiangJing Jupyter Notebook

DSBench: How Far are Data Science Agents from Becoming Data Science Experts?

55
Benchpress
Benchpress StartAutomating PowerShell

Easy Benchmarking with PowerShell

55
benchmarkstt
benchmarkstt ebu Python

Open Source AI Benchmarking toolkit for benchmarking speech to text services

55
text-style-transfer-benchmark
text-style-transfer-benchmark ykshi

Text style transfer benchmark

55
perfy
perfy onury JavaScript

A simple, lightweight NodeJS utility for measuring code execution time in high-resolution real time.

55
faster-php
faster-php devmount PHP

Testing different approaches to improve PHP script performance

55
NPB-CPP
NPB-CPP GMAP C++

The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures

55
Human-Benchmark
Human-Benchmark PrintN Dart

Human Benchmark is a Flutter app for Android with many tests to measure your abilities.

54
sorry-bench
sorry-bench SORRY-Bench Jupyter Notebook

Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal" (ICLR 2025)

54
LeakDB
LeakDB KIOS-Research Python

LeakDB (Leakage Diagnosis Benchmark) is a realistic leakage dataset for water distribution networks. The dataset is comprised of a large number of art...

54
objects-with-lighting
objects-with-lighting isl-org Python

Repository for the Objects With Lighting Dataset

54
cve-bench
cve-bench uiuc-kang-lab Python

CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities

54
tls-perf
tls-perf tempesta-tech C++

TLS handshake benchmarking tool

54
LFattNet
LFattNet LIAGM Python

Attention-based View Selection Networks for Light-field Disparity Estimation

54
fast_retraining
fast_retraining Azure Jupyter Notebook

Show how to perform fast retraining with LightGBM in different business cases

54
BenchmarkCI.jl
BenchmarkCI.jl tkf Julia
54
benchtable
benchtable izuzak JavaScript

Benchmark.js results in ASCII tables for NodeJS

54
benchmarksql
benchmarksql petergeoghegan Java

Unmaintained, prefer these BenchmarkSQL forks: https://github.com/wieck/benchmarksql and https://github.com/pgsql-io/benchmarksql

53
fanoutqa
fanoutqa zhudotexe Python

Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)

53
BenchmarkDotNetVisualizer
BenchmarkDotNetVisualizer mjebrahimi HTML

🌈 Visualizes your BenchmarkDotNet benchmarks to Colorful images and Feature-rich HTML (and maybe powerful charts in the future!)

53
SUES-200-Benchmark
SUES-200-Benchmark Reza-Zhu Python

SUES-200: A Multi-height Multi-scene Cross-view Image Benchmark Across Drone and Satellite

53
Hetero-Mark
Hetero-Mark NUCAR-DEV Jupyter Notebook

A Benchmark Suite for Heterogeneous System Computation

53
nvidia_libs_test
nvidia_libs_test google C++

Tests and benchmarks for cuDNN (and in the future, other NVIDIA libraries)

53
synthmark
synthmark google C++

Audio performance benchmark for jitter, theoretical latency, etc.

53
PHPench
PHPench mre PHP

Realtime benchmarks for PHP code

53
StreamBench
StreamBench lsds C++

Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark

53
mofdscribe
mofdscribe lamalab-org Python

An ecosystem for digital reticular chemistry

52
evolin
evolin prime-slam Python

Evaluation of Line Detection and Association

52
Portrait-Mode-Video
Portrait-Mode-Video bytedance Python

Video dataset dedicated to portrait-mode video recognition.

52
m1-cpu-benchmarks
m1-cpu-benchmarks tlkh Jupyter Notebook
52
issues
issues idrinth-api-bench

This is the issue repository for a TypeScript framework meant to performance-test anything even remotely REST-like and related tools

52
java-2-times-faster-than-c
java-2-times-faster-than-c xemantic Rust

An inquiry into nondogmatic software development. An experiment showing double performance of the code running on JVM compared to equivalent native C...

52
gosbench
gosbench mulbc Go

Distributed S3 benchmarking tool - Replacement of Cosbench

51
Robust-Gymnasium
Robust-Gymnasium SafeRL-Lab Python

[ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.

51
BirdSet
BirdSet DBD-research-group Jupyter Notebook

A benchmark dataset collection for bird sound classification

51
planetarium
planetarium BatsResearch Python

Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL

51
step_game
step_game lechmazur

Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLMs to engage i...

51
geotips
geotips kadyb HTML

Collection of tips for faster spatial data processing in R

51
Performance-Wars-Benchmarking-CSharp
Performance-Wars-Benchmarking-CSharp mjebrahimi C#

🔥Performance Wars Benchmarking C# - This repository contains a collection of C# benchmarks to compare the performance of different approaches to solv...

51
utils
utils gofiber Go

⚡ A collection of common functions with better performance, fewer allocations, and fewer dependencies, created for Fiber.

51
benchllama
benchllama srikanth235 Python

Benchmark your local LLMs.

51
nl2code-dataset
nl2code-dataset aixcoder-plugin Java

Aix-bench, the Java benchmark for the code synthesis problem.

51
bmi
bmi cbg-ethz Python

Mutual information estimators and benchmark

50
Advbench
Advbench thunlp Python

Code and data of the EMNLP 2022 paper "Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP".

50
OpenMEVA
OpenMEVA thu-coai Python

Benchmark for evaluating open-ended generation

50
rtb
rtb processone Erlang

Benchmarking tool to stress real-time protocols

50