Topic

benchmark

Repositories (1623)

secbench
secbench TQRG Python

🪐 A Database of Existing Security Vulnerabilities Patches to Enable Evaluation of Techniques (single-commit; multi-language)

40
7GUIs
7GUIs vangelov TypeScript
40
Scenario-Wise-Rec
Scenario-Wise-Rec Xiaopengli1 Python

Benchmark for Multi-Scenario-Recommendation.

40
build-tools-performance
build-tools-performance rspack-contrib JavaScript

Performance comparisons of bundlers and build tools, including Rspack, Rsbuild, webpack, Vite and Farm.

40
ocr-benchmark
ocr-benchmark video-db Python

Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments

40
DatabaseBenchmark
DatabaseBenchmark YuriyIvon C#

A universal database query benchmark tool

40
bench
bench golang-design Go

⏱️ Reliable performance measurement for Go programs. All in one design.

39
core-latency
core-latency ajakubek C++

A simple benchmark which measures latency between CPU cores.

39
catalyst-rl-framework
catalyst-rl-framework Scitator Python

Catalyst.RL: A Distributed Framework for Reproducible RL Research

39
validator-benchmark
validator-benchmark icebob JavaScript

JS validators benchmark

39
h5bench
h5bench hpc-io C

A benchmark suite for measuring HDF5 performance.

39
muld
muld ghomasHudson Python

The Multitask Long Document Benchmark

39
perftester
perftester nyggus Python

A lightweight Python package for performance testing of Python functions.

39
compression_benchmark
compression_benchmark mbhall88 Python

Benchmarking FASTQ compression with 'mature' compression algorithms

39
KITAB-Bench
KITAB-Bench mbzuai-oryx Python

[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

39
ExecutorBenchmark
ExecutorBenchmark mslinn Scala
38
javafilters-benchmarks
javafilters-benchmarks volkodavs Java

java filter benchmarks

38
crlmaze
crlmaze Pervasive-AI-Lab Python

Continual Reinforcement Learning in 3D Non-stationary Environments

38
snowman
snowman HPI-Information-Systems TypeScript

Welcome to Snowman App – a Data Matching Benchmark Platform.

38
TPCH-sqlite
TPCH-sqlite lovasoa Shell

SQLite TPCH database

38
dsr-benchmark
dsr-benchmark raphaelsulzer Python

[TPAMI 2024] A Survey and Benchmark for Automatic Surface Reconstruction from Point Clouds

38
scandinavian-embedding-benchmark
scandinavian-embedding-benchmark KennethEnevoldsen Python

A Scandinavian Benchmark for sentence embeddings

38
GHOSTS
GHOSTS friederrr

GHOSTS dataset

38
NodeBench
NodeBench LloydAsp Shell

vps聚合测试脚本,直接输出排版好的markdown格式,方便粘贴

38
MLLM-CompBench
MLLM-CompBench RaptorMai Jupyter Notebook

[NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs with 40K image pairs and questions across 8 dimensions of relative comparison...

38
circle-guard-bench
circle-guard-bench whitecircle-ai Python

First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and safeguards)

38
ColdRec
ColdRec YuanchenBei Python

ColdRec: An Open-Source Benchmark Toolbox for Cold-Start Recommendation.

38
All-Angles-Bench
All-Angles-Bench Chenyu-Wang567 Python

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs

38
rest-bench
rest-bench dotchev Lua

Compare simple REST server performance in Node.js and Go

37
Long-Map-Benchmarks
Long-Map-Benchmarks austinv11 Java

Benchmarking the best way to store long, Object value pairs in a map.

37
stringbench
stringbench almondtools Java

String matching algorithm benchmark

37
NAS-Bench-Macro
NAS-Bench-Macro xiusu Python

NAS Benchmark in "Prioritized Architecture Sampling with Monto-Carlo Tree Search", CVPR2021

37
2020a_SSH_mapping_NATL60
2020a_SSH_mapping_NATL60 ocean-data-challenges Jupyter Notebook

A challenge on the mapping of satellite altimeter sea surface height data organised by MEOM@IGE, Ocean-Next and CLS.

37
tensortrade
tensortrade StephanAkkerman Python

This repository contains my TensorTrade-focused code, including the core program and supplemental tools used in my bachelor's thesis on trading low ma...

37
Fair_Credit_Scoring
Fair_Credit_Scoring kozodoi Python

Fair ML in credit scoring: Assessment, implementation and profit implications

37
asreview-insights
asreview-insights asreview Python

Tools such as plots and metrics to analyze (simulated) reviews for ASReview LAB

37
NVMe-SSD-HDD-S.M.A.R.T-Monitoring
NVMe-SSD-HDD-S.M.A.R.T-Monitoring 0xDiSk Shell

🛸 NVMe / 🚀 SSD / 🖴 HDD S.M.A.R.T Monitoring. Site: https://diskcheck.monster

37
rust-storage-bench
rust-storage-bench marvin-j97 Rust

Benchmarking Rust storage engines

37
cssegmentation
cssegmentation SegmentationBLWX Python

CSSegmentation: An Open Source Continual Semantic Segmentation Toolbox Based on PyTorch.

37
WfCommons
WfCommons wfcommons Python

WfCommons: A Framework for Enabling Scientific Workflow Research and Development

37
segmentation-networks-benchmark
segmentation-networks-benchmark BloodAxe Python

Evaluation framework for testing segmentation networks in Keras

36
embedding_evaluation
embedding_evaluation EloiZ Python

Evaluate your word embeddings

36
horoscope
horoscope PingCAP-QE Go

horoscope is an optimizer inspector for DBMS.

36
pytest-patterns
pytest-patterns smarie Python

A couple of examples showing how pytest and its plugins can be combined to solve real-world needs.

36
DL-Hard
DL-Hard grill-lab

Deep Learning Hard (DL-HARD) is a new annotated dataset extending TREC Deep Learning benchmark.

36
embeddings
embeddings CLARIN-PL Python

Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language

36
ssd-vs-pm
ssd-vs-pm sfu-dis C++

Cost/performance analysis of index structures on SSD and persistent memory (CIDR 2022)

36
pa-bench
pa-bench pairwise-alignment Jupyter Notebook

Benchmarking pairwise aligners

36
AeroPath
AeroPath raidionics Jupyter Notebook

:hugs: AeroPath: An airway segmentation benchmark dataset with challenging pathology

36
PhyX
PhyX NastyMarcus Python

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

36