Topic

benchmark

Repositories (1623)

tasksource
tasksource sileod Python

Datasets collection and preprocessings framework for NLP extreme multitask learning

183
UBUNTU20-CIS
UBUNTU20-CIS ansible-lockdown YAML

Automated CIS Benchmark Compliance Remediation for Ubuntu 20 with Ansible

182
globalping-probe
globalping-probe jsdelivr TypeScript

The globalping probe code that runs on your hardware and connects to the global community network of probes

182
mlx-benchmark
mlx-benchmark TristanBilot Python

Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.

182
SiliconeCalculator
SiliconeCalculator erfansn Kotlin

🎨 Simple but attractive graphic a calculator built with Jetpack Compose

182
prometheus-benchmark
prometheus-benchmark VictoriaMetrics Go

Benchmark for Prometheus-compatible systems

181
gpu-rodinia
gpu-rodinia yuhc C

Rodinia benchmark

179
glow
glow turkaysoftware C#

System Analysis Software

178
small-object-detection-benchmark
small-object-detection-benchmark fcakyon Python

icip2022 paper: sahi benchmark on visdrone and xview datasets using fcos, vfnet and tood detectors

177
ConvolutionalNeuralOperator
ConvolutionalNeuralOperator camlab-ethz Python

This repository is the official implementation of the paper Convolutional Neural Operators for robust and accurate learning of PDEs

177
smartbugs-wild
smartbugs-wild smartbugs Python

This repository contains 47,398 smart contracts extracted from the Ethereum network.

176
k8s-security-policies
k8s-security-policies raspbernetes Open Policy Agent

This repository offers a comprehensive library of security policies designed to enhance the security of Kubernetes cluster configurations. The policie...

175
DenseMatchingBenchmark
DenseMatchingBenchmark DeepMotionAIResearch Python

Dense Matching Benchmark

175
Single-Image-Deraining
Single-Image-Deraining panda-lab

Single Image Deraining: A Comprehensive Benchmark Analysis

175
JSONBench
JSONBench ClickHouse Shell

JSONBench: a Benchmark For Data Analytics On JSON

174
freqbench
freqbench kdrag0n Python

Comprehensive CPU frequency performance/power benchmark

173
dnspyre
dnspyre Tantalor93 Go

CLI tool for a high QPS DNS benchmark

173
benchmarks
benchmarks catboost Jupyter Notebook

Comparison tools

172
json-benchmark
json-benchmark serde-rs C++

nativejson-benchmark in Rust

172
xFinder
xFinder IAAR-Shanghai Python

[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation

171
pytorch-retraining
pytorch-retraining ahirner Jupyter Notebook

Transfer Learning Shootout for PyTorch's model zoo (torchvision)

169
dlbench
dlbench hclhkbu Python

Benchmarking State-of-the-Art Deep Learning Software Tools

169
physics-IQ-benchmark
physics-IQ-benchmark google-deepmind Python

Benchmarking physical understanding in generative video models

168
p2plab
p2plab Netflix Go

performance benchmark infrastructure for IPLD DAGs

168
fast-crystal
fast-crystal icyleaf Crystal

💨 Writing Fast Crystal 😍 -- Collect Common Crystal idioms.

167
jsbenchmark
jsbenchmark jsbenchmark Vue

A straightforward JavaScript benchmarking tool and REPL with support for ES modules and libraries.

167
UHGEval
UHGEval IAAR-Shanghai Python

[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.

167
confabulations
confabulations lechmazur HTML

Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.

166
db-benchmarks
db-benchmarks db-benchmarks PHP

Fair database benchmarks framework and datasets

163
backup-bench
backup-bench deajan Shell

Quick and dirty backup tool benchmark with reproducible results

163
Large-Scale-Medical
Large-Scale-Medical Luffy03 Python

[CVPR 2024 Extension] 160K volumes (42M slices) datasets, 31M-1.2B pre-trained models, various pre-training recipes, 50+ downstream tasks implementati...

162
LLM-RGB
LLM-RGB babelcloud TypeScript

LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.

161
http-router
http-router sunrise-php PHP

A powerful solution as the foundation of your project.

160
storage
storage mlcommons Python

MLPerf® Storage Benchmark Suite

160
PointCloud-C
PointCloud-C ldkong1205 Python

Benchmarking and Analyzing Point Cloud Perception Robustness under Corruptions

159
OSWorld-G
OSWorld-G xlang-ai TypeScript

[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis

158
matbench
matbench materialsproject Python

Matbench: Benchmarks for materials science property prediction

158
PoseBench
PoseBench BioinfoMachineLearning Jupyter Notebook

Comprehensive benchmarking of protein-ligand structure prediction methods. (ICML 2024 AI4Science)

157
benchmark-driver
benchmark-driver benchmark-driver Ruby

Fully-featured benchmark driver for Ruby 3x3

156
python-benchmark-harness
python-benchmark-harness JoeyHendricks Python

A micro/macro benchmark framework for the Python programming language that helps with optimizing your software.

156
qpbenchmark
qpbenchmark qpsolvers Python

Benchmark for quadratic programming solvers available in Python

155
7guis
7guis 7guis JavaScript

7GUIs is a GUI programming usability benchmark.

154
BinKit
BinKit SoftSec-KAIST Shell

Binary Code Similarity Analysis (BCSA) Benchmark

153
zBench
zBench hendriknielaender Zig

📊 zig benchmark

152
sltbench
sltbench ivafanas C++

C++ benchmark tool. Practical, stable and fast performance testing framework.

151
gatling-dubbo
gatling-dubbo youzan Scala

A gatling plugin for running load tests on Apache Dubbo(https://github.com/apache/incubator-dubbo) and other java ecosystem.

151
face-occlusion-generation
face-occlusion-generation kennyvoo Python

[CVPRW 2022] Delving into High-Quality Synthetic Face Occlusion Segmentation Datasets

151
ChemLLMBench
ChemLLMBench ChemFoundationModels Jupyter Notebook

What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks

151
benchi
benchi ConduitIO Go

Benchmark any tool from the CLI

150
segment
segment houbb Java

The jieba-analysis tool for java.(基于结巴分词词库实现的更加灵活优雅易用,高性能的 java 分词实现。支持词性标注。)

150