Topic

benchmark

Repositories (1763)

benchmarks-of-javascript-package-managers
benchmarks-of-javascript-package-managers pnpm JavaScript

Benchmarks of JavaScript Package Managers

433
Awesome-Evaluation-of-Visual-Generation
Awesome-Evaluation-of-Visual-Generation ziqihuangg

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

430
srs-bench
srs-bench ossrs Go

SB (SRS Bench) is a set of benchmark and regression test tools for SRS and other media servers, supporting HTTP-FLV, RTMP, HLS, WebRTC and GB28181.

430
package-benchmark
package-benchmark ordo-one Swift

Swift benchmark runner with many performance metrics and great CI support

429
LawBench
LawBench open-compass Python

Benchmarking Legal Knowledge of Large Language Models

426
ChineseBLUE
ChineseBLUE alibaba-research Python

Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)

423
BlurTestAndroid
BlurTestAndroid patrickfav Java

This is a simple App to test some blur algorithms on their visual quality and performance.

422
DynamicMap_Benchmark
DynamicMap_Benchmark KTH-RPL Jupyter Notebook

The First Dynamic Map Removal Benchmark | Includes 8 SOTA methods | Continuously updated

422
ProteinGym
ProteinGym OATML-Markslab HTML

Official repository for the ProteinGym benchmarks

420
Awesome_Imputation
Awesome_Imputation WenjieDu Python

Awesome Deep Learning for Time-Series Imputation, including an unmissable paper and tool list about applying neural networks to impute incomplete time...

418
oltpbench
oltpbench oltpbenchmark Java

Database Benchmarking Framework

417
DG-PHM
DG-PHM CHAOZHAO-1

This is a repository that includes papers, code, and datasets on domain generalization-based fault diagnosis and prognosis. (Domain generalization-based fault diagnosis and...

415
FedScale
FedScale SymbioticLab Python

FedScale is a scalable and extensible open-source federated learning (FL) platform.

415
pyaf
pyaf antoinecarme Python

PyAF is an Open Source Python library for Automatic Time Series Forecasting built on top of popular pydata modules.

414
superpixel-benchmark
superpixel-benchmark davidstutz C++

An extensive evaluation and comparison of 28 state-of-the-art superpixel algorithms on 5 datasets.

413
mcpmark
mcpmark eval-sys Python

MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.

413
devtools
devtools crabnebula-dev TypeScript

Inspect and Debug your Tauri applications in style 💃

412
modclean
modclean ModClean JavaScript

Remove unwanted files and directories from your node_modules folder

409
glow
glow turkaysoftware C#

Advanced System Analysis Software

409
ronin
ronin Sachini Python

RoNIN: Robust Neural Inertial Navigation in the Wild

405
jetson_benchmarks
jetson_benchmarks NVIDIA-AI-IOT Python

Jetson Benchmark

401
gym-electric-motor
gym-electric-motor upb-lea Python

Gym Electric Motor (GEM): An OpenAI Gym Environment for Electric Motors

401
Craftax
Craftax MichaelTMatthews Python

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

398
pglib-opf
pglib-opf power-grid-lib MATLAB

Benchmarks for the Optimal Power Flow Problem

397
GraphRAG-Benchmark
GraphRAG-Benchmark GraphRAG-Bench Python

The official repo of GraphRAG-Bench for evaluating GraphRAG models. "When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented...

397
ros2-performance
ros2-performance irobot-ros C++

Framework to evaluate performance of ROS 2

396
are-we-fast-yet
are-we-fast-yet smarr Java

Are We Fast Yet? Comparing Language Implementations with Objects, Closures, and Arrays

394
vtebench
vtebench alacritty Rust

Generate benchmarks for terminal emulators

393
SKAB
SKAB waico Jupyter Notebook

SKAB - Skoltech Anomaly Benchmark. Time-series data for evaluating Anomaly Detection algorithms.

393
cob
cob knqyf263 Go

Continuous Benchmark for Go Project

390
dance
dance OmicsML Python

DANCE: a deep learning library and benchmark platform for single-cell analysis

390
gapbs
gapbs sbeamer C++

GAP Benchmark Suite

387
CSS-IN-JS-Benchmarks
CSS-IN-JS-Benchmarks A-gambit JavaScript
383
InfiniteBench
InfiniteBench OpenBMB Python

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

382
EasyCompressor
EasyCompressor mjebrahimi C#

⚡ An Easy-to-Use and Optimized compression library for .NET that unifies several compression algorithms including LZ4, Snappy, Zstd, LZMA, Brotli, GZi...

382
recsys-dataset
recsys-dataset otto-de Python

🛍 A real-world e-commerce dataset for session-based recommender systems research.

379
CRUD_RAG
CRUD_RAG IAAR-Shanghai Python

CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models

376
Face-landmarks-detection-benchmark
Face-landmarks-detection-benchmark mrgloom

Face landmarks (fiducial points) detection benchmark

372
MOSE-api
MOSE-api henghuiding Python

[ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

372
superbenchmark
superbenchmark microsoft Python

A validation and profiling tool for AI infrastructure

370
RGBD-SODsurvey
RGBD-SODsurvey taozh2017 MATLAB

RGB-D Salient Object Detection: A Survey

370
puck
puck baidu Jupyter Notebook

Puck is a high-performance ANN search engine

368
TurboBench
TurboBench powturbo C

Compression Benchmark

363
BabelStream
BabelStream UoB-HPC C++

STREAM, for lots of devices written in many programming models

362
Awesome_Satellite_Benchmark_Datasets
Awesome_Satellite_Benchmark_Datasets Seyed-Ali-Ahmadi Jupyter Notebook

Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.

361
tpcds-kit
tpcds-kit gregrahn C

TPC-DS benchmark kit with some modifications/fixes

360
vector-db-benchmark
vector-db-benchmark qdrant Python

Framework for benchmarking vector search engines

360
ollama-benchmark
ollama-benchmark aidatatools Python

LLM Benchmark for Throughput via Ollama (Local LLMs)

359
model-vs-human
model-vs-human bethgelab Python

Benchmark your model on out-of-distribution datasets with carefully collected human comparison data (NeurIPS 2021 Oral)

359
diff-sampler
diff-sampler zju-pi Jupyter Notebook

An open-source toolbox for fast sampling of diffusion models. Official implementations of our works published in ICML, NeurIPS, CVPR, J. Stat. Mech.

359