Topic

benchmark

Repositories (1623)

CValues
CValues X-PLUG Python

面向中文大模型价值观的评估与对齐研究

519
FewCLUE
FewCLUE CLUEbenchmark Python

FewCLUE 小样本学习测评基准,中文版

509
Leaderboard
Leaderboard SpeechColab Python

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

503
globalping
globalping jsdelivr TypeScript

A global network of probes to run network tests like ping, traceroute and DNS resolve

497
LongCite
LongCite THUDM Python

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

495
z-bench
z-bench zhenbench

Z-Bench 1.0 by 真格基金:一个麻瓜的大语言模型中文测试集。Z-Bench is a LLM prompt dataset for non-technical users, developed by an enthusiastic AI-focu...

495
pcam
pcam basveeling Python

The PatchCamelyon (PCam) deep learning classification benchmark.

492
Visual-Tracking-Development
Visual-Tracking-Development DavidZhangdw Python

Visual Object Tracking

489
llmc
llmc ModelTC Python

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Comp...

487
kg-gen
kg-gen stair-lab Python

Knowledge Graph Generation from Any Text

483
glmark2
glmark2 glmark2 C

glmark2 is an OpenGL 2.0 and ES 2.0 benchmark

480
RHEL7-CIS
RHEL7-CIS ansible-lockdown YAML

Automated CIS Benchmark Compliance Remediation for RHEL 7 with Ansible

478
web-tooling-benchmark
web-tooling-benchmark v8 JavaScript

JavaScript benchmark for common web developer workloads

474
awesome-state-of-depth-completion
awesome-state-of-depth-completion alexklwong

Current state of supervised and unsupervised depth completion methods

469
PaddleFleetX
PaddleFleetX PaddlePaddle Python

飞桨大模型开发套件,提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。

468
tf_to_trt_image_classification
tf_to_trt_image_classification NVIDIA-AI-IOT Python

Image classification with NVIDIA TensorRT from TensorFlow models.

457
LayoutFrameworkBenchmark
LayoutFrameworkBenchmark layoutBox Swift

Benchmark the performances of various Swift layout frameworks (autolayout, UIStackView, PinLayout, LayoutKit, FlexLayout, Yoga, ...)

445
sympact
sympact simonepri JavaScript

🔥 Stupid Simple CPU/MEM "Profiler" for your JS code.

443
prophiler
prophiler fabfuel PHP

PHP Profiler & Developer Toolbar (built for Phalcon)

442
benchmarks-of-javascript-package-managers
benchmarks-of-javascript-package-managers pnpm JavaScript

Benchmarks of JavaScript Package Managers

435
automlbenchmark
automlbenchmark openml Python

OpenML AutoML Benchmarking Framework

428
ChineseBLUE
ChineseBLUE alibaba-research Python

Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)

423
gymfc
gymfc wil3 Python

A universal flight control tuning framework

423
BlurTestAndroid
BlurTestAndroid patrickfav Java

This is a simple App to test some blur algorithms on their visual quality and performance.

421
pyaf
pyaf antoinecarme Python

PyAF is an Open Source Python library for Automatic Time Series Forecasting built on top of popular pydata modules.

414
srs-bench
srs-bench ossrs Go

SB(SRS Bench) is a set of benchmark and regression test tools, for SRS and other media servers, supports HTTP-FLV, RTMP, HLS, WebRTC and GB28181.

414
ant-application-security-testing-benchmark
ant-application-security-testing-benchmark alipay Java

xAST评价体系,让安全工具不再“黑盒”. The xAST evaluation benchmark makes security tools no longer a "black box".

412
oltpbench
oltpbench oltpbenchmark Java

Database Benchmarking Framework

411
superpixel-benchmark
superpixel-benchmark davidstutz C++

An extensive evaluation and comparison of 28 state-of-the-art superpixel algorithms on 5 datasets.

411
modclean
modclean ModClean JavaScript

Remove unwanted files and directories from your node_modules folder

406
BenchMARL
BenchMARL facebookresearch Python

A collection of MARL benchmarks based on TorchRL

403
mixbench
mixbench ekondis C++

A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)

401
FedScale
FedScale SymbioticLab Python

FedScale is a scalable and extensible open-source federated learning (FL) platform.

400
TheAgentCompany
TheAgentCompany TheAgentCompany Python

An agent benchmark with tasks in a simulated software company.

395
cob
cob knqyf263 Go

Continuous Benchmark for Go Project

387
package-benchmark
package-benchmark ordo-one Swift

Swift benchmark runner with many performance metrics and great CI support

386
jetson_benchmarks
jetson_benchmarks NVIDIA-AI-IOT Python

Jetson Benchmark

382
CSS-IN-JS-Benchmarks
CSS-IN-JS-Benchmarks A-gambit JavaScript
381
bigcodebench
bigcodebench bigcode-project Python

[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

378
KernelBench
KernelBench ScalingIntelligence Python

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems

374
Face-landmarks-detection-benchmark
Face-landmarks-detection-benchmark mrgloom

Face landmarks(fiducial points) detection benchmark

372
DynamicMap_Benchmark
DynamicMap_Benchmark KTH-RPL Jupyter Notebook

The First Dynamic Map Removal Benchmark | Included 8 SOTA methods | Continous updating

367
EasyCompressor
EasyCompressor mjebrahimi C#

⚡An Easy-to-Use and Optimized compression library for .NET that unified several compression algorithms including LZ4, Snappy, Zstd, LZMA, Brotli, GZi...

364
dance
dance OmicsML Python

DANCE: a deep learning library and benchmark platform for single-cell analysis

361
SKAB
SKAB waico Jupyter Notebook

SKAB - Skoltech Anomaly Benchmark. Time-series data for evaluating Anomaly Detection algorithms.

360
MOSE-api
MOSE-api henghuiding Python

[ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

359
RGBD-SODsurvey
RGBD-SODsurvey taozh2017 MATLAB

RGB-D Salient Object Detection: A Survey

359
puck
puck baidu Jupyter Notebook

Puck is a high-performance ANN search engine

357
vtebench
vtebench alacritty Rust

Generate benchmarks for terminal emulators

357
are-we-fast-yet
are-we-fast-yet smarr Java

Are We Fast Yet? Comparing Language Implementations with Objects, Closures, and Arrays

356