Benchmarks of JavaScript Package Managers
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
SB(SRS Bench) is a set of benchmark and regression test tools, for SRS and other media servers, supports HTTP-FLV, RTMP, HLS, WebRTC and GB28181.
Swift benchmark runner with many performance metrics and great CI support
Benchmarking Legal Knowledge of Large Language Models
Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
This is a simple App to test some blur algorithms on their visual quality and performance.
The First Dynamic Map Removal Benchmark | Included 8 SOTA methods | Continous updating
Official repository for the ProteinGym benchmarks
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper and tool list about applying neural networks to impute incomplete time...
Database Benchmarking Framework
This is a reposotory that includes paper、code and datasets about domain generalization-based fault diagnosis and prognosis. (基于领域泛化的故障诊断和...
FedScale is a scalable and extensible open-source federated learning (FL) platform.
PyAF is an Open Source Python library for Automatic Time Series Forecasting built on top of popular pydata modules.
An extensive evaluation and comparison of 28 state-of-the-art superpixel algorithms on 5 datasets.
MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.
Inspect and Debug your Tauri applications in style 💃
Remove unwanted files and directories from your node_modules folder
Advanced System Analysis Software
RoNIN: Robust Neural Inertial Navigation in the Wild
Jetson Benchmark
Gym Electric Motor (GEM): An OpenAI Gym Environment for Electric Motors
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
Benchmarks for the Optimal Power Flow Problem
The official repo of GraphRAG-Bench for evaluating GraphRAG models. "When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented...
Framework to evaluate peformance of ROS 2
Are We Fast Yet? Comparing Language Implementations with Objects, Closures, and Arrays
Generate benchmarks for terminal emulators
SKAB - Skoltech Anomaly Benchmark. Time-series data for evaluating Anomaly Detection algorithms.
Continuous Benchmark for Go Project
DANCE: a deep learning library and benchmark platform for single-cell analysis
GAP Benchmark Suite
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
⚡An Easy-to-Use and Optimized compression library for .NET that unified several compression algorithms including LZ4, Snappy, Zstd, LZMA, Brotli, GZi...
🛍 A real-world e-commerce dataset for session-based recommender systems research.
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
Face landmarks(fiducial points) detection benchmark
[ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
A validation and profiling tool for AI infrastructure
RGB-D Salient Object Detection: A Survey
Puck is a high-performance ANN search engine
Compression Benchmark
STREAM, for lots of devices written in many programming models
Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.
TPC-DS benchmark kit with some modifications/fixes
Framework for benchmarking vector search engines
LLM Benchmark for Throughput via Ollama (Local LLMs)
Benchmark your model on out-of-distribution datasets with carefully collected human comparison data (NeurIPS 2021 Oral)
An open-source toolbox for fast sampling of diffusion models. Official implementations of our works published in ICML, NeurIPS, CVPR, J. Stat. Mech.