A Chinese National Medical Licensing Examination dataset and large languge model benchmarks
Очень простой скрипт тестирования быстродействия PHP | Very simple script for testing of PHP operations speed (rusoft repo mirror)
Python project template with unit-tests, documentation, ci-testing and workflows.
Exploring Evolution-aware & free protein language models as protein function predictors
Official repository for KoMT-Bench built by LG AI Research
[ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical...
A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.
A Survey and Benchmark of QUIC
MoleculeNet benchmark dataset & MolMapNet dataset
Benchmarks for intrinsic word embeddings evaluation.
Handy tool to measure the performance and efficiency of LLMs workloads.
Code base for the SENSORIUM competition.
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
Cookbook to provide solutions to common tasks and problems in using Polars with R
Micro-benchmarking library for C and C++ with PMU counters tracking
Open-source benchmark datasets and pretrained transformer models in the Filipino language.
Code and models for the WACV 2021 paper "Benchmark for evaluating pedestrian action prediction"
A minimal wrk image for Docker - Modern HTTP benchmarking tool.
Utilize the Fetch API to send frequent requests to the target website, simulating the effect of pressing F5 to refresh, in order to test the server's...
Large-scale uncertainty benchmark in deep learning.
SustainDC is a set of Python environments for Data Center simulation and control using Heterogeneous Multi Agent Reinforcement Learning. Includes cust...
POSIX Compliant script to bench your server.
A benchmark for Salient Object Detection (SOD).
Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.
Datasets for Evaluation on Domain Knowledge Graph
掘金的小册《Android 进阶:基于 Kotlin 的 Android App 开发实践》中的相关的例子
Miscellaneous benchmarks for JSON serialization on JVM/Android
Directly calling functional components instead of mounting them is faster.
Code for "Are Local Features All You Need for Cross-Domain Visual Place Recognition?" CVPR IMW 2023.
benchmark tool for tars/http service
🔮 Obtain the power of touchless interaction with display screens
A tool automatically improving the performance of large-scale systems by finding better configuration settings
A component render time benchmarking suite for React
CPU micro benchmarks
Salient objects in clutter, TPAMI, 2022
Smart benchmarking of pull requests with statistical confidence
A LLM training and evaluation benchmark for credit scoring
a simple benchmark testing tool implemented in golang with some small features
Official code for DOC-VTON. We provide visualization results of Awesome Virtual Tryon. Besides, we provide auxiliary data of VITON and VITON-HD for tr...
Benchmark PHP, HHVM and Zephir
Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a small set of ex...
A python package providing a benchmark with various specified distribution shift patterns.
[ICLR 2025] Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video
[ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binarization.
This is the official code for the paper "Tracker Meets Night: A Transformer Enhancer for UAV Tracking".
🏋️ Bash Script which runs several Linux benchmarks (Sysbench, UnixBench and Geekbench)
GraphQL benchmarks using the-benchmarker framework.
Wide NoSQL benchmark for RocksDB, LevelDB, Redis, WiredTiger and MongoDB extending the Yahoo Cloud Serving Benchmark
A manually vetted dataset for security vulnerability detection in Java projects
[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis