Topic

benchmark

Repositories (1623)

CMExam
CMExam williamliujl Python

A Chinese National Medical Licensing Examination dataset and large languge model benchmarks

64
php-simple-benchmark-script
php-simple-benchmark-script rusoft PHP

Очень простой скрипт тестирования быстродействия PHP | Very simple script for testing of PHP operations speed (rusoft repo mirror)

64
PythonProjectTemplate
PythonProjectTemplate franneck94 Python

Python project template with unit-tests, documentation, ci-testing and workflows.

64
Revisiting-PLMs
Revisiting-PLMs elttaes Python

Exploring Evolution-aware & free protein language models as protein function predictors

64
KoMT-Bench
KoMT-Bench LG-AI-EXAONE Python

Official repository for KoMT-Bench built by LG AI Research

63
PhysBench
PhysBench USC-GVL Python

[ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical...

63
GeoBench
GeoBench aim-uofa Python

A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.

63
quic_vs_tcp
quic_vs_tcp Shenggan Python

A Survey and Benchmark of QUIC

63
ChemBench
ChemBench shenwanxiang HTML

MoleculeNet benchmark dataset & MolMapNet dataset

63
word-benchmarks
word-benchmarks vecto-ai

Benchmarks for intrinsic word embeddings evaluation.

63
ollama-benchmark
ollama-benchmark cloudmercato Python

Handy tool to measure the performance and efficiency of LLMs workloads.

62
sensorium
sensorium sinzlab Jupyter Notebook

Code base for the SENSORIUM competition.

62
Safe-Multi-Agent-Mujoco
Safe-Multi-Agent-Mujoco chauncygu Python

Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.

62
cookbook-rpolars
cookbook-rpolars ddotta CSS

Cookbook to provide solutions to common tasks and problems in using Polars with R

61
b63
b63 okuvshynov C

Micro-benchmarking library for C and C++ with PMU counters tracking

61
Filipino-Text-Benchmarks
Filipino-Text-Benchmarks jcblaisecruz02 Python

Open-source benchmark datasets and pretrained transformer models in the Filipino language.

61
PedestrianActionBenchmark
PedestrianActionBenchmark ykotseruba Python

Code and models for the WACV 2021 paper "Benchmark for evaluating pedestrian action prediction"

60
docker-wrk
docker-wrk William-Yeh Dockerfile

A minimal wrk image for Docker - Modern HTTP benchmarking tool.

60
f5-bench
f5-bench ikxin CSS

Utilize the Fetch API to send frequent requests to the target website, simulating the effect of pressing F5 to refresh, in order to test the server's...

60
untangle
untangle bmucsanyi Python

Large-scale uncertainty benchmark in deep learning.

60
dc-rl
dc-rl HewlettPackard HTML

SustainDC is a set of Python environments for Data Center simulation and control using Heterogeneous Multi Agent Reinforcement Learning. Includes cust...

60
benchy
benchy L1so Shell

POSIX Compliant script to bench your server.

60
SALOD
SALOD moothes Python

A benchmark for Salient Object Detection (SOD).

60
Safe-Multi-Agent-Isaac-Gym
Safe-Multi-Agent-Isaac-Gym chauncygu Python

Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.

60
OpenBG
OpenBG OpenBGBenchmark

Datasets for Evaluation on Domain Knowledge Graph

60
kotlin_tutorial
kotlin_tutorial fengzhizi715 Kotlin

掘金的小册《Android 进阶:基于 Kotlin 的 Android App 开发实践》中的相关的例子

60
json-serialization-benchmarking
json-serialization-benchmarking ZacSweers Java

Miscellaneous benchmarks for JSON serialization on JVM/Android

60
functional-components-benchmark
functional-components-benchmark missive JavaScript

Directly calling functional components instead of mounting them is faster.

59
re-ranking-for-VPR
re-ranking-for-VPR gbarbarani Python

Code for "Are Local Features All You Need for Cross-Domain Visual Place Recognition?" CVPR IMW 2023.

59
TarsBenchmark
TarsBenchmark TarsCloud JavaScript

benchmark tool for tars/http service

59
AR-Touch
AR-Touch erfansn Kotlin

🔮 Obtain the power of touchless interaction with display screens

58
bestconf
bestconf zhuyuqing Java

A tool automatically improving the performance of large-scale systems by finding better configuration settings

58
fuego
fuego apiv JavaScript

A component render time benchmarking suite for React

57
cpu-micro-benchmarks
cpu-micro-benchmarks jiegec Assembly

CPU micro benchmarks

57
SODBenchmark
SODBenchmark DengPingFan

Salient objects in clutter, TPAMI, 2022

57
touchstone
touchstone lorenzwalthert R

Smart benchmarking of pull requests with statistical confidence

57
CALM
CALM The-FinAI Python

A LLM training and evaluation benchmark for credit scoring

57
benchmark
benchmark cnlh Go

a simple benchmark testing tool implemented in golang with some small features

57
DOC-VTON
DOC-VTON JyChen9811 Python

Official code for DOC-VTON. We provide visualization results of Awesome Virtual Tryon. Besides, we provide auxiliary data of VITON and VITON-HD for tr...

57
Benchmark-PHP-HHVM-Zephir
Benchmark-PHP-HHVM-Zephir treffynnon Shell

Benchmark PHP, HHVM and Zephir

57
generalization
generalization lechmazur

Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a small set of ex...

57
whyshift
whyshift namkoong-lab Jupyter Notebook

A python package providing a benchmark with various specified distribution shift patterns.

57
SLAM-under-Perturbation
SLAM-under-Perturbation Xiaohao-Xu C++

[ICLR 2025] Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video

56
BiBench
BiBench htqin Python

[ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binarization.

56
SCT
SCT vision4robotics Python

This is the official code for the paper "Tracker Meets Night: A Transformer Enhancer for UAV Tracking".

56
benchmark
benchmark Cyclenerd Shell

🏋️ Bash Script which runs several Linux benchmarks (Sysbench, UnixBench and Geekbench)

56
graphql-benchmarks
graphql-benchmarks the-benchmarker Ruby

GraphQL benchmarks using the-benchmarker framework.

56
ucsb
ucsb unum-cloud C++

Wide NoSQL benchmark for RocksDB, LevelDB, Redis, WiredTiger and MongoDB extending the Yahoo Cloud Serving Benchmark

56
cwe-bench-java
cwe-bench-java iris-sast Python

A manually vetted dataset for security vulnerability detection in Java projects

56
Science-T2I
Science-T2I Jialuo-Li Python

[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis

55