Most popular benchmark repositories and open source projects

Edge-Detection-project bockp JavaScript

Tiny Image in Javascript - Edge Detection Algorithms

35 9 35

cpp-serialization-benchmark felixguendling C++

Comparison of C++ Serialization Libraries for Graph Data

35 3 35

MaskedFaceRepresentation sachith500 Python

Masked face recognition focuses on identifying people using their facial features while they are wearing masks. We introduce benchmarks on face verifi...

35 6 35

raytriangle-test johnnovak Nim

Ray-triangle intersection performance tests in various languages

35 5 35

ViHOS phusroyal Jupyter Notebook

Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)

35 10 35

hdrhistogram-swift HdrHistogram Swift

Swift port of HdrHistogram

35 8 35

ConBench foundation-multimodal-models Python

[NeurIPS'24] Official implementation of paper "Unveiling the Tapestry of Consistency in Large Vision-Language Models".

35 2 35

sceneflow_from_blender cv-stuttgart Python

Get 3D motion vectors / scene flow directly from Blender

35 4 35

GenoArmory MAGICS-LAB Python

GenoArmory: A Unified Evaluation Framework for Adversarial Attacks on Genomic Foundation Models

35 1 35

wasm-render alanmacleod TypeScript

Software 3D renderer & rasteriser written in WASM/C & TypeScript to test / showcase WebAssembly and compare performance

34 5 34

goku k-nasa Rust

goku is a HTTP load testing application written in Rust

34 3 34

benchbox tboox C

🧀 The Benchmark Testing Box

34 11 34

SparkDataset Spratiher9 Jupyter Notebook

Instant search for and access to many datasets in Pyspark.

34 8 34

lua-vs-vimscript henriquehbr

A simple benchmark comparing Lua performance to Vimscript (because no one seems to care about these nowadays)

34 1 34

LREBench zjunlp Python

[EMNLP 2022 Findings] Towards Realistic Low-resource Relation Extraction: A Benchmark with Empirical Baseline Study

34 1 34

MACSum psunlpgroup Python

Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.

34 3 34

MileBench MileBench Python

This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"

34 2 34

TypeEvalPy secure-software-engineering Python

A Micro-benchmarking Framework for Python Type Inference Tools

34 2 34

Mess-benchmark bsc-mem Shell

A Multiplatform benchmark designed to provide holistic, detailed and close-to-hardware view of memory system performance with family of bandwidth--lat...

34 6 34

indivi_collection gaujay C++

A collection of std-like containers written in C++11. Features fast unordered flat map/set, configurable double-ended vector and sparse deque.

34 2 34

redis-benchmarks-specification redis Python

The Redis benchmarks specification describes the cross-language/tools requirements and expectations to foster performance and observability standards...

34 13 34

tiny_qa_benchmark_pp vincentkoc Python

Tiny QA Benchmark++ a micro-benchmark suite (52-item gold + on-demand multilingual synthetic packs), generator CLI, and CI-ready eval harness for ultr...

34 0 34

MSAD boniolp Jupyter Notebook

[VLDB 2023] Model Selection for Anomaly Detection in Time Series

34 6 34

CIS-Settings krispayne Shell

CIS settings bootstrapper for Mac

33 5 33

rex goanywhere Go

Pleasures for Web in Golang

33 3 33

saca-bench kurpicz Shell

Collection of Suffix Array Construction Algorithms (SACAs)

33 4 33

cmdbench manzik Python

Quick and easy resource usage monitoring and benchmarking for any command's CPU, memory, disk usage and runtime.

33 6 33

jomt gaujay C++

Google Benchmark data visualization tool

33 5 33

videocube-toolkit huuuuusy Python

The official python toolkit for running experiments and evaluate performance on VideoCube benchmark @TPAMI2023

33 6 33

criterion-table nu11ptr Rust

Generate markdown comparison tables from `cargo-criterion` JSON output

33 3 33

imread_benchmark ternaus Python

I/O benchmark for different image processing python libraries.

33 4 33

BeHonest GAIR-NLP JavaScript

BeHonest: Benchmarking Honesty in Large Language Models

33 0 33

critdd mirkobunse Python

Critical difference diagrams with Python and Tikz

33 3 33

cpp2lua-buindings-battle bagobor C++

Lua <-> C++ bindings libraries benchmark

32 1 32

go-test-driven-development gunjan5 Go

:hammer: :wrench: Test Driven Development :repeat: with Golang :hamster:

32 7 32

swords p-lambda Python

The Stanford Word Substitution (Swords) Benchmark

32 6 32

gwvault GoodwayGroup Go

ansible-vault CLI reimplemented in go

32 10 32

powerqe ryanxingql Python

An unified framework of quality enhancement approaches for compressed images based on PyTorch.

32 1 32

Python-Complementary-Languages 00sapo Python

Just a small test to see which language is better for extending python when using lists of lists

32 6 32

NYU-VPR ai4ce Python

[IROS2021] NYU-VPR: Long-Term Visual Place Recognition Benchmark with View Direction and Data Anonymization Influences

32 3 32

useb UKPLab Python

Heterogenous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/2104.06979.

32 2 32

OpenFed FederalLab Python

A Comprehensive and Versatile Open-Source Federated Learning Framework

32 3 32

HPO-B machinelearningnuremberg Python

[NeurIPS DBT 2021] HPO-B

32 9 32

divergent lechmazur

LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each other or to 50 i...

32 1 32