Topic

benchmark

Repositories (1623)

auto-pen-bench
auto-pen-bench lucagioacchini Python

This repo contains the code for the penetration test benchmark for Generative Agents presented in the paper "AutoPenBench: Benchmarking Generative Age...

32
VCR
VCR tianyu-z Python

Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.

32
llm-compressive
llm-compressive liyucheng09 Python

Longitudinal Evaluation of LLMs via Data Compression

32
wasm-score
wasm-score bytecodealliance C

A benchmark for standalone WebAssembly

32
Cotempqa
Cotempqa zhaochen0110 Python

Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)

32
ECS.CSharp.Benchmark-common-use-cases
ECS.CSharp.Benchmark-common-use-cases friflo C#

C# ECS Benchmarks

32
spurious_imagenet
spurious_imagenet YanNeu Python

Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet

32
rapidash
rapidash Acanguven TypeScript

🔥 Collection of useful JavaScript snippets with automated benchmarks

31
CellBench
CellBench Shians HTML

R package for benchmarking single cell analysis methods

31
TriageSQL
TriageSQL yszh8 Python

The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question Intention Classification Benchmark for Text-to-SQL"

31
arline_benchmarks
arline_benchmarks ArlineQ Python

The Arline Benchmarks platform allows benchmarking various algorithms for quantum circuit mapping/compression against each other on a list of predefined h...

31
Python-Complementary-Languages
Python-Complementary-Languages 00sapo Python

Just a small test to see which language is better for extending Python when using lists of lists

31
LinGBM
LinGBM LiUGraphQL Java

Linköping GraphQL Benchmark (LinGBM)

31
kino_benchee
kino_benchee livebook-dev Elixir

Benchee (Elixir benchmarking) integration for Livebook

31
benchmark-vfm-ss
benchmark-vfm-ss tue-mps Python
31
isitfast
isitfast yamiteru Jupyter Notebook

A modular benchmarking library with V8 warmup and CPU/RAM denoising for the most accurate and consistent results.

31
Camel-Bench
Camel-Bench mbzuai-oryx Python

[NAACL 2025 🔥] CAMEL-Bench is an Arabic benchmark for evaluating multimodal models across eight domains with 29,000 questions.

31
NewsBench
NewsBench IAAR-Shanghai Python

[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism

31
benchmark-kit
benchmark-kit phpbenchmarks Twig

phpbenchmarks.com kit to add your benchmark.

30
corebench
corebench deckarep Go

corebench - run your benchmarks against high performance computing servers with many CPU cores

30
thread-pool-benchmark
thread-pool-benchmark Red-Portal C++

A C++ Thread Pool Colosseum

30
DACBench
DACBench automl PDDL

A benchmark library for Dynamic Algorithm Configuration.

30
sharkbench
sharkbench sharkbench Rust

Benchmarking programming languages and web frameworks.

30
user-agent-parser-benchmarks
user-agent-parser-benchmarks kenjis PHP

PHP User Agent Parser Benchmarks

29
sim-parameter-estimation
sim-parameter-estimation NVlabs Python

The code accompaniment for the CoRL 2020 paper: A User's Guide to Calibrating Robotics Simulators (https://arxiv.org/abs/2011.08985), from NVIDIA Rese...

29
examples
examples vivaxy JavaScript

📚Examples

29
SERAB
SERAB Neclow Python

SERAB: a multi-lingual benchmark for speech emotion recognition

29
esnext-benchmarks
esnext-benchmarks bevry-archive JavaScript

Benchmarks comparing ESNext features to their ES5 and various pre-processor equivalents

28
golden
golden disruptek Nim

a benchmark for compile-time and/or runtime Nim 🏆

28
micro-runner
micro-runner lucamezzalira JavaScript

Micro-Runner, a CLI playground for benchmarking your JavaScript code

27
clusterbench
clusterbench clusterbench Java

Jakarta EE 5/6/7/8/10 WildFly/JBoss EAP Clustering Benchmark Application

27
js-testrunners-bench
js-testrunners-bench vitalets JavaScript

JavaScript test-runners benchmark

27
dgraph-bench
dgraph-bench linuxerwang Go

A benchmark program for dgraph.

27
OktoberfestFoodDataset
OktoberfestFoodDataset a1302z Jupyter Notebook

Publication of our Oktoberfest Food Dataset for Object Detection methods

27
python-performance
python-performance scivision Python

Performance benchmarks of Python, NumPy, etc. vs. other languages such as Matlab, Julia, Fortran.

27
inspec-gke-cis-benchmark
inspec-gke-cis-benchmark GoogleCloudPlatform Ruby

GKE CIS 1.1.0 Benchmark InSpec Profile

27
shopware6-benchmarking
shopware6-benchmarking tideways HTML

The Shopware 6 performance benchmarking toolset, built by Shopware and Tideways.

27
mpjbt
mpjbt domodwyer Go

MongoDB/PostgreSQL JSON benchmark tool (and slides) for Percona EU 2017

26
node-benchr
node-benchr robertklep JavaScript

Node.js benchmark runner

26
benchee
benchee planttheidea TypeScript

Simple benchmarks in both node and browser

26
text2image-benchmark
text2image-benchmark nashory Python

Performance comparison of existing GAN-based text-to-image algorithms (GAN-CLS, StackGAN, TAC-GAN).

26
streamalg
streamalg biboudis Java

Extensible stream pipelines with object algebras.

26
go-benchmark-app
go-benchmark-app mrLSD Go

Application for HTTP benchmarking via different rules and configs

26
CoreDataImages-Article
CoreDataImages-Article V8tr Swift

The app implements and benchmarks different Core Data persistence options. It supplements the blog post http://www.vadimbulavin.com/how-to-save-images...

26
dubbo-go-benchmark
dubbo-go-benchmark dubbogo Go

Benchmark for apache/dubbo-go (github.com/apache/dubbo-go)

26
Android-ORM-Benchmarks
Android-ORM-Benchmarks mykola-dev Kotlin
25
DeepLearningBenchmarks
DeepLearningBenchmarks avik-pal Julia

Benchmarks across Deep Learning Frameworks in Julia and Python

25
qmlbench
qmlbench CrimsonAS QML

Tool to easily benchmark QML/QtQuick (or your own QML components) performance on different hardware.

25
rgbd_scribble_benchmark
rgbd_scribble_benchmark tum-vision Python

RGB-D Scribble-based Segmentation Benchmark

25
websocket-benchmarker
websocket-benchmarker healeycodes Python

Benchmark a WebSocket server's message throughput ⌛

25