Build and run Docker containers leveraging NVIDIA GPUs
State-of-the-art deep learning scripts organized by model - easy to train and deploy, with reproducible accuracy and performance on enterprise-grade infrastructure.
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
NeMo: a toolkit for conversational AI
Synthesizing and manipulating 2048x1024 images with conditional GANs
A Python framework for GPU-accelerated simulation, robotics, and machine learning.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
Optimized primitives for collective multi-GPU communication
Deep Learning GPU Training System
NVIDIA device plugin for Kubernetes
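As a minimal sketch of how the device plugin is consumed: once deployed, it advertises GPUs to the cluster as the extended resource `nvidia.com/gpu`, which pods then request through resource limits. The pod name and image tag below are illustrative, not prescribed by the plugin.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: gpu-test                # hypothetical pod name
spec:
  restartPolicy: Never
  containers:
  - name: cuda-container
    image: nvidia/cuda:12.4.0-base-ubuntu22.04
    command: ["nvidia-smi"]
    resources:
      limits:
        nvidia.com/gpu: 1       # requests one GPU via the plugin's extended resource
```

A pod that omits the `nvidia.com/gpu` limit is scheduled without GPU access; the limit is how the kubelet knows to allocate a device through the plugin.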
A library for accelerating Transformer models on NVIDIA GPUs, including 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada, and Blackwell GPUs, delivering better performance with lower memory utilization in both training and inference.
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
Open-source framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl
CUDA Core Compute Libraries
cuTile is a programming model for writing parallel kernels for NVIDIA GPUs
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
AIStore: scalable storage for AI applications
Deep learning for recommender systems
Toolkit for efficient experimentation with Speech Recognition, Text-to-Speech, and NLP
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
Fast and accurate object detection with end-to-end GPU optimization
CUDA Kernel Benchmarking Library
GPU accelerated decision optimization
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).