NVIDIA

NVIDIA

🏢 Organization

44 repositories on SrcLog

View on GitHub
44 Repos
135.6k Stars
24.5k Forks
135.6k Watchers

Repositories (44)

nvidia-docker NVIDIA/nvidia-docker

Build and run Docker containers leveraging NVIDIA GPUs

17.5k
DeepLearningExamples NVIDIA/DeepLearningExamples Jupyter Notebook

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

14.8k
TensorRT NVIDIA/TensorRT C++

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

12.9k
cutlass NVIDIA/cutlass C++

CUDA Templates and Python DSLs for High-Performance Linear Algebra

9.6k
NeMo NVIDIA/NeMo Python

NeMo: a toolkit for conversational AI

7.2k
pix2pixHD NVIDIA/pix2pixHD Python

Synthesizing and manipulating 2048x1024 images with conditional GANs

6.9k
warp NVIDIA/warp Python

A Python framework for GPU-accelerated simulation, robotics, and machine learning.

6.5k
DALI NVIDIA/DALI C++

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

5.7k
thrust NVIDIA/thrust C++

[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl

5k
nccl NVIDIA/nccl C++

Optimized primitives for collective multi-GPU communication

4.6k
DIGITS NVIDIA/DIGITS HTML

Deep Learning GPU Training System

4.2k
k8s-device-plugin NVIDIA/k8s-device-plugin Go

NVIDIA device plugin for Kubernetes

3.7k
TransformerEngine NVIDIA/TransformerEngine Python

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

3.3k
TensorRT pytorch/TensorRT Python

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

3k
MinkowskiEngine NVIDIA/MinkowskiEngine Python

Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

2.9k
physicsnemo NVIDIA/physicsnemo Python

Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods

2.7k
gpu-operator NVIDIA/gpu-operator Go

NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes

2.7k
libcudacxx NVIDIA/libcudacxx C++

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

2.3k
cccl NVIDIA/cccl C++

CUDA Core Compute Libraries

2.3k
cutile-python NVIDIA/cutile-python Python

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

2k
cub NVIDIA/cub Cuda

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

1.8k
aistore NVIDIA/aistore Go

AIStore: scalable storage for AI applications

1.8k
DeepRecommender NVIDIA/DeepRecommender Python

Deep learning for recommender systems

1.7k
OpenSeq2Seq NVIDIA/OpenSeq2Seq Python

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

1.6k
cuda-quantum NVIDIA/cuda-quantum C++

C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

997
retinanet-examples NVIDIA/retinanet-examples Python

Fast and accurate object detection with end-to-end GPU optimization

899
nvbench NVIDIA/nvbench Cuda

CUDA Kernel Benchmarking Library

854
cuopt NVIDIA/cuopt Cuda

GPU accelerated decision optimization

823
cuCollections NVIDIA/cuCollections C++
634
jitify NVIDIA/jitify C++

A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).

571