1 repository on SrcLog
🍎 One Kernel a Day, Keeps High Latency Away. Collection of hand-tuned, peak-performance CUDA kernels, seamlessly integrated as PyTorch C++ extensions.