AMD-NFS
theonlychant/AMD-NFS
C++
AMD (Advanced Micro Devices) Native Inference Stack. The goal is a ground-up LLM inference and serving stack that avoids CUDA lock-in, targets ROCm natively, and replaces aging serving software (think vLLM, llama.cpp, and Triton servers) with a cohesive, AMD-optimized build.