1 repository on SrcLog
OpenAI-compatible embedding server built on pure ONNX Runtime—fast starts, tiny footprint, GPU/CPU with optional quantization