2 repositories on SrcLog
A high-throughput and memory-efficient inference and serving engine for LLMs
System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge