FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

How to download and set up FlexLLMGen

Open a terminal and run the following command:
git clone https://github.com/FMInference/FlexLLMGen.git
git clone creates a local copy of the FlexLLMGen repository. You pass git clone a repository URL.
Git supports several network protocols, such as HTTPS and SSH, each with its own URL format.

You can also download FlexLLMGen as a zip file: https://github.com/FMInference/FlexLLMGen/archive/master.zip

Or clone FlexLLMGen over SSH:
git clone git@github.com:FMInference/FlexLLMGen.git
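
The download options above differ only in the URL they use. As a minimal sketch, the helper below prints the URL for each option so it can be passed to git clone or to a download tool. The function name clone_url and its https/ssh/zip arguments are hypothetical names chosen for illustration and are not part of FlexLLMGen or Git. The SSH form assumes GitHub's standard git@github.com:owner/repo.git format and an SSH key already registered with your GitHub account.

```shell
# Hypothetical helper (not part of FlexLLMGen): print the URL for a
# given download method so it can be handed to git or a download tool.
clone_url() {
  repo="FMInference/FlexLLMGen"
  case "$1" in
    https) echo "https://github.com/${repo}.git" ;;
    ssh)   echo "git@github.com:${repo}.git" ;;
    zip)   echo "https://github.com/${repo}/archive/master.zip" ;;
    *)     echo "unknown method: $1" >&2; return 1 ;;
  esac
}

# Example usage (commented out because it needs network access):
# git clone "$(clone_url https)"
# git clone "$(clone_url ssh)"
```

HTTPS usually works without any extra setup, while SSH avoids repeated password prompts once a key is configured. The zip archive contains the files only, without the Git history.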

If you have problems with FlexLLMGen

You can open an issue on the FlexLLMGen issue tracker: https://github.com/FMInference/FlexLLMGen/issues