KVQuant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
How to download and setup KVQuant
Open terminal and run command
git clone https://github.com/SqueezeAILab/KVQuant.git
git clone is used to create a copy or clone of KVQuant repositories.
You pass git clone a repository URL. it supports a few different network protocols and corresponding URL formats.
Also you may download zip file with KVQuant https://github.com/SqueezeAILab/KVQuant/archive/master.zip
Or simply clone KVQuant with SSH
[email protected]:SqueezeAILab/KVQuant.git
If you have some problems with KVQuant
You may open issue on KVQuant support forum (system) here: https://github.com/SqueezeAILab/KVQuant/issuesSimilar to KVQuant repositories
Here you may see KVQuant alternatives and analogs
natural-language-processing lectures spaCy HanLP gensim MatchZoo tensorflow-nlp Awesome-pytorch-list spacy-models Repo-2017 stanford-tensorflow-tutorials awesome-nlp nlp_tasks nltk pattern TextBlob CoreNLP allennlp mycroft-core practical-pytorch textract languagetool MITIE machine_learning_examples prose arXivTimes ltp libpostal sling DeepNLP-models-Pytorch