diy-llm

🎓 系统性大语言模型构建课程｜🛠️ 覆盖预训练数据工程、Tokenizer、Transformer、MoE、GPU 编程 (CUDA/Triton)、分布式训练、Scaling Laws、推理优化及对齐 (SFT/RLHF/GRPO)｜🚀 6 个渐进式作业 + 代码驱动，建立 LLM 全栈认知体系

nlp

View on GitHub Website

1.1k Stars

113 Forks

3 Watchers

Jupyter Notebook Language

100 SrcLog Score

Cost to Build

$642.0K

Market Value

$2.96M

How is this calculated?

Growth over time

4 data points · 2026-04-09 → 2026-07-25

Stars Forks Watchers

💬

How do you feel about this project?

Ask AI about diy-llm

Question copied to clipboard

What is the datawhalechina/diy-llm GitHub project? Description: "🎓 系统性大语言模型构建课程｜🛠️ 覆盖预训练数据工程、Tokenizer、Transformer、MoE、GPU 编程 (CUDA/Triton)、分布式训练、Scaling Laws、推理优化及对齐 (SFT/RLHF/GRPO)｜🚀 6 个渐进式作业 + 代码驱动，建立 LLM 全栈认知体系". Written in Jupyter Notebook. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone diy-llm

Clone via HTTPS

git clone https://github.com/datawhalechina/diy-llm.git

Clone via SSH

[email protected]:datawhalechina/diy-llm.git

Download ZIP

Download main.zip

Found an issue?

Report bugs or request features on the diy-llm issue tracker:

Open GitHub Issues

Similar to diy-llm

lectures spaCy HanLP compromise gensim stanford-tensorflow-tutorials nltk awesome-nlp TextBlob ailearning CoreNLP ansj_seg rasa tensorflow_cookbook allennlp flashtext TagUI franc mycroft-core practical-pytorch text_classification nlp_tasks DeepPavlov snips-nlu Awesome-pytorch-list kcws Awesome-Chinese-NLP sentiment prose DeepLearn