diy-llm

diy-llm

datawhalechina

🎓 系统性大语言模型构建课程|🛠️ 覆盖预训练数据工程、Tokenizer、Transformer、MoE、GPU 编程 (CUDA/Triton)、分布式训练、Scaling Laws、推理优化及对齐 (SFT/RLHF/GRPO)|🚀 6 个渐进式作业 + 代码驱动,建立 LLM 全栈认知体系

550 Stars
67 Forks
550 Watchers
Jupyter Notebook Language
100 SrcLog Score
Cost to Build
$580.7K
Market Value
$2.65M

Growth over time

2 data points  ·  2026-04-09 → 2026-04-17
Stars Forks Watchers
💬

How do you feel about this project?

Ask AI about diy-llm

Question copied to clipboard

What is the datawhalechina/diy-llm GitHub project? Description: "🎓 系统性大语言模型构建课程|🛠️ 覆盖预训练数据工程、Tokenizer、Transformer、MoE、GPU 编程 (CUDA/Triton)、分布式训练、Scaling Laws、推理优化及对齐 (SFT/RLHF/GRPO)|🚀 6 个渐进式作业 + 代码驱动,建立 LLM 全栈认知体系". Written in Jupyter Notebook. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone diy-llm

Clone via HTTPS

git clone https://github.com/datawhalechina/diy-llm.git

Clone via SSH

[email protected]:datawhalechina/diy-llm.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the diy-llm issue tracker:

Open GitHub Issues