2 repositories on SrcLog
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
A Really Scalable RL Framework to 10k+ CPUs