Step-DPO

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

How to download and set up Step-DPO

Open a terminal and run the command:
git clone https://github.com/dvlab-research/Step-DPO.git
git clone creates a local copy of the Step-DPO repository. You pass git clone a repository URL;
it supports several network protocols and corresponding URL formats.
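
To illustrate the URL formats mentioned above, here is a minimal sketch that builds the HTTPS and SSH forms of the repository URL. The variable names are hypothetical and chosen only for illustration; the repository path is the one given in this document, and the SSH form assumes an SSH key is already registered with your GitHub account.

```shell
# Hypothetical variable names; only the repository path comes from this page.
repo_path="dvlab-research/Step-DPO"

# HTTPS form: needs no SSH key setup.
https_url="https://github.com/${repo_path}.git"

# SSH (scp-like) form: assumes an SSH key is registered with your GitHub account.
ssh_url="git@github.com:${repo_path}.git"

# Print both forms so you can compare them.
printf '%s\n' "$https_url" "$ssh_url"

# Clone with whichever form suits your setup (uncomment one):
# git clone "$https_url"
# git clone "$ssh_url"
```

Either command, once uncommented, creates a Step-DPO directory in the current working directory.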

You can also download Step-DPO as a zip file: https://github.com/dvlab-research/Step-DPO/archive/master.zip

Or clone Step-DPO over SSH:
git clone git@github.com:dvlab-research/Step-DPO.git

If you have problems with Step-DPO

You can open an issue on the Step-DPO issue tracker: https://github.com/dvlab-research/Step-DPO/issues