Step-DPO

Step-DPO

JIA-Lab-research

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

392 Stars
16 Forks
392 Watchers
Python Language
100 SrcLog Score
Cost to Build
$271.1K
Market Value
$686.8K

Growth over time

2 data points  ·  2025-03-01 → 2026-04-01
Stars Forks Watchers
💬

How do you feel about this project?

Ask AI about Step-DPO

Question copied to clipboard

What is the JIA-Lab-research/Step-DPO GitHub project? Description: "Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"". Written in Python. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone Step-DPO

Clone via HTTPS

git clone https://github.com/JIA-Lab-research/Step-DPO.git

Clone via SSH

[email protected]:JIA-Lab-research/Step-DPO.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the Step-DPO issue tracker:

Open GitHub Issues