Step-DPO

JIA-Lab-research

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

math

View on GitHub

395 Stars

16 Forks

395 Watchers

Python Language

100 SrcLog Score

Cost to Build

$279.4K

Market Value

$708.6K

How is this calculated?

Growth over time

2 data points · 2025-03-01 → 2026-04-01

Stars Forks Watchers

💬

How do you feel about this project?

Ask AI about Step-DPO

Question copied to clipboard

What is the JIA-Lab-research/Step-DPO GitHub project? Description: "Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"". Written in Python. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone Step-DPO

Clone via HTTPS

git clone https://github.com/JIA-Lab-research/Step-DPO.git

Clone via SSH

[email protected]:JIA-Lab-research/Step-DPO.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the Step-DPO issue tracker:

Open GitHub Issues

Similar to Step-DPO

freeCodeCamp freecodecamp.cn mathjs gpu.js sympy Surge simple-statistics mlcourse.ai libchaos cute_headers mathnet-numerics stdlib RandomKit mathquill osmnx math-php Euler stats phobos sage-archive-2023-02-01 Project-Euler-solutions swix pragmatapro expr-eval Sophus long.js primesieve libRmath.js symengine AlgebraicEngine-Fraction