StyleTTS2

yl4579

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Stars: 6.2k
Forks: 670
Watchers: 6.2k
Language: Python
License: MIT
SrcLog Score: 100
Cost to Build: $8.18M
Market Value: $32.86M

Growth over time

3 data points · 2025-08-03 → 2026-04-19 (stars, forks, watchers)

How to clone StyleTTS2

Clone via HTTPS

git clone https://github.com/yl4579/StyleTTS2.git

Clone via SSH

git clone git@github.com:yl4579/StyleTTS2.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the StyleTTS2 issue tracker: https://github.com/yl4579/StyleTTS2/issues