update news
update new models in news
README.md
CHANGED
@@ -11,13 +11,18 @@ library_name: transformers
 
 Check our open-source repository https://github.com/stepfun-ai/Step-Audio-EditX for more details!
 
+## 🔥🔥🔥 News!!!
+* Nov 28, 2025: 🚀 New Model Release: Now supporting **`Japanese`** and **`Korean`** languages.
+* Nov 23, 2025: 📊 [Step-Audio-Edit-Benchmark](https://github.com/stepfun-ai/Step-Audio-Edit-Benchmark) Released!
+* Nov 19, 2025: ⚙️ We release a **new version** of our model, which **supports polyphonic pronunciation control** and improves the performance of emotion, speaking style, and paralinguistic editing.
+
 We are open-sourcing **Step-Audio-EditX**, a powerful **3B parameters** LLM-based audio model specialized in expressive and **iterative audio editing**.
 It excels at **editing emotion**, **speaking style**, and **paralinguistics**, and also features robust **zero-shot text-to-speech (TTS)** capabilities.
 
 ## Features
 - **Zero-Shot TTS**
-  - Excellent zero-shot TTS cloning for Mandarin
-  - To use a dialect, just add a
+  - Excellent zero-shot TTS cloning for `Mandarin`, `English`, `Sichuanese`, `Cantonese`, `Japanese` and `Korean`.
+  - To use a dialect, just add a **`[Sichuanese]`**, **`[Cantonese]`** ,**`[Japanese]`**,**`[Korean]`** tag before your text.
 
 - **Emotion and Speaking Style Editing**
   - Remarkably effective iterative control over emotions and styles, supporting **dozens** of options for editing.
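
The updated README line says a dialect or language is selected by putting a bracketed tag before the input text. A minimal sketch of that convention follows. The helper `add_language_tag` is hypothetical and not part of the Step-Audio-EditX code. The sketch assumes the tag is joined to the text with no separator, and that Mandarin and English need no tag because the README lists none for them.

```python
from typing import Optional

# Tag names taken from the README diff; everything else here is illustrative.
SUPPORTED_TAGS = ("Sichuanese", "Cantonese", "Japanese", "Korean")


def add_language_tag(text: str, language: Optional[str] = None) -> str:
    """Return `text` prefixed with a `[Language]` tag.

    With no language given, the text is returned unchanged
    (assumed to be the Mandarin/English case).
    """
    if language is None:
        return text
    if language not in SUPPORTED_TAGS:
        raise ValueError(f"unsupported tag: {language!r}")
    # Assumption: no space between the tag and the text.
    return f"[{language}]{text}"
```

For example, `add_language_tag("こんにちは", "Japanese")` builds the tagged string that would then be passed to the model as the text to synthesize.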