yanchaomars commited on
Commit
7f3de60
·
verified ·
1 Parent(s): 4fe4b65

update news

Browse files

update new models in news

Files changed (1) hide show
  1. README.md +7 -2
README.md CHANGED
@@ -11,13 +11,18 @@ library_name: transformers
11
 
12
  Check our open-source repository https://github.com/stepfun-ai/Step-Audio-EditX for more details!
13
 
 
 
 
 
 
14
  We are open-sourcing **Step-Audio-EditX**, a powerful **3B parameters** LLM-based audio model specialized in expressive and **iterative audio editing**.
15
  It excels at **editing emotion**, **speaking style**, and **paralinguistics**, and also features robust **zero-shot text-to-speech (TTS)** capabilities.
16
 
17
  ## Features
18
  - **Zero-Shot TTS**
19
- - Excellent zero-shot TTS cloning for Mandarin, English, Sichuanese, and Cantonese.
20
- - To use a dialect, just add a **[Sichuanese]** or **[Cantonese]** tag before your text.
21
 
22
  - **Emotion and Speaking Style Editing**
23
  - Remarkably effective iterative control over emotions and styles, supporting **dozens** of options for editing.
 
11
 
12
  Check our open-source repository https://github.com/stepfun-ai/Step-Audio-EditX for more details!
13
 
14
+ ## 🔥🔥🔥 News!!!
15
+ * Nov 28, 2025: 🚀 New Model Release: Now supporting **`Japanese`** and **`Korean`** languages.
16
+ * Nov 23, 2025: 📊 [Step-Audio-Edit-Benchmark](https://github.com/stepfun-ai/Step-Audio-Edit-Benchmark) Released!
17
+ * Nov 19, 2025: ⚙️ We release a **new version** of our model, which **supports polyphonic pronunciation control** and improves the performance of emotion, speaking style, and paralinguistic editing.
18
+
19
  We are open-sourcing **Step-Audio-EditX**, a powerful **3B parameters** LLM-based audio model specialized in expressive and **iterative audio editing**.
20
  It excels at **editing emotion**, **speaking style**, and **paralinguistics**, and also features robust **zero-shot text-to-speech (TTS)** capabilities.
21
 
22
  ## Features
23
  - **Zero-Shot TTS**
24
+ - Excellent zero-shot TTS cloning for `Mandarin`, `English`, `Sichuanese`, `Cantonese`, `Japanese` and `Korean`.
25
+ - To use a dialect, just add a **`[Sichuanese]`**, **`[Cantonese]`** ,**`[Japanese]`**,**`[Korean]`** tag before your text.
26
 
27
  - **Emotion and Speaking Style Editing**
28
  - Remarkably effective iterative control over emotions and styles, supporting **dozens** of options for editing.