zhouyx1998 commited on
Commit
c1cd68b
·
verified ·
1 Parent(s): 8f9f62d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -3
README.md CHANGED
@@ -226,9 +226,11 @@ This is the secret sauce that gives your audio its unique sound.
226
 
227
  #### 1. Cooking with a Prompt Speech (Following a Famous Recipe)
228
  - A prompt speech provides the desired acoustic characteristics for VoxCPM. The speaker's timbre, speaking style, and even the background sounds and ambiance will be replicated.
229
- - **For a Clean, Studio-Quality Voice:**
230
- - ✅ Enable "Prompt Speech Enhancement". This acts like a noise filter, removing background hiss and rumble to give you a pure, clean voice clone.
231
-
 
 
232
  #### 2. Cooking au Naturel (Letting the Model Improvise)
233
  - If no reference is provided, VoxCPM becomes a creative chef! It will infer a fitting speaking style based on the text itself, thanks to the text-smartness of its foundation model, MiniCPM-4.
234
  - **Pro Tip**: Challenge VoxCPM with any text—poetry, song lyrics, dramatic monologues—it may deliver some interesting results!
 
226
 
227
  #### 1. Cooking with a Prompt Speech (Following a Famous Recipe)
228
  - A prompt speech provides the desired acoustic characteristics for VoxCPM. The speaker's timbre, speaking style, and even the background sounds and ambiance will be replicated.
229
+ - **For a Clean, Denoising Voice:**
230
+ - ✅ Enable "Prompt Speech Enhancement". This acts like a noise filter, removing background hiss and rumble to give you a pure, clean voice clone. However, this will limit the audio sampling rate to 16kHz, restricting the cloning quality ceiling.
231
+ - **For High-Quality Audio Cloning (Up to 44.1kHz):**
232
+ - ❌ Disable "Prompt Speech Enhancement" to preserve all original audio information, including background atmosphere, and support audio cloning up to 44.1kHz sampling rate.
233
+
234
  #### 2. Cooking au Naturel (Letting the Model Improvise)
235
  - If no reference is provided, VoxCPM becomes a creative chef! It will infer a fitting speaking style based on the text itself, thanks to the text-smartness of its foundation model, MiniCPM-4.
236
  - **Pro Tip**: Challenge VoxCPM with any text—poetry, song lyrics, dramatic monologues—it may deliver some interesting results!