Update README.md
README.md
CHANGED
@@ -226,9 +226,11 @@ This is the secret sauce that gives your audio its unique sound.
 #### 1. Cooking with a Prompt Speech (Following a Famous Recipe)
 - A prompt speech provides the desired acoustic characteristics for VoxCPM. The speaker's timbre, speaking style, and even the background sounds and ambiance will be replicated.
-- **For a Clean,
-- ✅ Enable "Prompt Speech Enhancement". This acts like a noise filter, removing background hiss and rumble to give you a pure, clean voice clone.
-
+- **For a Clean, Denoised Voice:**
+- ✅ Enable "Prompt Speech Enhancement". This acts like a noise filter, removing background hiss and rumble to give you a pure, clean voice clone. However, this limits the audio sampling rate to 16 kHz, which caps the achievable cloning quality.
+- **For High-Quality Audio Cloning (Up to 44.1 kHz):**
+- ❌ Disable "Prompt Speech Enhancement" to preserve all original audio information, including background atmosphere, and to support cloning at sampling rates up to 44.1 kHz.
+
 #### 2. Cooking au Naturel (Letting the Model Improvise)
 - If no reference is provided, VoxCPM becomes a creative chef! It will infer a fitting speaking style based on the text itself, thanks to the text-smartness of its foundation model, MiniCPM-4.
 - **Pro Tip**: Challenge VoxCPM with any text, such as poetry, song lyrics, or dramatic monologues. It may deliver some interesting results!
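To make the sampling-rate trade-off in the added lines concrete, here is a minimal sketch in plain Python. It does not call VoxCPM. It only applies the Nyquist rule (audio sampled at `fs` Hz can represent content up to `fs / 2` Hz) to the two rates named in the change. The function name `max_representable_hz` and the `MODES` table are hypothetical, introduced purely for illustration.

```python
# Illustration only: no VoxCPM API is used here.
# Nyquist rule: audio sampled at fs Hz can represent content up to fs / 2 Hz.


def max_representable_hz(sample_rate_hz: float) -> float:
    """Return the highest frequency representable at the given sampling rate."""
    if sample_rate_hz <= 0:
        raise ValueError("sample rate must be positive")
    return sample_rate_hz / 2


# Sampling-rate ceilings as stated in the README text (table name is hypothetical).
MODES = {
    "Prompt Speech Enhancement enabled": 16_000,
    "Prompt Speech Enhancement disabled": 44_100,
}

for mode, rate in MODES.items():
    ceiling = max_representable_hz(rate)
    print(f"{mode}: {rate} Hz sampling, content up to {ceiling:.0f} Hz")
```

Under these assumptions, the enhanced path tops out at 8 kHz of audio bandwidth, while the unenhanced path can carry content up to 22.05 kHz. This is one way to read the "quality ceiling" the change describes.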