zhouyx1998 commited on
Commit
ec9e5b4
ยท
verified ยท
1 Parent(s): ea5b234

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -42
README.md CHANGED
@@ -222,16 +222,6 @@ VoxCPM achieves competitive results on public zero-shot TTS benchmarks:
222
  | **VoxCPM** | **3.40** | **4.04** | 12.9 | 66.1 | 3.59 | **7.89** | 64.3 | 3.74 |
223
 
224
 
225
-
226
-
227
-
228
-
229
-
230
-
231
-
232
-
233
-
234
-
235
  ## โš ๏ธ Risks and limitations
236
  - General Model Behavior: While VoxCPM has been trained on a large-scale dataset, it may still produce outputs that are unexpected, biased, or contain artifacts.
237
  - Potential for Misuse of Voice Cloning: VoxCPM's powerful zero-shot voice cloning capability can generate highly realistic synthetic speech. This technology could be misused for creating convincing deepfakes for purposes of impersonation, fraud, or spreading disinformation. Users of this model must not use it to create content that infringes upon the rights of individuals. It is strictly forbidden to use VoxCPM for any illegal or unethical purposes. We strongly recommend that any publicly shared content generated with this model be clearly marked as AI-generated.
@@ -242,37 +232,6 @@ VoxCPM achieves competitive results on public zero-shot TTS benchmarks:
242
 
243
 
244
  ## ๐Ÿ“„ License
245
- The VoxCPM model weights and code are open-sourced under the [Apache-2.0](LICENSE) license.
246
-
247
- ## ๐Ÿ™ Acknowledgments
248
-
249
- We extend our sincere gratitude to the following works and resources for their inspiration and contributions:
250
-
251
- - [DiTAR](https://arxiv.org/abs/2502.03930) for the diffusion autoregressive backbone used in speech generation
252
- - [MiniCPM-4](https://github.com/OpenBMB/MiniCPM) for serving as the language model foundation
253
- - [CosyVoice](https://github.com/FunAudioLLM/CosyVoice) for the implementation of Flow Matching-based LocDiT
254
- - [DAC](https://github.com/descriptinc/descript-audio-codec) for providing the Audio VAE backbone
255
-
256
- ## Institutions
257
-
258
- This project is developed by the following institutions:
259
- - <img src="assets/modelbest_logo.png" width="28px"> [ModelBest](https://modelbest.cn/)
260
-
261
- - <img src="assets/thuhcsi_logo.png" width="28px"> [THUHCSI](https://github.com/thuhcsi)
262
-
263
-
264
-
265
-
266
- ## ๐Ÿ“š Citation
267
 
268
- If you find our model helpful, please consider citing our projects ๐Ÿ“ and staring us โญ๏ธ๏ผ
269
 
270
- ```bib
271
- @misc{voxcpm2025,
272
- author = {{Yixuan Zhou, Guoyang Zeng, Xin Liu, Xiang Li, Renjie Yu, Ziyang Wang, Runchuan Ye, Weiyue Sun, Jiancheng Gui, Kehan Li, Zhiyong Wu, Zhiyuan Liu}},
273
- title = {{VoxCPM}},
274
- year = {2025},
275
- publish = {\url{https://github.com/OpenBMB/VoxCPM}},
276
- note = {GitHub repository}
277
- }
278
- ```
 
222
  | **VoxCPM** | **3.40** | **4.04** | 12.9 | 66.1 | 3.59 | **7.89** | 64.3 | 3.74 |
223
 
224
 
 
 
 
 
 
 
 
 
 
 
225
  ## โš ๏ธ Risks and limitations
226
  - General Model Behavior: While VoxCPM has been trained on a large-scale dataset, it may still produce outputs that are unexpected, biased, or contain artifacts.
227
  - Potential for Misuse of Voice Cloning: VoxCPM's powerful zero-shot voice cloning capability can generate highly realistic synthetic speech. This technology could be misused for creating convincing deepfakes for purposes of impersonation, fraud, or spreading disinformation. Users of this model must not use it to create content that infringes upon the rights of individuals. It is strictly forbidden to use VoxCPM for any illegal or unethical purposes. We strongly recommend that any publicly shared content generated with this model be clearly marked as AI-generated.
 
232
 
233
 
234
  ## ๐Ÿ“„ License
235
+ The VoxCPM model weights and code are open-sourced under the Apache-2.0 license.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
236
 
 
237