drbaph/MegaTTS3-WaveVAE
Tags: Text-to-Speech, Transformers, Safetensors, PyTorch, tts, voice-cloning, speech-synthesis, audio, chinese, english, zero-shot, diffusion
arxiv: 2502.18924
arxiv: 2408.16532
License: apache-2.0
Revision: 4bd73f1
MegaTTS3-WaveVAE, 4.39 GB, 1 contributor, History: 3 commits
Latest commit: drbaph, "Upload 24 files", 4bd73f1 (verified), 6 months ago
Name                   Scan   Size       Last commit       Updated
aligner_lm             -      -          Upload 24 files   6 months ago
diffusion_transformer  -      -          Upload 24 files   6 months ago
duration_lm            -      -          Upload 24 files   6 months ago
g2p                    -      -          Upload 24 files   6 months ago
wavvae                 -      -          Upload 24 files   6 months ago
.gitattributes         Safe   1.57 kB    Upload 24 files   6 months ago
.msc                   Safe   1.81 kB    Upload 24 files   6 months ago
.mv                    Safe   36 Bytes   Upload 24 files   6 months ago
README.md              -      6.57 kB    Upload 24 files   6 months ago
config.json            Safe   68 Bytes   Upload 24 files   6 months ago
configuration.json     Safe   72 Bytes   Upload 24 files   6 months ago