Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
drbaph
/
MegaTTS3-WaveVAE
like
2
Text-to-Speech
Transformers
Safetensors
PyTorch
tts
voice-cloning
speech-synthesis
audio
chinese
english
zero-shot
diffusion
arxiv:
2502.18924
arxiv:
2408.16532
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
MegaTTS3-WaveVAE
4.39 GB
1 contributor
History:
6 commits
drbaph
Update README.md
e1e4a7a
verified
2 months ago
aligner_lm
Upload 24 files
2 months ago
diffusion_transformer
Upload 24 files
2 months ago
duration_lm
Upload 24 files
2 months ago
g2p
Upload 24 files
2 months ago
wavvae
Upload 24 files
2 months ago
.gitattributes
Safe
1.57 kB
Upload 24 files
2 months ago
.msc
Safe
1.81 kB
Upload 24 files
2 months ago
.mv
Safe
36 Bytes
Upload 24 files
2 months ago
README.md
Safe
4.02 kB
Update README.md
2 months ago
config.json
Safe
68 Bytes
Upload 24 files
2 months ago
configuration.json
Safe
72 Bytes
Upload 24 files
2 months ago