drbaph/MegaTTS3-WaveVAE
Tags: Text-to-Speech, Transformers, Safetensors, PyTorch, tts, voice-cloning, speech-synthesis, audio, chinese, english, zero-shot, diffusion, arxiv:2502.18924, arxiv:2408.16532
License: apache-2.0
Branch: main
Path: MegaTTS3-WaveVAE/g2p
Size: 1.04 GB
1 contributor
History: 1 commit
Latest commit: 4bd73f1 (verified), "Upload 24 files", by drbaph, 2 months ago
Files (each marked Safe; last commit "Upload 24 files", 2 months ago):

File                      Size        Storage
added_tokens.json         574 kB
config.json               752 Bytes
generation_config.json    117 Bytes
latest                    16 Bytes
merges.txt                1.67 MB
model.safetensors         1.02 GB     xet
special_tokens_map.json   616 Bytes
tokenizer.json            14.8 MB     xet
tokenizer_config.json     3.21 MB
trainer_state.json        790 kB
vocab.json                2.78 MB