YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Russian Vosk TTS model
Version 0.9
Metrics:
CER 0.6 FAD 0.810 UTMOS 3.290 Speaker Similarity 0.875 xRT CPU 0.35 xRT GPU 0.06
License: Apache 2.0
Changelog:
- ASR alignment
- No encoder, just duration predictor
- Slightly thinner predictor width (160) to fit DiT hidden vector
- Scale for diffusion loss (to not dominate on duration loss)
- Downloads last month
- 440
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support