File size: 1,194 Bytes
36e015a 122c149 36e015a f817a3b 2ab0b09 1ef8ecb e0e22e1 78761f0 104e79a 78761f0 e0e22e1 fafc1c5 77d14db 78761f0 104e79a 78761f0 e888a46 e0e22e1 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 |
---
license: cc-by-nc-4.0
datasets:
- mozilla-foundation/common_voice_17_0
- facebook/voxpopuli
- mrfakename/librivox-full-catalog-archive
language:
- fi
base_model:
- SWivid/F5-TTS
pipeline_tag: text-to-speech
---
Here are two Finnish models of the F5-TTS, listen speech samples for both models.
The Common Voice and Vox Populi Finnish datasets are used for the first round.
- 20241206
- Epochs: 200
- Speakers: Multiple speakers from different corpus
- Use these with "f5-tts_infer-gradio":
Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_common_voice_fi_vox_populi_fi_20241206.safetensors
Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/vocab.txt
The second round is based on the Common Voice, LibriVox and Vox Populi Finnish data sets.
- 20241217
- Epochs: 200
- Speakers: Multiple speakers from different corpus
- Use these with "f5-tts_infer-gradio":
Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20241217/model_last_20241217.safetensors
Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20241217/vocab.txt
Numbers cannot be understood by both models. Convert numbers in words. |