metadata
license: cc-by-nc-sa-4.0
datasets:
- mozilla-foundation/common_voice_17_0
- bond005/sberdevices_golos_10h_crowd
- bond005/sova_rudevices
- Aniemore/resd_annotated
language:
- ru
base_model:
- SWivid/F5-TTS
Overview
The F5-TTS model is finetuned specifically for Russian language
License
This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution
Model Information
Base Model: SWivid/F5-TTS
Total Training Duration: 250.000 steps
Training Configuration:
"exp_name": "F5TTS_Base",
"learning_rate": 1e-05,
"batch_size_per_gpu": 4500,
"batch_size_type": "frame",
"max_samples": 64,
"grad_accumulation_steps": 1,
"max_grad_norm": 1,
"epochs": 144,
"num_warmup_updates": 5838,
"save_per_updates": 11676,
"last_per_steps": 2918,
"finetune": true,
"file_checkpoint_train": "",
"tokenizer_type": "char",
"tokenizer_file": "",
"mixed_precision": "fp16",
"logger": "wandb",
"bnb_optimizer": true
Usage Instructions
Go to base repo
To do
- Correct stressmarks
- English support