---
license: cc-by-nc-sa-4.0
datasets:
- mozilla-foundation/common_voice_17_0
- bond005/sberdevices_golos_10h_crowd
- bond005/sberdevices_golos_100h_farfield
- bond005/sova_rudevices
- Aniemore/resd_annotated
language:
- ru
- en
base_model:
- SWivid/F5-TTS
---
## Overview
This F5-TTS model is fine-tuned for Russian and English.

## License
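This model is released under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 license (CC BY-NC-SA 4.0), which allows free use, modification, and distribution for non-commercial purposes, provided that attribution is given and derivatives are shared under the same license.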

## Model Information
**Base Model:** SWivid/F5-TTS  
**Training Duration:** 813k steps  
**Dataset Duration:** 100k hours

## Training charts
![Training loss](loss.svg)
![Learning rate](lr.svg)

## Training Configuration
```json
{
    "exp_name": "F5TTS_Base",
    "learning_rate": 1e-05,
    "batch_size_per_gpu": 5000,
    "batch_size_type": "frame",
    "max_samples": 64,
    "grad_accumulation_steps": 1,
    "max_grad_norm": 1,
    "epochs": 1,
    "num_warmup_updates": 405764,
    "save_per_updates": 811528,
    "keep_last_n_checkpoints": 5,
    "last_per_updates": 10000,
    "finetune": true,
    "file_checkpoint_train": "",
    "tokenizer_type": "char",
    "tokenizer_file": "",
    "mixed_precision": "fp16",
    "logger": "wandb",
    "bnb_optimizer": true
}
```
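
To make the numbers in this configuration easier to read, here is a minimal sketch that derives two quantities from it: how the warmup length compares to the checkpoint save interval, and roughly how much audio a single per-GPU batch holds. The sample rate (24 kHz) and hop length (256) are assumptions taken from the base F5-TTS mel settings and are not stated in this model card; `summarize` is a hypothetical helper written only for this illustration.

```python
import json

# Values copied from the training configuration above.
CONFIG_JSON = """
{
    "learning_rate": 1e-05,
    "batch_size_per_gpu": 5000,
    "batch_size_type": "frame",
    "num_warmup_updates": 405764,
    "save_per_updates": 811528
}
"""

# Assumptions (not stated in this model card): base F5-TTS mel settings.
ASSUMED_SAMPLE_RATE = 24_000  # Hz
ASSUMED_HOP_LENGTH = 256      # audio samples per mel frame


def summarize(config: dict) -> dict:
    """Derive readable quantities from a frame-based training config."""
    if config["batch_size_type"] != "frame":
        raise ValueError("this sketch only handles frame-based batch sizes")
    frames_per_second = ASSUMED_SAMPLE_RATE / ASSUMED_HOP_LENGTH
    return {
        # Warmup length relative to the checkpoint save interval.
        "warmup_fraction_of_save_interval": (
            config["num_warmup_updates"] / config["save_per_updates"]
        ),
        # Approximate seconds of audio in one per-GPU batch.
        "audio_seconds_per_gpu_batch": (
            config["batch_size_per_gpu"] / frames_per_second
        ),
    }


summary = summarize(json.loads(CONFIG_JSON))
print(summary)
```

Under these assumptions, the warmup count is exactly half of the save interval, and a batch of 5000 frames corresponds to a little under a minute of audio per GPU.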

## Usage Instructions
See the [base repository](https://github.com/SWivid/F5-TTS) for usage instructions.

## To do
- Post requests in the Community tab

## Other links
- [GitHub repo](https://github.com/HotDro4illa/F5-TTS)