---
library_name: TTS
task: text-to-speech
tags:
- Persian
- TTS
- Farsi
- Coqui
- CoquiTTS
- pytorch
- audio
- text-to-speech
language: fa
datasets:
- persian-tts-dataset
widget:
- text: .زندگی فقط یک بار است؛ از آن به خوبی استفاده کن
  example_title: Hello, this is a test run.
license: openrail
pipeline_tag: text-to-speech
---

# **persian-tts-male-vits**

- A VITS text-to-speech model for Persian with a male voice.
- Language: Persian (فارسی)
- Single-speaker male voice
- Fine-tuned from the **[persian-tts-female-vits](https://huggingface.co/Kamtera/persian-tts-female-vits)** model on the **[persian-tts-dataset-male](https://www.kaggle.com/datasets/magnoliasis/persian-tts-dataset)** dataset
- [GitHub Repo](https://github.com/karim23657/Persian-tts-coqui)
- <span style="color: #0072ff;font-weight: bold;">[Demo](https://huggingface.co/spaces/Kamtera/Persian-tts-CoquiTTS)</span>

# **Info**

This model was fine-tuned from the <span style="color: #d0212d;font-weight: bold;">[persian-tts-female-vits](https://huggingface.co/Kamtera/persian-tts-female-vits)</span>
model on the <span style="color: #d0212d;font-weight: bold;">[persian-tts-dataset-male](https://www.kaggle.com/datasets/magnoliasis/persian-tts-dataset)</span>
dataset.
  
# **Uses**

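Install dependencies (the `!` prefix runs a shell command from a notebook cell; drop it when working in a regular terminal):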
```python
!pip install TTS
!sudo apt-get -y install espeak-ng
```

Generate audio from text:

##### Using the CLI:
```python
!tts --text "زندگی فقط یک بار است؛ از آن به خوبی استفاده کن" \
     --model_path "best_model_91323.pth" \
     --config_path "config.json" \
     --out_path "speech1.wav"
```
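
To synthesize several sentences in one go, the same CLI call can be driven from Python. The following is a minimal sketch, assuming the `tts` command is on your `PATH` and that the checkpoint and config files sit in the working directory; the helper names `build_tts_command` and `synthesize_all` are hypothetical and not part of the TTS package.

```python
import subprocess

# File names taken from the CLI example above; adjust to your local paths.
MODEL_PATH = "best_model_91323.pth"
CONFIG_PATH = "config.json"


def build_tts_command(text, out_path, model_path=MODEL_PATH, config_path=CONFIG_PATH):
    """Return the argument list for one `tts` CLI call (hypothetical helper)."""
    return [
        "tts",
        "--text", text,
        "--model_path", model_path,
        "--config_path", config_path,
        "--out_path", out_path,
    ]


def synthesize_all(sentences, prefix="speech"):
    """Run the CLI once per sentence, writing speech1.wav, speech2.wav, ..."""
    out_paths = []
    for i, sentence in enumerate(sentences, start=1):
        out_path = f"{prefix}{i}.wav"
        # check=True raises CalledProcessError if the CLI exits with an error.
        subprocess.run(build_tts_command(sentence, out_path), check=True)
        out_paths.append(out_path)
    return out_paths
```

Calling `synthesize_all([...])` with a list of Persian sentences returns the list of WAV paths it wrote. Passing the arguments as a list avoids shell quoting problems with non-ASCII text.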
##### Using the Python API:

```python
from TTS.utils.synthesizer import Synthesizer

# Paths to the model checkpoint (.pth) and its config.json.
# Absolute paths are the safest choice.
model_path = "best_model_91323.pth"
config_path = "config.json"

text = "زندگی فقط یک بار است؛ از آن به خوبی استفاده کن."

# Load the model, synthesize the waveform, and write it to a WAV file.
synthesizer = Synthesizer(model_path, config_path)
wavs = synthesizer.tts(text)
synthesizer.save_wav(wavs, "sp.wav")
```
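
The examples above expect `best_model_91323.pth` and `config.json` to be available locally. One way to fetch them is with `hf_hub_download` from the `huggingface_hub` package. This is a sketch under stated assumptions: the repository id `Kamtera/persian-tts-male-vits` is inferred from this card's title and the author's other model, and `fetch_model_files` is a hypothetical helper, so verify both against the actual repository.

```python
# Assumption: the model files live in this Hub repository (not confirmed by this card).
REPO_ID = "Kamtera/persian-tts-male-vits"
FILENAMES = ["best_model_91323.pth", "config.json"]


def fetch_model_files(repo_id=REPO_ID, filenames=FILENAMES, download=None):
    """Download each file and return a {filename: local_path} mapping.

    `download` defaults to huggingface_hub.hf_hub_download; it is a parameter
    only so that a stand-in can be supplied when testing offline.
    """
    if download is None:
        # Imported lazily so the helper can be defined without the package installed.
        from huggingface_hub import hf_hub_download
        download = hf_hub_download
    return {name: download(repo_id=repo_id, filename=name) for name in filenames}
```

The returned paths can then be passed to the synthesizer, for example `Synthesizer(paths["best_model_91323.pth"], paths["config.json"])` where `paths = fetch_model_files()`.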
Play the audio in a notebook:


```python
from IPython.display import Audio

# As the last expression in a notebook cell, this renders an audio player.
Audio("sp.wav")
```


- **Hours used:** 10
- **Cloud Provider:** Kaggle


