bene-ges commited on
Commit
fc19722
1 Parent(s): 471f3fb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -0
README.md CHANGED
@@ -1,3 +1,30 @@
1
  ---
2
  license: cc-by-sa-4.0
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: cc-by-sa-4.0
3
+ language:
4
+ - ru
5
+ library_name: nemo
6
+ tags:
7
+ - tts
8
+ - text-to-speech
9
+ - Vocoder
10
  ---
11
+
12
+ ### Input
13
+
14
+ This model accepts batches of mel spectrograms.
15
+
16
+ ### Output
17
+
18
+ This model outputs audio at 22050Hz.
19
+
20
+ ## Training
21
+
22
+ The NeMo toolkit [1] was used for training the model for several epochs.
23
+
24
+ ### Datasets
25
+
26
+ This model is trained on [RUSLAN](https://ruslan-corpus.github.io/) [2] corpus (single speaker, male voice) sampled at 22050Hz.
27
+
28
+ ## References
29
+ - [1] [NVIDIA NeMo Toolkit](https://github.com/NVIDIA/NeMo)
30
+ - [2] Gabdrakhmanov L., Garaev R., Razinkov E. (2019) RUSLAN: Russian Spoken Language Corpus for Speech Synthesis. In: Salah A., Karpov A., Potapova R. (eds) Speech and Computer. SPECOM 2019. Lecture Notes in Computer Science, vol 11658. Springer, Cham