patriotyk
/

vocos-mel-hifigan-compat-44100khz

Model card Files Files and versions Metrics Training metrics Community

patriotyk commited on May 11, 2024

Commit

142e7e2

·

verified ·

1 Parent(s): d01f44f

Add metrics

Files changed (1) hide show

README.md +29 -1

README.md CHANGED Viewed

@@ -66,4 +66,32 @@ We where using two RTX-3090 video cards for training, and it took about one mont
 * mel_loss_coeff: 45
 * mrd_loss_coeff: 1.0
 * batch_size: 20
-* num_samples: 32768

 * mel_loss_coeff: 45
 * mrd_loss_coeff: 1.0
 * batch_size: 20
+* num_samples: 32768
+## Evaluation
+Evaluation was done using the metrics on the original repo, after 210 epochs we achieve:
+* val_loss: 3.703
+* f1_score: 0.950
+* mel_loss: 0.248
+* periodicity_loss:0.127
+* pesq_score: 3.399
+* pitch_loss: 38.26
+* utmos_score: 3.146
+## Citation
+If this code contributes to your research, please cite the work:
+```
+@article{siuzdak2023vocos,
+  title={Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis},
+  author={Siuzdak, Hubert},
+  journal={arXiv preprint arXiv:2306.00814},
+  year={2023}
+}
+```