BSC-LT
/

vocos-mel-22khz

Model card Files Files and versions Community

wetdog commited on Mar 25, 2024

Commit

452f58b

·

verified ·

1 Parent(s): c65ff3e

Update README.md

Files changed (1) hide show

README.md +11 -24

README.md CHANGED Viewed

@@ -103,8 +103,8 @@ We also modified the mel spectrogram loss to use 128 bins and fmax of 11025 inst
 * initial_learning_rate: 5e-4
 * scheduler: cosine without warmup or restarts
-*  mel_loss_coeff: 45
-*  mrd_loss_coeff: 0.1
 * batch_size: 16
 * num_samples: 16384
@@ -112,27 +112,15 @@ We also modified the mel spectrogram loss to use 128 bins and fmax of 11025 inst
 <!-- This section describes the evaluation protocols and provides the results. -->
-### Testing Data, Factors & Metrics
-#### Testing Data
-<!-- This should link to a Dataset Card if possible. -->
-[More Information Needed]
-#### Metrics
-<!-- These are the evaluation metrics being used, ideally with a description of why. -->
-[More Information Needed]
-### Results
-[More Information Needed]
-#### Summary
 ## Citation
@@ -165,6 +153,5 @@ Copyright(c) 2024 by Language Technologies Unit, Barcelona Supercomputing Center
 [MIT](https://opensource.org/license/mit)
 ### Funding
-This work was funded by:
-  - The [Departament de la Vicepresidència i de Polítiques Digitals i Territori de la Generalitat de Catalunya](https://politiquesdigitals.gencat.cat/ca/inici/index.html#googtrans(ca|en) within the framework of [Projecte AINA](https://politiquesdigitals.gencat.cat/ca/economia/catalonia-ai/aina).

 * initial_learning_rate: 5e-4
 * scheduler: cosine without warmup or restarts
+* mel_loss_coeff: 45
+* mrd_loss_coeff: 0.1
 * batch_size: 16
 * num_samples: 16384
 <!-- This section describes the evaluation protocols and provides the results. -->
+Evaluation was done using the metrics on the original repo, after 183 epochs we achieve:
+* val_loss: 3.81
+* f1_score: 0.94
+* mel_loss: 0.25
+* periodicity_loss:0.132
+* pesq_score: 3.16
+* pitch_loss: 38.11
+* utmos_score: 3.27
 ## Citation
 [MIT](https://opensource.org/license/mit)
 ### Funding
+This work has been promoted and financed by the Generalitat de Catalunya through the [Aina project](https://projecteaina.cat/).