Spaces:

rjzevallos
/

test_app

Sleeping

rjzevallos commited on Nov 25, 2024

Commit

4e61db1

•

1 Parent(s): bf10d7f

Update app.py

Files changed (1) hide show

app.py CHANGED Viewed

@@ -25,7 +25,7 @@ Models in the leaderboard are evaluated using several key metrics:
 These metrics help evaluate both the accuracy and quality of the model.
-### UTMOS (UTokyo-SaruLab Mean Opinion Score)[Paper](https://arxiv.org/abs/2204.02152)
 UTMOS is a MOS prediction system. **A higher UTMOS indicates better quality** of the generated voice.
@@ -45,10 +45,10 @@ The WER calculation is done as follows:
 WER = (S + I + D) / N = (1 + 0 + 1) / 6 = 0.333
 ```
-### STOI (Short-Time Objective Intelligibility)[Paper](https://ieeexplore.ieee.org/abstract/document/5495701?casa_token=PLtqLc8KNAgAAAAA:FOLuZ4dgMYsnGb1dQHgqVOouQzRJ3vA5yqj-sbwf8gs9Q-AIDCLkMZzAgzRrAogwwxULK9zsYeE)
 STOI measures the intelligibility of the synthesized speech signal compared to the original signal. **A higher STOI indicates better intelligibility**.
-### PESQ (Perceptual Evaluation of Speech Quality)[Paper](https://ieeexplore.ieee.org/abstract/document/941023?casa_token=jdtHy84_KhQAAAAA:qHN3WbT6cNdufj6OOn_fn0Je0RedMv-WJCmhQ_3CWy4nMTuDvFMF3KstAmKqLx5suQwdPgGByoY)
 PESQ is a perceptual metric that evaluates the quality of speech in a similar manner to how a human listener would. **A higher PESQ indicates better voice quality**.

 These metrics help evaluate both the accuracy and quality of the model.
+### UTMOS (UTokyo-SaruLab Mean Opinion Score)[[Paper](https://arxiv.org/abs/2204.02152)]
 UTMOS is a MOS prediction system. **A higher UTMOS indicates better quality** of the generated voice.
 WER = (S + I + D) / N = (1 + 0 + 1) / 6 = 0.333
 ```
+### STOI (Short-Time Objective Intelligibility)[[Paper](https://ieeexplore.ieee.org/abstract/document/5495701?casa_token=PLtqLc8KNAgAAAAA:FOLuZ4dgMYsnGb1dQHgqVOouQzRJ3vA5yqj-sbwf8gs9Q-AIDCLkMZzAgzRrAogwwxULK9zsYeE)]
 STOI measures the intelligibility of the synthesized speech signal compared to the original signal. **A higher STOI indicates better intelligibility**.
+### PESQ (Perceptual Evaluation of Speech Quality)[[Paper](https://ieeexplore.ieee.org/abstract/document/941023?casa_token=jdtHy84_KhQAAAAA:qHN3WbT6cNdufj6OOn_fn0Je0RedMv-WJCmhQ_3CWy4nMTuDvFMF3KstAmKqLx5suQwdPgGByoY)]
 PESQ is a perceptual metric that evaluates the quality of speech in a similar manner to how a human listener would. **A higher PESQ indicates better voice quality**.