Spaces:
Sleeping
Sleeping
rjzevallos
commited on
Commit
•
4e61db1
1
Parent(s):
bf10d7f
Update app.py
Browse files
app.py
CHANGED
@@ -25,7 +25,7 @@ Models in the leaderboard are evaluated using several key metrics:
|
|
25 |
|
26 |
These metrics help evaluate both the accuracy and quality of the model.
|
27 |
|
28 |
-
### UTMOS (UTokyo-SaruLab Mean Opinion Score)[Paper](https://arxiv.org/abs/2204.02152)
|
29 |
UTMOS is a MOS prediction system. **A higher UTMOS indicates better quality** of the generated voice.
|
30 |
|
31 |
|
@@ -45,10 +45,10 @@ The WER calculation is done as follows:
|
|
45 |
WER = (S + I + D) / N = (1 + 0 + 1) / 6 = 0.333
|
46 |
```
|
47 |
|
48 |
-
### STOI (Short-Time Objective Intelligibility)[Paper](https://ieeexplore.ieee.org/abstract/document/5495701?casa_token=PLtqLc8KNAgAAAAA:FOLuZ4dgMYsnGb1dQHgqVOouQzRJ3vA5yqj-sbwf8gs9Q-AIDCLkMZzAgzRrAogwwxULK9zsYeE)
|
49 |
STOI measures the intelligibility of the synthesized speech signal compared to the original signal. **A higher STOI indicates better intelligibility**.
|
50 |
|
51 |
-
### PESQ (Perceptual Evaluation of Speech Quality)[Paper](https://ieeexplore.ieee.org/abstract/document/941023?casa_token=jdtHy84_KhQAAAAA:qHN3WbT6cNdufj6OOn_fn0Je0RedMv-WJCmhQ_3CWy4nMTuDvFMF3KstAmKqLx5suQwdPgGByoY)
|
52 |
PESQ is a perceptual metric that evaluates the quality of speech in a similar manner to how a human listener would. **A higher PESQ indicates better voice quality**.
|
53 |
|
54 |
|
|
|
25 |
|
26 |
These metrics help evaluate both the accuracy and quality of the model.
|
27 |
|
28 |
+
### UTMOS (UTokyo-SaruLab Mean Opinion Score)[[Paper](https://arxiv.org/abs/2204.02152)]
|
29 |
UTMOS is a MOS prediction system. **A higher UTMOS indicates better quality** of the generated voice.
|
30 |
|
31 |
|
|
|
45 |
WER = (S + I + D) / N = (1 + 0 + 1) / 6 = 0.333
|
46 |
```
|
47 |
|
48 |
+
### STOI (Short-Time Objective Intelligibility)[[Paper](https://ieeexplore.ieee.org/abstract/document/5495701?casa_token=PLtqLc8KNAgAAAAA:FOLuZ4dgMYsnGb1dQHgqVOouQzRJ3vA5yqj-sbwf8gs9Q-AIDCLkMZzAgzRrAogwwxULK9zsYeE)]
|
49 |
STOI measures the intelligibility of the synthesized speech signal compared to the original signal. **A higher STOI indicates better intelligibility**.
|
50 |
|
51 |
+
### PESQ (Perceptual Evaluation of Speech Quality)[[Paper](https://ieeexplore.ieee.org/abstract/document/941023?casa_token=jdtHy84_KhQAAAAA:qHN3WbT6cNdufj6OOn_fn0Je0RedMv-WJCmhQ_3CWy4nMTuDvFMF3KstAmKqLx5suQwdPgGByoY)]
|
52 |
PESQ is a perceptual metric that evaluates the quality of speech in a similar manner to how a human listener would. **A higher PESQ indicates better voice quality**.
|
53 |
|
54 |
|