jonatasgrosman
commited on
Commit
•
e3bbe4d
1
Parent(s):
9ac1ee5
update README
Browse files
README.md
CHANGED
@@ -159,7 +159,7 @@ print(f"CER: {cer.compute(predictions=predictions, references=references, chunk_
|
|
159 |
|
160 |
**Test Result**:
|
161 |
|
162 |
-
In the table below I report the Word Error Rate (WER) and the Character Error Rate (CER) of the model. I ran the evaluation script described above on other models as well (on 2021-05-20). Note that the table below may show different results from those already reported, this may have been caused due to some specificity of the other evaluation scripts used.
|
163 |
|
164 |
---
|
165 |
|
@@ -167,7 +167,7 @@ In the table below I report the Word Error Rate (WER) and the Character Error Ra
|
|
167 |
|
168 |
| Model | WER | CER |
|
169 |
| ------------- | ------------- | ------------- |
|
170 |
-
| jonatasgrosman/wav2vec2-large-xlsr-53-english | **19.
|
171 |
| jonatasgrosman/wav2vec2-large-english | 21.16% | 9.53% |
|
172 |
| facebook/wav2vec2-large-960h-lv60-self | 22.03% | 10.39% |
|
173 |
| facebook/wav2vec2-large-960h-lv60 | 23.97% | 11.14% |
|
@@ -189,8 +189,8 @@ In the table below I report the Word Error Rate (WER) and the Character Error Ra
|
|
189 |
| facebook/wav2vec2-large-960h-lv60 | 2.15% | 0.61% |
|
190 |
| facebook/wav2vec2-large-960h | 2.82% | 0.84% |
|
191 |
| facebook/wav2vec2-base-960h | 3.44% | 1.06% |
|
|
|
192 |
| facebook/wav2vec2-base-100h | 6.26% | 2.00% |
|
193 |
-
| jonatasgrosman/wav2vec2-large-xlsr-53-english | 6.97% | 2.02% |
|
194 |
| jonatasgrosman/wav2vec2-large-english | 8.00% | 2.55% |
|
195 |
| elgeish/wav2vec2-large-lv60-timit-asr | 15.53% | 4.93% |
|
196 |
| boris/xlsr-en-punctuation | 19.28% | 6.45% |
|
@@ -206,8 +206,8 @@ In the table below I report the Word Error Rate (WER) and the Character Error Ra
|
|
206 |
| facebook/wav2vec2-large-960h-lv60-self | **3.89%** | **1.40%** |
|
207 |
| facebook/wav2vec2-large-960h-lv60 | 4.45% | 1.56% |
|
208 |
| facebook/wav2vec2-large-960h | 6.49% | 2.52% |
|
|
|
209 |
| facebook/wav2vec2-base-960h | 8.90% | 3.55% |
|
210 |
-
| jonatasgrosman/wav2vec2-large-xlsr-53-english | 11.75% | 4.23% |
|
211 |
| jonatasgrosman/wav2vec2-large-english | 13.62% | 5.24% |
|
212 |
| facebook/wav2vec2-base-100h | 13.97% | 5.51% |
|
213 |
| boris/xlsr-en-punctuation | 26.40% | 10.11% |
|
@@ -223,13 +223,12 @@ In the table below I report the Word Error Rate (WER) and the Character Error Ra
|
|
223 |
| ------------- | ------------- | ------------- |
|
224 |
| facebook/wav2vec2-large-960h-lv60-self | **5.17%** | **1.33%** |
|
225 |
| facebook/wav2vec2-large-960h-lv60 | 6.24% | 1.54% |
|
|
|
226 |
| facebook/wav2vec2-large-960h | 9.63% | 2.19% |
|
227 |
| facebook/wav2vec2-base-960h | 11.48% | 2.76% |
|
228 |
-
| jonatasgrosman/wav2vec2-large-xlsr-53-english | 11.93% | 3.50% |
|
229 |
| elgeish/wav2vec2-large-lv60-timit-asr | 13.83% | 4.36% |
|
230 |
| jonatasgrosman/wav2vec2-large-english | 13.91% | 4.01% |
|
231 |
| facebook/wav2vec2-base-100h | 16.75% | 4.79% |
|
232 |
| elgeish/wav2vec2-base-timit-asr | 25.40% | 8.16% |
|
233 |
| boris/xlsr-en-punctuation | 25.93% | 9.99% |
|
234 |
| facebook/wav2vec2-base-10k-voxpopuli-ft-en | 51.08% | 19.84% |
|
235 |
-
|
|
|
159 |
|
160 |
**Test Result**:
|
161 |
|
162 |
+
In the table below I report the Word Error Rate (WER) and the Character Error Rate (CER) of the model. I ran the evaluation script described above on other models as well (on 2021-05-20). Note that the table below may show different results from those already reported, this may have been caused due to some specificity of the other evaluation scripts used... I've also tested the model using the LibriSpeech and TIMIT datasets, which are better-behaved datasets than the Common Voice, containing only examples in US English extracted from audiobooks.
|
163 |
|
164 |
---
|
165 |
|
|
|
167 |
|
168 |
| Model | WER | CER |
|
169 |
| ------------- | ------------- | ------------- |
|
170 |
+
| jonatasgrosman/wav2vec2-large-xlsr-53-english | **19.76%** | **8.60%** |
|
171 |
| jonatasgrosman/wav2vec2-large-english | 21.16% | 9.53% |
|
172 |
| facebook/wav2vec2-large-960h-lv60-self | 22.03% | 10.39% |
|
173 |
| facebook/wav2vec2-large-960h-lv60 | 23.97% | 11.14% |
|
|
|
189 |
| facebook/wav2vec2-large-960h-lv60 | 2.15% | 0.61% |
|
190 |
| facebook/wav2vec2-large-960h | 2.82% | 0.84% |
|
191 |
| facebook/wav2vec2-base-960h | 3.44% | 1.06% |
|
192 |
+
| jonatasgrosman/wav2vec2-large-xlsr-53-english | 4.16% | 1.28% |
|
193 |
| facebook/wav2vec2-base-100h | 6.26% | 2.00% |
|
|
|
194 |
| jonatasgrosman/wav2vec2-large-english | 8.00% | 2.55% |
|
195 |
| elgeish/wav2vec2-large-lv60-timit-asr | 15.53% | 4.93% |
|
196 |
| boris/xlsr-en-punctuation | 19.28% | 6.45% |
|
|
|
206 |
| facebook/wav2vec2-large-960h-lv60-self | **3.89%** | **1.40%** |
|
207 |
| facebook/wav2vec2-large-960h-lv60 | 4.45% | 1.56% |
|
208 |
| facebook/wav2vec2-large-960h | 6.49% | 2.52% |
|
209 |
+
| jonatasgrosman/wav2vec2-large-xlsr-53-english | 8.82% | 3.42% |
|
210 |
| facebook/wav2vec2-base-960h | 8.90% | 3.55% |
|
|
|
211 |
| jonatasgrosman/wav2vec2-large-english | 13.62% | 5.24% |
|
212 |
| facebook/wav2vec2-base-100h | 13.97% | 5.51% |
|
213 |
| boris/xlsr-en-punctuation | 26.40% | 10.11% |
|
|
|
223 |
| ------------- | ------------- | ------------- |
|
224 |
| facebook/wav2vec2-large-960h-lv60-self | **5.17%** | **1.33%** |
|
225 |
| facebook/wav2vec2-large-960h-lv60 | 6.24% | 1.54% |
|
226 |
+
| jonatasgrosman/wav2vec2-large-xlsr-53-english | 6.81% | 2.02% |
|
227 |
| facebook/wav2vec2-large-960h | 9.63% | 2.19% |
|
228 |
| facebook/wav2vec2-base-960h | 11.48% | 2.76% |
|
|
|
229 |
| elgeish/wav2vec2-large-lv60-timit-asr | 13.83% | 4.36% |
|
230 |
| jonatasgrosman/wav2vec2-large-english | 13.91% | 4.01% |
|
231 |
| facebook/wav2vec2-base-100h | 16.75% | 4.79% |
|
232 |
| elgeish/wav2vec2-base-timit-asr | 25.40% | 8.16% |
|
233 |
| boris/xlsr-en-punctuation | 25.93% | 9.99% |
|
234 |
| facebook/wav2vec2-base-10k-voxpopuli-ft-en | 51.08% | 19.84% |
|
|