FremyCompany commited on
Commit
93b5096
1 Parent(s): f6ca04e

Improved results using a 8s + 2s chunking strategy

Browse files
README.md CHANGED
@@ -38,10 +38,10 @@ model-index:
38
  metrics:
39
  - name: Test WER
40
  type: wer
41
- value: 22.58
42
  - name: Test CER
43
  type: cer
44
- value: 11.26
45
  ---
46
 
47
  # XLS-R-based CTC model with 5-gram language model from Common Voice
38
  metrics:
39
  - name: Test WER
40
  type: wer
41
+ value: 20.79
42
  - name: Test CER
43
  type: cer
44
+ value: 10.72
45
  ---
46
 
47
  # XLS-R-based CTC model with 5-gram language model from Common Voice
eval.sh CHANGED
@@ -1,2 +1,2 @@
1
  python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset mozilla-foundation/common_voice_8_0 --config nl --split test --log_outputs
2
- python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset speech-recognition-community-v2/dev_data --config nl --split validation --chunk_length_s 5.0 --stride_length_s 1.0
1
  python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset mozilla-foundation/common_voice_8_0 --config nl --split test --log_outputs
2
+ python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset speech-recognition-community-v2/dev_data --config nl --split validation --chunk_length_s 8.0 --stride_length_s 2.0
log_speech-recognition-community-v2_dev_data_nl_validation_predictions.txt CHANGED
The diff for this file is too large to render. See raw diff
speech-recognition-community-v2_dev_data_nl_validation_eval_results.txt CHANGED
@@ -1,2 +1,2 @@
1
- WER: 0.225885510882885
2
- CER: 0.11266761318820002
1
+ WER: 0.20791270350703156
2
+ CER: 0.10724088973024136