speechbrain
/

asr-streaming-conformer-librispeech

Automatic Speech Recognition

Model card Files Files and versions Community

sdelangen commited on Feb 26

Commit

9310681

•

1 Parent(s): ad6ae83

Update README.md

Files changed (1) hide show

README.md +12 -1

README.md CHANGED Viewed

@@ -58,12 +58,23 @@ This repository provides all the necessary tools to perform automatic speech
 recognition from an end-to-end system pretrained on LibriSpeech (EN) within
 SpeechBrain. For a better experience, we encourage you to learn more about
 [SpeechBrain](https://speechbrain.github.io).
-The performance of the model is the following:
 | Release | Test clean WER | Test other WER | GPUs |
 |:-------------:|:--------------:|:--------------:|:--------:|
 | 24-02-26 | 2.72 | 3.13 | 4xA100 80GB |
 ## Pipeline description
 TODO

 recognition from an end-to-end system pretrained on LibriSpeech (EN) within
 SpeechBrain. For a better experience, we encourage you to learn more about
 [SpeechBrain](https://speechbrain.github.io).
+The performance of the model in full context mode (no streaming) is the following:
 | Release | Test clean WER | Test other WER | GPUs |
 |:-------------:|:--------------:|:--------------:|:--------:|
 | 24-02-26 | 2.72 | 3.13 | 4xA100 80GB |
+With streaming, the results with different chunk sizes on test-clean are the following:
+|       | full | cs=32 (1280ms) | 24 (960ms) | 16 (640ms) | 12 (480ms) | 8 (320ms) |
+|:-----:|:----:|:-----:|:-----:|:-----:|:-----:|:-----:|
+| full  | 2.72%| -     | -     | -     | -     | -     |
+| lc=32 | -    | 3.09% | 3.07% | 3.26% | 3.31% | 3.44% |
+| 16    | -    | 3.10% | 3.07% | 3.27% | 3.32% | 3.50% |
+| 8     | -    | 3.10% | 3.11% | 3.31% | 3.39% | 3.62% |
+| 4     | -    | 3.12% | 3.13% | 3.37% | 3.51% | 3.80% |
+| 2     | -    | 3.19% | 3.24% | 3.50% | 3.79% | 4.38% |
 ## Pipeline description
 TODO