speechbrainteam committed
Commit c4cb7e1
1 Parent(s): f348f58

Update README.md

Files changed (1): README.md (+6 -6)
README.md CHANGED
@@ -19,7 +19,7 @@ metrics:
 
 This repository provides all the necessary tools to perform automatic speech
 recognition from an end-to-end system pretrained on LibriSpeech (EN) within
-SpeechBrain. For a better experience we encourage you to learn more about
+SpeechBrain. For a better experience, we encourage you to learn more about
 [SpeechBrain](https://speechbrain.github.io). The given ASR model performance are:
 
 | Release | Test clean WER | Test other WER | GPUs |
@@ -28,21 +28,21 @@ SpeechBrain. For a better experience we encourage you to learn more about
 
 ## Pipeline description
 
-This ASR system is composed with 3 different but linked blocks:
+This ASR system is composed of 3 different but linked blocks:
 1. Tokenizer (unigram) that transforms words into subword units and trained with
 the train transcriptions of LibriSpeech.
 2. Neural language model (Transformer LM) trained on the full 10M words dataset.
 3. Acoustic model (CRDNN + CTC/Attention). The CRDNN architecture is made of
-N blocks of convolutional neural networks with normalisation and pooling on the
+N blocks of convolutional neural networks with normalization and pooling on the
 frequency domain. Then, a bidirectional LSTM with projection layers is connected
 to a final DNN to obtain the final acoustic representation that is given to
 the CTC and attention decoders.
 
 ## Intended uses & limitations
 
-This model has been primilarly developed to be run within SpeechBrain as a pretrained ASR model
-for the english language. Thanks to the flexibility of SpeechBrain, any of the 3 blocks
-detailed above can be extracted and connected to you custom pipeline as long as SpeechBrain is
+This model has been primarily developed to be run within SpeechBrain as a pretrained ASR model
+for the English language. Thanks to the flexibility of SpeechBrain, any of the 3 blocks
+detailed above can be extracted and connected to your custom pipeline as long as SpeechBrain is
 installed.
 
 ## Install SpeechBrain
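
The README text in this diff says the model is meant to be run within SpeechBrain as a pretrained ASR system. As a minimal usage sketch (not part of the commit), the snippet below uses SpeechBrain's `EncoderDecoderASR` pretrained interface; the model id `speechbrain/asr-crdnn-transformerlm-librispeech` and the audio path `example.wav` are assumptions for illustration, since the diff does not name the repository id.

```python
# Minimal sketch, assuming this repository corresponds to a CRDNN + Transformer LM
# LibriSpeech model published on the SpeechBrain Hugging Face organization.
# Install first with: pip install speechbrain

from speechbrain.pretrained import EncoderDecoderASR

# Download and cache the pretrained pipeline (tokenizer, LM, acoustic model).
# The source id below is an assumption, not taken from the diff above.
asr_model = EncoderDecoderASR.from_hparams(
    source="speechbrain/asr-crdnn-transformerlm-librispeech",
    savedir="pretrained_models/asr-crdnn-transformerlm-librispeech",
)

# Transcribe a 16 kHz mono English recording (hypothetical file path).
print(asr_model.transcribe_file("example.wav"))
```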