Commit 44212e8 by speechbrainteam
Parent(s): 6a6d157

Update README.md

README.md CHANGED
@@ -32,10 +32,10 @@ SpeechBrain. For a better experience we encourage you to learn more about
 ## Pipeline description
 
 This ASR system is composed with 3 different but linked blocks:
-
+- Tokenizer (unigram) that transforms words into subword units and trained with
 the train transcriptions of LibriSpeech.
-
-
+- Neural language model (RNNLM) trained on the full 10M words dataset.
+- Acoustic model (CRDNN + CTC/Attention). The CRDNN architecture is made of
 N blocks of convolutional neural networks with normalisation and pooling on the
 frequency domain. Then, a bidirectional LSTM is connected to a final DNN to obtain
 the final acoustic representation that is given to the CTC and attention decoders.
@@ -76,7 +76,7 @@ The SpeechBrain team does not provide any warranty on the performance achieved b
 year = {2021},
 publisher = {GitHub},
 journal = {GitHub repository},
-howpublished = {
+howpublished = {\url{https://github.com/speechbrain/speechbrain}},
 }
 ```
 
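
For readers of this diff, the acoustic-model bullet above (N convolutional blocks with pooling on the frequency axis, then a bidirectional LSTM, then a final DNN) can be traced as a sequence of tensor shapes. The sketch below is illustrative only and uses no SpeechBrain code. The class `CRDNNShape`, the function `trace_shapes`, and every hyperparameter value are hypothetical and do not come from the README. It also assumes that the convolutions preserve the time and frequency sizes, so that only pooling shrinks the frequency axis.

```python
from dataclasses import dataclass


@dataclass
class CRDNNShape:
    """Hypothetical hyperparameters; none of these values come from the README."""
    n_feats: int = 40                  # assumed input feature dimension per frame
    cnn_blocks: int = 2                # the "N" in "N blocks of convolutional neural networks"
    cnn_channels: tuple = (128, 256)   # assumed output channels of each block
    freq_pool: int = 2                 # assumed pooling factor on the frequency axis
    lstm_hidden: int = 1024            # assumed hidden size per LSTM direction
    dnn_neurons: int = 512             # assumed width of the final DNN


def trace_shapes(cfg, time_steps):
    """Return (stage, shape) pairs for one utterance of `time_steps` frames."""
    if len(cfg.cnn_channels) != cfg.cnn_blocks:
        raise ValueError("need one channel count per CNN block")

    shapes = [("input", (time_steps, cfg.n_feats, 1))]
    freq = cfg.n_feats
    for i in range(cfg.cnn_blocks):
        # Pooling acts on the frequency axis only; time resolution is kept.
        freq = freq // cfg.freq_pool
        shapes.append((f"cnn_block_{i + 1}", (time_steps, freq, cfg.cnn_channels[i])))

    # Frequency and channel axes are merged before the recurrent layer.
    flat = freq * cfg.cnn_channels[-1]
    shapes.append(("flatten", (time_steps, flat)))

    # A bidirectional LSTM concatenates forward and backward states.
    shapes.append(("bilstm", (time_steps, 2 * cfg.lstm_hidden)))

    # The final DNN yields the representation fed to the CTC and attention decoders.
    shapes.append(("dnn", (time_steps, cfg.dnn_neurons)))
    return shapes


if __name__ == "__main__":
    for stage, shape in trace_shapes(CRDNNShape(), time_steps=100):
        print(f"{stage:12s} {shape}")
```

The point of the sketch is that pooling reduces the frequency axis while the time axis is left alone, so the decoders still receive one vector per input frame under these assumptions.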