speechbrain
/

asr-transformer-transformerlm-librispeech

Titouan commited on Mar 10, 2021

Commit

381bad0

•

2 Parent(s): 0a69b53 6bbde74

Merge branch 'main' of https://huggingface.co/speechbrain/asr-transformer-transformerlm-librispeech into main

Files changed (1) hide show

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ tags:
 - ASR
 - CTC
 - Attention
-- Tranformer
 - pytorch
 license: "apache-2.0"
 datasets:
@@ -19,7 +19,7 @@ metrics:
 This repository provides all the necessary tools to perform automatic speech
 recognition from an end-to-end system pretrained on LibriSpeech (EN) within
-SpeechBrain. For a better experience we encourage you to learn more about
 [SpeechBrain](https://speechbrain.github.io). The given ASR model performance are:
 | Release | Test clean WER | Test other WER | GPUs |
@@ -28,18 +28,18 @@ SpeechBrain. For a better experience we encourage you to learn more about
 ## Pipeline description
-This ASR system is composed with 3 different but linked blocks:
 1. Tokenizer (unigram) that transforms words into subword units and trained with
 the train transcriptions of LibriSpeech.
 2. Neural language model (Transformer LM) trained on the full 10M words dataset.
 3. Acoustic model made of a transformer encoder and a joint decoder with CTC +
-transformer. Hence, the decoding also incorporate the CTC probabilities.
 ## Intended uses & limitations
-This model has been primilarly developed to be run within SpeechBrain as a pretrained ASR model
-for the english language. Thanks to the flexibility of SpeechBrain, any of the 3 blocks
-detailed above can be extracted and connected to you custom pipeline as long as SpeechBrain is
 installed.
 ## Install SpeechBrain

 - ASR
 - CTC
 - Attention
+- Transformer
 - pytorch
 license: "apache-2.0"
 datasets:
 This repository provides all the necessary tools to perform automatic speech
 recognition from an end-to-end system pretrained on LibriSpeech (EN) within
+SpeechBrain. For a better experience, we encourage you to learn more about
 [SpeechBrain](https://speechbrain.github.io). The given ASR model performance are:
 | Release | Test clean WER | Test other WER | GPUs |
 ## Pipeline description
+This ASR system is composed of 3 different but linked blocks:
 1. Tokenizer (unigram) that transforms words into subword units and trained with
 the train transcriptions of LibriSpeech.
 2. Neural language model (Transformer LM) trained on the full 10M words dataset.
 3. Acoustic model made of a transformer encoder and a joint decoder with CTC +
+transformer. Hence, the decoding also incorporates the CTC probabilities.
 ## Intended uses & limitations
+This model has been primarily developed to be run within SpeechBrain as a pretrained ASR model
+for the English language. Thanks to the flexibility of SpeechBrain, any of the 3 blocks
+detailed above can be extracted and connected to your custom pipeline as long as SpeechBrain is
 installed.
 ## Install SpeechBrain