pplantinga commited on
Commit
2f2b72c
1 Parent(s): 4266b19

Actually create README.md

Browse files
Files changed (1) hide show
  1. README.md +79 -0
README.md CHANGED
@@ -0,0 +1,79 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: "en"
3
+ tags:
4
+ - Robust ASR
5
+ - Speech Enhancement
6
+ - PyTorch
7
+ license: "apache-2.0"
8
+ datasets:
9
+ - Voicebank
10
+ - DEMAND
11
+ metrics:
12
+ - WER
13
+ - PESQ
14
+ - eSTOI
15
+ ---
16
+
17
+ # 1D CNN + Transformer Trained w/ Mimic Loss
18
+
19
+ This repository provides all the necessary tools to perform enhancement and
20
+ robust ASR training (EN) within
21
+ SpeechBrain. For a better experience we encourage you to learn more about
22
+ [SpeechBrain](https://speechbrain.github.io). The given model performance is:
23
+
24
+ | Release | Test PESQ | Test eSTOI | Valid WER | Test WER |
25
+ |:-----------:|:-----:| :-----:|:----:|:---------:|
26
+ | 21-03-08 | 2.92 | 85.2 | 3.20 | 2.96 |
27
+
28
+ ## Pipeline description
29
+
30
+ The mimic loss training system consists of three steps:
31
+
32
+ 1. A perceptual model is pre-trained on clean speech features, the
33
+ same type used for the enhancement masking system.
34
+ 2. An enhancement model is trained with mimic loss, using the
35
+ pre-trained perceptual model.
36
+ 3. A large ASR model pre-trained on LibriSpeech is fine-tuned
37
+ using the enhancement front-end.
38
+
39
+ The enhancement and ASR models can be used together or
40
+ independently.
41
+
42
+ ## Install SpeechBrain
43
+
44
+ First of all, please install SpeechBrain with the following command:
45
+
46
+ ```
47
+ pip install \\we hide ! SpeechBrain is still private :p
48
+ ```
49
+
50
+ Please notice that we encourage you to read our tutorials and learn more about
51
+ [SpeechBrain](https://speechbrain.github.io).
52
+
53
+ ## Referencing Mimic Loss
54
+
55
+ If you find mimic loss useful, please cite:
56
+
57
+ ```
58
+ @inproceedings{bagchi2018spectral,
59
+ title={Spectral Feature Mapping with Mimic Loss for Robust Speech Recognition},
60
+ author={Bagchi, Deblin and Plantinga, Peter and Stiff, Adam and Fosler-Lussier, Eric},
61
+ booktitle={IEEE Conference on Audio, Speech, and Signal Processing (ICASSP)},
62
+ year={2018}
63
+ }
64
+ ```
65
+
66
+ ## Referencing SpeechBrain
67
+
68
+ If you find SpeechBrain useful, please cite:
69
+
70
+ ```
71
+ @misc{SB2021,
72
+ author = {Ravanelli, Mirco and Parcollet, Titouan and Rouhe, Aku and Plantinga, Peter and Rastorgueva, Elena and Lugosch, Loren and Dawalatabad, Nauman and Ju-Chieh, Chou and Heba, Abdel and Grondin, Francois and Aris, William and Liao, Chien-Feng and Cornell, Samuele and Yeh, Sung-Lin and Na, Hwidong and Gao, Yan and Fu, Szu-Wei and Subakan, Cem and De Mori, Renato and Bengio, Yoshua },
73
+ title = {SpeechBrain},
74
+ year = {2021},
75
+ publisher = {GitHub},
76
+ journal = {GitHub repository},
77
+ howpublished = {\url{https://github.com/speechbrain/speechbrain}},
78
+ }
79
+ ```