GaelleLaperriere commited on
Commit
5af308a
1 Parent(s): e1a60d9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -10
README.md CHANGED
@@ -16,14 +16,14 @@ metrics:
16
  - cer
17
  - cher
18
  model-index:
19
- - name: slu-wav2vec2-ctc-MEDIA-relax
20
  results:
21
  - task:
22
  name: Spoken Language Understanding
23
  type: spoken-language-understanding
24
  dataset:
25
  name: MEDIA
26
- type: MEDIA_slu_relax
27
  config: fr
28
  split: test
29
  args:
@@ -31,13 +31,13 @@ model-index:
31
  metrics:
32
  - name: Test ChER
33
  type: cher
34
- value: 7.46
35
  - name: Test CER
36
  type: cer
37
- value: 20.10
38
  - name: Test CVER
39
  type: cver
40
- value: 31.41
41
  ---
42
 
43
  <iframe src="https://ghbtns.com/github-btn.html?user=speechbrain&repo=speechbrain&type=star&count=true&size=large&v=2" frameborder="0" scrolling="0" width="170" height="30" title="GitHub"></iframe>
@@ -54,7 +54,7 @@ The performance of the model is the following:
54
 
55
  | Release | Test ChER | Test CER | Test CVER | GPUs |
56
  |:-------------:|:--------------:|:--------------:|:--------------:|:--------:|
57
- | 22-02-23 | 7.46 | 20.10 | 31.41 | 1xV100 32GB |
58
 
59
  ## Pipeline description
60
 
@@ -80,7 +80,7 @@ Please notice that we encourage you to read our tutorials and learn more about
80
  ```python
81
  from speechbrain.pretrained import EncoderASR
82
 
83
- asr_model = EncoderASR.from_hparams(source="speechbrain/slu-wav2vec2-ctc-MEDIA-relax", savedir="pretrained_models/slu-wav2vec2-ctc-MEDIA-relax")
84
  asr_model.transcribe_file('speechbrain/asr-wav2vec2-commonvoice-fr/example-fr.wav')
85
 
86
  ```
@@ -104,7 +104,7 @@ pip install -e .
104
  - [Media ASR (ELRA-S0272)](https://catalogue.elra.info/en-us/repository/browse/ELRA-S0272/)
105
  - [Media SLU (ELRA-E0024)](https://catalogue.elra.info/en-us/repository/browse/ELRA-E0024/)
106
  - [channels.csv and concepts_full_relax.csv](https://drive.google.com/drive/u/1/folders/1z2zFZp3c0NYLFaUhhghhBakGcFdXVRyf)
107
- 4. Modify placeholders in hparams/train_hf_wav2vec_relax.yaml:
108
  ```bash
109
  data_folder = !PLACEHOLDER
110
  channels_path = !PLACEHOLDER
@@ -113,10 +113,10 @@ concepts_path = !PLACEHOLDER
113
  5. Run Training:
114
  ```bash
115
  cd recipes/MEDIA/SLU/CTC/
116
- python train_hf_wav2vec.py hparams/train_hf_wav2vec_relax.yaml
117
  ```
118
 
119
- You can find our training results (models, logs, etc) [here](https://drive.google.com/drive/folders/1ALtwmk3VUUM0XRToecQp1DKAh9FsGqMA?usp=sharing).
120
 
121
  ### Limitations
122
  The SpeechBrain team does not provide any warranty on the performance achieved by this model when used on other datasets.
 
16
  - cer
17
  - cher
18
  model-index:
19
+ - name: slu-wav2vec2-ctc-MEDIA-full
20
  results:
21
  - task:
22
  name: Spoken Language Understanding
23
  type: spoken-language-understanding
24
  dataset:
25
  name: MEDIA
26
+ type: MEDIA_slu_full
27
  config: fr
28
  split: test
29
  args:
 
31
  metrics:
32
  - name: Test ChER
33
  type: cher
34
+ value: 7.78
35
  - name: Test CER
36
  type: cer
37
+ value: 24.88
38
  - name: Test CVER
39
  type: cver
40
+ value: 35.77
41
  ---
42
 
43
  <iframe src="https://ghbtns.com/github-btn.html?user=speechbrain&repo=speechbrain&type=star&count=true&size=large&v=2" frameborder="0" scrolling="0" width="170" height="30" title="GitHub"></iframe>
 
54
 
55
  | Release | Test ChER | Test CER | Test CVER | GPUs |
56
  |:-------------:|:--------------:|:--------------:|:--------------:|:--------:|
57
+ | 22-02-23 | 7.78 | 24.88 | 35.77 | 1xV100 32GB |
58
 
59
  ## Pipeline description
60
 
 
80
  ```python
81
  from speechbrain.pretrained import EncoderASR
82
 
83
+ asr_model = EncoderASR.from_hparams(source="speechbrain/slu-wav2vec2-ctc-MEDIA-full", savedir="pretrained_models/slu-wav2vec2-ctc-MEDIA-full")
84
  asr_model.transcribe_file('speechbrain/asr-wav2vec2-commonvoice-fr/example-fr.wav')
85
 
86
  ```
 
104
  - [Media ASR (ELRA-S0272)](https://catalogue.elra.info/en-us/repository/browse/ELRA-S0272/)
105
  - [Media SLU (ELRA-E0024)](https://catalogue.elra.info/en-us/repository/browse/ELRA-E0024/)
106
  - [channels.csv and concepts_full_relax.csv](https://drive.google.com/drive/u/1/folders/1z2zFZp3c0NYLFaUhhghhBakGcFdXVRyf)
107
+ 4. Modify placeholders in hparams/train_hf_wav2vec_full.yaml:
108
  ```bash
109
  data_folder = !PLACEHOLDER
110
  channels_path = !PLACEHOLDER
 
113
  5. Run Training:
114
  ```bash
115
  cd recipes/MEDIA/SLU/CTC/
116
+ python train_hf_wav2vec.py hparams/train_hf_wav2vec_full.yaml
117
  ```
118
 
119
+ You can find our training results (models, logs, etc) [here](https://drive.google.com/drive/folders/1cZRRQ3Cc_ucXXlwYSJqB7pOl6KwZdo84?usp=share_link).
120
 
121
  ### Limitations
122
  The SpeechBrain team does not provide any warranty on the performance achieved by this model when used on other datasets.