Siddhant commited on
Commit
6d2f848
1 Parent(s): d202bc6

import from zenodo

Browse files
Files changed (29) hide show
  1. README.md +50 -0
  2. exp/asr_stats_raw_en_char_sp/train/feats_stats.npz +0 -0
  3. exp/asr_train_asr_transformer3_raw_en_char_sp/RESULTS.md +45 -0
  4. exp/asr_train_asr_transformer3_raw_en_char_sp/config.yaml +231 -0
  5. exp/asr_train_asr_transformer3_raw_en_char_sp/images/acc.png +0 -0
  6. exp/asr_train_asr_transformer3_raw_en_char_sp/images/backward_time.png +0 -0
  7. exp/asr_train_asr_transformer3_raw_en_char_sp/images/cer.png +0 -0
  8. exp/asr_train_asr_transformer3_raw_en_char_sp/images/cer_ctc.png +0 -0
  9. exp/asr_train_asr_transformer3_raw_en_char_sp/images/forward_time.png +0 -0
  10. exp/asr_train_asr_transformer3_raw_en_char_sp/images/iter_time.png +0 -0
  11. exp/asr_train_asr_transformer3_raw_en_char_sp/images/loss.png +0 -0
  12. exp/asr_train_asr_transformer3_raw_en_char_sp/images/loss_att.png +0 -0
  13. exp/asr_train_asr_transformer3_raw_en_char_sp/images/loss_ctc.png +0 -0
  14. exp/asr_train_asr_transformer3_raw_en_char_sp/images/lr_0.png +0 -0
  15. exp/asr_train_asr_transformer3_raw_en_char_sp/images/optim_step_time.png +0 -0
  16. exp/asr_train_asr_transformer3_raw_en_char_sp/images/train_time.png +0 -0
  17. exp/asr_train_asr_transformer3_raw_en_char_sp/images/wer.png +0 -0
  18. exp/asr_train_asr_transformer3_raw_en_char_sp/valid.acc.ave_10best.pth +3 -0
  19. exp/lm_train_lm_transformer_en_char/config.yaml +180 -0
  20. exp/lm_train_lm_transformer_en_char/images/backward_time.png +0 -0
  21. exp/lm_train_lm_transformer_en_char/images/forward_time.png +0 -0
  22. exp/lm_train_lm_transformer_en_char/images/iter_time.png +0 -0
  23. exp/lm_train_lm_transformer_en_char/images/loss.png +0 -0
  24. exp/lm_train_lm_transformer_en_char/images/lr_0.png +0 -0
  25. exp/lm_train_lm_transformer_en_char/images/optim_step_time.png +0 -0
  26. exp/lm_train_lm_transformer_en_char/images/train_time.png +0 -0
  27. exp/lm_train_lm_transformer_en_char/perplexity_test/ppl +1 -0
  28. exp/lm_train_lm_transformer_en_char/valid.loss.ave_10best.pth +3 -0
  29. meta.yaml +10 -0
README.md ADDED
@@ -0,0 +1,50 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - espnet
4
+ - audio
5
+ - automatic-speech-recognition
6
+ language: en
7
+ datasets:
8
+ - chime4
9
+ license: cc-by-4.0
10
+ ---
11
+ ## Example ESPnet2 ASR model
12
+ ### `kamo-naoyuki/chime4_asr_train_asr_transformer3_raw_en_char_sp_valid.acc.ave`
13
+ ♻️ Imported from https://zenodo.org/record/4414883/
14
+
15
+ This model was trained by kamo-naoyuki using chime4/asr1 recipe in [espnet](https://github.com/espnet/espnet/).
16
+ ### Demo: How to use in ESPnet2
17
+ ```python
18
+ # coming soon
19
+ ```
20
+ ### Citing ESPnet
21
+ ```BibTex
22
+ @inproceedings{watanabe2018espnet,
23
+ author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson {Enrique Yalta Soplin} and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
24
+ title={{ESPnet}: End-to-End Speech Processing Toolkit},
25
+ year={2018},
26
+ booktitle={Proceedings of Interspeech},
27
+ pages={2207--2211},
28
+ doi={10.21437/Interspeech.2018-1456},
29
+ url={http://dx.doi.org/10.21437/Interspeech.2018-1456}
30
+ }
31
+ @inproceedings{hayashi2020espnet,
32
+ title={{Espnet-TTS}: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit},
33
+ author={Hayashi, Tomoki and Yamamoto, Ryuichi and Inoue, Katsuki and Yoshimura, Takenori and Watanabe, Shinji and Toda, Tomoki and Takeda, Kazuya and Zhang, Yu and Tan, Xu},
34
+ booktitle={Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
35
+ pages={7654--7658},
36
+ year={2020},
37
+ organization={IEEE}
38
+ }
39
+ ```
40
+ or arXiv:
41
+ ```bibtex
42
+ @misc{watanabe2018espnet,
43
+ title={ESPnet: End-to-End Speech Processing Toolkit},
44
+ author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson Enrique Yalta Soplin and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
45
+ year={2018},
46
+ eprint={1804.00015},
47
+ archivePrefix={arXiv},
48
+ primaryClass={cs.CL}
49
+ }
50
+ ```
exp/asr_stats_raw_en_char_sp/train/feats_stats.npz ADDED
Binary file (1.4 kB). View file
 
exp/asr_train_asr_transformer3_raw_en_char_sp/RESULTS.md ADDED
@@ -0,0 +1,45 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ <!-- Generated by scripts/utils/show_asr_result.sh -->
2
+ # RESULTS
3
+ ## Environments
4
+ - date: `Wed Dec 30 06:42:56 JST 2020`
5
+ - python version: `3.8.5 (default, Sep 4 2020, 07:30:14) [GCC 7.3.0]`
6
+ - espnet version: `espnet 0.9.6`
7
+ - pytorch version: `pytorch 1.4.0`
8
+ - Git hash: `d5ddd5e601f064eea0c1f96acaeefa324fc7d392`
9
+ - Commit date: `Fri Dec 25 07:31:34 2020 +0000`
10
+
11
+ ## asr_train_asr_transformer3_raw_en_char_sp
12
+ ### WER
13
+
14
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
15
+ |---|---|---|---|---|---|---|---|---|
16
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_real_beamformit_2mics|1640|27119|95.2|3.7|1.1|0.5|5.3|42.4|
17
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_real_beamformit_5mics|1640|27119|96.2|2.9|0.9|0.4|4.3|37.9|
18
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_real_isolated_1ch_track|1640|27119|94.3|4.4|1.3|0.7|6.3|46.6|
19
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_simu_beamformit_2mics|1640|27120|93.3|5.1|1.6|0.8|7.5|52.0|
20
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_simu_beamformit_5mics|1640|27120|94.9|3.8|1.3|0.6|5.7|45.9|
21
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_simu_isolated_1ch_track|1640|27120|92.2|6.0|1.7|0.9|8.7|52.4|
22
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_real_beamformit_2mics|1320|21409|91.4|6.9|1.7|1.0|9.6|52.8|
23
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_real_beamformit_5mics|1320|21409|93.0|5.6|1.4|0.9|7.9|48.9|
24
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_real_isolated_1ch_track|1320|21409|89.5|8.4|2.1|1.3|11.8|59.1|
25
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_simu_beamformit_2mics|1320|21416|89.3|8.0|2.6|1.4|12.1|61.0|
26
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_simu_beamformit_5mics|1320|21416|91.4|6.4|2.2|1.2|9.8|54.5|
27
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_simu_isolated_1ch_track|1320|21416|87.5|9.6|2.8|1.6|14.1|60.8|
28
+
29
+ ### CER
30
+
31
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
32
+ |---|---|---|---|---|---|---|---|---|
33
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_real_beamformit_2mics|1640|160390|97.8|1.0|1.2|0.5|2.7|42.4|
34
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_real_beamformit_5mics|1640|160390|98.3|0.8|0.9|0.4|2.1|37.9|
35
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_real_isolated_1ch_track|1640|160390|97.4|1.3|1.3|0.7|3.3|46.6|
36
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_simu_beamformit_2mics|1640|160400|96.7|1.5|1.8|0.8|4.1|52.0|
37
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_simu_beamformit_5mics|1640|160400|97.6|1.1|1.3|0.6|3.0|45.9|
38
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/dt05_simu_isolated_1ch_track|1640|160400|96.2|1.9|1.9|1.0|4.8|52.4|
39
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_real_beamformit_2mics|1320|126796|95.9|2.1|2.0|1.0|5.1|52.8|
40
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_real_beamformit_5mics|1320|126796|96.8|1.6|1.6|0.8|4.1|48.9|
41
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_real_isolated_1ch_track|1320|126796|94.7|2.7|2.5|1.3|6.6|59.1|
42
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_simu_beamformit_2mics|1320|126812|94.2|2.7|3.1|1.5|7.2|61.0|
43
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_simu_beamformit_5mics|1320|126812|95.6|2.0|2.4|1.2|5.6|54.5|
44
+ |decode_asr_rnn_lm_lm_train_lm_transformer_en_char_valid.loss.ave_asr_model_valid.acc.ave/et05_simu_isolated_1ch_track|1320|126812|93.2|3.4|3.4|1.8|8.6|60.8|
45
+
exp/asr_train_asr_transformer3_raw_en_char_sp/config.yaml ADDED
@@ -0,0 +1,231 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ config: conf/tuning/train_asr_transformer3.yaml
2
+ print_config: false
3
+ log_level: INFO
4
+ dry_run: false
5
+ iterator_type: sequence
6
+ output_dir: exp/asr_train_asr_transformer3_raw_en_char_sp
7
+ ngpu: 1
8
+ seed: 0
9
+ num_workers: 1
10
+ num_att_plot: 3
11
+ dist_backend: nccl
12
+ dist_init_method: env://
13
+ dist_world_size: 4
14
+ dist_rank: 0
15
+ local_rank: 0
16
+ dist_master_addr: localhost
17
+ dist_master_port: 56935
18
+ dist_launcher: null
19
+ multiprocessing_distributed: true
20
+ cudnn_enabled: true
21
+ cudnn_benchmark: false
22
+ cudnn_deterministic: true
23
+ collect_stats: false
24
+ write_collected_feats: false
25
+ max_epoch: 50
26
+ patience: null
27
+ val_scheduler_criterion:
28
+ - valid
29
+ - loss
30
+ early_stopping_criterion:
31
+ - valid
32
+ - loss
33
+ - min
34
+ best_model_criterion:
35
+ - - valid
36
+ - acc
37
+ - max
38
+ keep_nbest_models: 10
39
+ grad_clip: 5.0
40
+ grad_clip_type: 2.0
41
+ grad_noise: false
42
+ accum_grad: 1
43
+ no_forward_run: false
44
+ resume: true
45
+ train_dtype: float32
46
+ use_amp: false
47
+ log_interval: null
48
+ unused_parameters: false
49
+ use_tensorboard: true
50
+ use_wandb: false
51
+ wandb_project: null
52
+ wandb_id: null
53
+ pretrain_path: null
54
+ init_param: []
55
+ freeze_param: []
56
+ num_iters_per_epoch: null
57
+ batch_size: 20
58
+ valid_batch_size: null
59
+ batch_bins: 16000000
60
+ valid_batch_bins: null
61
+ train_shape_file:
62
+ - exp/asr_stats_raw_en_char_sp/train/speech_shape
63
+ - exp/asr_stats_raw_en_char_sp/train/text_shape.char
64
+ valid_shape_file:
65
+ - exp/asr_stats_raw_en_char_sp/valid/speech_shape
66
+ - exp/asr_stats_raw_en_char_sp/valid/text_shape.char
67
+ batch_type: numel
68
+ valid_batch_type: null
69
+ fold_length:
70
+ - 80000
71
+ - 150
72
+ sort_in_batch: descending
73
+ sort_batch: descending
74
+ multiple_iterator: false
75
+ chunk_length: 500
76
+ chunk_shift_ratio: 0.5
77
+ num_cache_chunks: 1024
78
+ train_data_path_and_name_and_type:
79
+ - - dump/raw/tr05_multi_noisy_si284_sp/wav.scp
80
+ - speech
81
+ - sound
82
+ - - dump/raw/tr05_multi_noisy_si284_sp/text
83
+ - text
84
+ - text
85
+ valid_data_path_and_name_and_type:
86
+ - - dump/raw/dt05_multi_isolated_1ch_track/wav.scp
87
+ - speech
88
+ - sound
89
+ - - dump/raw/dt05_multi_isolated_1ch_track/text
90
+ - text
91
+ - text
92
+ allow_variable_data_keys: false
93
+ max_cache_size: 0.0
94
+ max_cache_fd: 32
95
+ valid_max_cache_size: null
96
+ optim: adam
97
+ optim_conf:
98
+ lr: 0.005
99
+ scheduler: warmuplr
100
+ scheduler_conf:
101
+ warmup_steps: 30000
102
+ token_list:
103
+ - <blank>
104
+ - <unk>
105
+ - <space>
106
+ - E
107
+ - T
108
+ - A
109
+ - N
110
+ - I
111
+ - O
112
+ - S
113
+ - R
114
+ - H
115
+ - L
116
+ - D
117
+ - C
118
+ - U
119
+ - M
120
+ - P
121
+ - F
122
+ - G
123
+ - Y
124
+ - W
125
+ - B
126
+ - V
127
+ - K
128
+ - .
129
+ - X
130
+ - ''''
131
+ - J
132
+ - Q
133
+ - Z
134
+ - ','
135
+ - '-'
136
+ - '"'
137
+ - <NOISE>
138
+ - '*'
139
+ - ':'
140
+ - (
141
+ - )
142
+ - '?'
143
+ - '&'
144
+ - ;
145
+ - '!'
146
+ - /
147
+ - '{'
148
+ - '}'
149
+ - '1'
150
+ - '2'
151
+ - '0'
152
+ - $
153
+ - '8'
154
+ - '9'
155
+ - '6'
156
+ - '3'
157
+ - '5'
158
+ - '7'
159
+ - '4'
160
+ - '~'
161
+ - '`'
162
+ - _
163
+ - <*IN*>
164
+ - <*MR.*>
165
+ - \
166
+ - ^
167
+ - <sos/eos>
168
+ init: xavier_uniform
169
+ input_size: null
170
+ ctc_conf:
171
+ dropout_rate: 0.0
172
+ ctc_type: builtin
173
+ reduce: true
174
+ ignore_nan_grad: false
175
+ model_conf:
176
+ ctc_weight: 0.3
177
+ lsm_weight: 0.1
178
+ length_normalized_loss: false
179
+ use_preprocessor: true
180
+ token_type: char
181
+ bpemodel: null
182
+ non_linguistic_symbols: data/nlsyms.txt
183
+ cleaner: null
184
+ g2p: null
185
+ frontend: default
186
+ frontend_conf:
187
+ fs: 16k
188
+ specaug: specaug
189
+ specaug_conf:
190
+ apply_time_warp: true
191
+ time_warp_window: 5
192
+ time_warp_mode: bicubic
193
+ apply_freq_mask: true
194
+ freq_mask_width_range:
195
+ - 0
196
+ - 30
197
+ num_freq_mask: 2
198
+ apply_time_mask: true
199
+ time_mask_width_range:
200
+ - 0
201
+ - 40
202
+ num_time_mask: 2
203
+ normalize: global_mvn
204
+ normalize_conf:
205
+ stats_file: exp/asr_stats_raw_en_char_sp/train/feats_stats.npz
206
+ preencoder: null
207
+ preencoder_conf: {}
208
+ encoder: transformer
209
+ encoder_conf:
210
+ output_size: 256
211
+ attention_heads: 4
212
+ linear_units: 2048
213
+ num_blocks: 12
214
+ dropout_rate: 0.1
215
+ positional_dropout_rate: 0.1
216
+ attention_dropout_rate: 0.0
217
+ input_layer: conv2d
218
+ normalize_before: true
219
+ decoder: transformer
220
+ decoder_conf:
221
+ attention_heads: 4
222
+ linear_units: 2028
223
+ num_blocks: 6
224
+ dropout_rate: 0.1
225
+ positional_dropout_rate: 0.1
226
+ self_attention_dropout_rate: 0.0
227
+ src_attention_dropout_rate: 0.0
228
+ required:
229
+ - output_dir
230
+ - token_list
231
+ distributed: true
exp/asr_train_asr_transformer3_raw_en_char_sp/images/acc.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/backward_time.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/cer.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/cer_ctc.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/forward_time.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/iter_time.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/loss.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/loss_att.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/loss_ctc.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/lr_0.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/optim_step_time.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/train_time.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/images/wer.png ADDED
exp/asr_train_asr_transformer3_raw_en_char_sp/valid.acc.ave_10best.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:656593443e5372019181462bafca75d0f6d01a8dbc01a34d7fcec6afd49cae5f
3
+ size 108486817
exp/lm_train_lm_transformer_en_char/config.yaml ADDED
@@ -0,0 +1,180 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ config: conf/tuning/train_lm_transformer.yaml
2
+ print_config: false
3
+ log_level: INFO
4
+ dry_run: false
5
+ iterator_type: sequence
6
+ output_dir: exp/lm_train_lm_transformer_en_char
7
+ ngpu: 1
8
+ seed: 0
9
+ num_workers: 1
10
+ num_att_plot: 3
11
+ dist_backend: nccl
12
+ dist_init_method: env://
13
+ dist_world_size: 4
14
+ dist_rank: 0
15
+ local_rank: 0
16
+ dist_master_addr: localhost
17
+ dist_master_port: 50129
18
+ dist_launcher: null
19
+ multiprocessing_distributed: true
20
+ cudnn_enabled: true
21
+ cudnn_benchmark: false
22
+ cudnn_deterministic: true
23
+ collect_stats: false
24
+ write_collected_feats: false
25
+ max_epoch: 25
26
+ patience: null
27
+ val_scheduler_criterion:
28
+ - valid
29
+ - loss
30
+ early_stopping_criterion:
31
+ - valid
32
+ - loss
33
+ - min
34
+ best_model_criterion:
35
+ - - valid
36
+ - loss
37
+ - min
38
+ keep_nbest_models: 10
39
+ grad_clip: 5.0
40
+ grad_clip_type: 2.0
41
+ grad_noise: false
42
+ accum_grad: 2
43
+ no_forward_run: false
44
+ resume: true
45
+ train_dtype: float32
46
+ use_amp: false
47
+ log_interval: null
48
+ unused_parameters: false
49
+ use_tensorboard: true
50
+ use_wandb: false
51
+ wandb_project: null
52
+ wandb_id: null
53
+ pretrain_path: null
54
+ init_param: []
55
+ freeze_param: []
56
+ num_iters_per_epoch: null
57
+ batch_size: 20
58
+ valid_batch_size: null
59
+ batch_bins: 350000
60
+ valid_batch_bins: null
61
+ train_shape_file:
62
+ - exp/lm_stats_en_char/train/text_shape.char
63
+ valid_shape_file:
64
+ - exp/lm_stats_en_char/valid/text_shape.char
65
+ batch_type: numel
66
+ valid_batch_type: null
67
+ fold_length:
68
+ - 150
69
+ sort_in_batch: descending
70
+ sort_batch: descending
71
+ multiple_iterator: false
72
+ chunk_length: 500
73
+ chunk_shift_ratio: 0.5
74
+ num_cache_chunks: 1024
75
+ train_data_path_and_name_and_type:
76
+ - - dump/raw/lm_train.txt
77
+ - text
78
+ - text
79
+ valid_data_path_and_name_and_type:
80
+ - - dump/raw/dt05_multi_isolated_1ch_track/text
81
+ - text
82
+ - text
83
+ allow_variable_data_keys: false
84
+ max_cache_size: 0.0
85
+ max_cache_fd: 32
86
+ valid_max_cache_size: null
87
+ optim: adam
88
+ optim_conf:
89
+ lr: 0.001
90
+ scheduler: warmuplr
91
+ scheduler_conf:
92
+ warmup_steps: 25000
93
+ token_list:
94
+ - <blank>
95
+ - <unk>
96
+ - <space>
97
+ - E
98
+ - T
99
+ - A
100
+ - N
101
+ - I
102
+ - O
103
+ - S
104
+ - R
105
+ - H
106
+ - L
107
+ - D
108
+ - C
109
+ - U
110
+ - M
111
+ - P
112
+ - F
113
+ - G
114
+ - Y
115
+ - W
116
+ - B
117
+ - V
118
+ - K
119
+ - .
120
+ - X
121
+ - ''''
122
+ - J
123
+ - Q
124
+ - Z
125
+ - ','
126
+ - '-'
127
+ - '"'
128
+ - <NOISE>
129
+ - '*'
130
+ - ':'
131
+ - (
132
+ - )
133
+ - '?'
134
+ - '&'
135
+ - ;
136
+ - '!'
137
+ - /
138
+ - '{'
139
+ - '}'
140
+ - '1'
141
+ - '2'
142
+ - '0'
143
+ - $
144
+ - '8'
145
+ - '9'
146
+ - '6'
147
+ - '3'
148
+ - '5'
149
+ - '7'
150
+ - '4'
151
+ - '~'
152
+ - '`'
153
+ - _
154
+ - <*IN*>
155
+ - <*MR.*>
156
+ - \
157
+ - ^
158
+ - <sos/eos>
159
+ init: null
160
+ model_conf:
161
+ ignore_id: 0
162
+ use_preprocessor: true
163
+ token_type: char
164
+ bpemodel: null
165
+ non_linguistic_symbols: data/nlsyms.txt
166
+ cleaner: null
167
+ g2p: null
168
+ lm: transformer
169
+ lm_conf:
170
+ pos_enc: null
171
+ embed_unit: 128
172
+ att_unit: 512
173
+ head: 8
174
+ unit: 2048
175
+ layer: 16
176
+ dropout_rate: 0.1
177
+ required:
178
+ - output_dir
179
+ - token_list
180
+ distributed: true
exp/lm_train_lm_transformer_en_char/images/backward_time.png ADDED
exp/lm_train_lm_transformer_en_char/images/forward_time.png ADDED
exp/lm_train_lm_transformer_en_char/images/iter_time.png ADDED
exp/lm_train_lm_transformer_en_char/images/loss.png ADDED
exp/lm_train_lm_transformer_en_char/images/lr_0.png ADDED
exp/lm_train_lm_transformer_en_char/images/optim_step_time.png ADDED
exp/lm_train_lm_transformer_en_char/images/train_time.png ADDED
exp/lm_train_lm_transformer_en_char/perplexity_test/ppl ADDED
@@ -0,0 +1 @@
 
 
1
+ 1.8042961355060017
exp/lm_train_lm_transformer_en_char/valid.loss.ave_10best.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c80fe3cbd661b5c3ad71538c2989d3d06f47ce73ff322d1c6ad2709bd6b42c64
3
+ size 202251148
meta.yaml ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ espnet: 0.9.6
2
+ files:
3
+ asr_model_file: exp/asr_train_asr_transformer3_raw_en_char_sp/valid.acc.ave_10best.pth
4
+ lm_file: exp/lm_train_lm_transformer_en_char/valid.loss.ave_10best.pth
5
+ python: "3.8.5 (default, Sep 4 2020, 07:30:14) \n[GCC 7.3.0]"
6
+ timestamp: 1609720326.551673
7
+ torch: 1.4.0
8
+ yaml_files:
9
+ asr_train_config: exp/asr_train_asr_transformer3_raw_en_char_sp/config.yaml
10
+ lm_train_config: exp/lm_train_lm_transformer_en_char/config.yaml