Wangyou Zhang commited on
Commit
569cf47
1 Parent(s): 7fe6eac

Add model and configs

Browse files
README.md CHANGED
@@ -1,3 +1,297 @@
1
  ---
 
 
 
 
 
 
 
2
  license: cc-by-4.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ tags:
3
+ - espnet
4
+ - audio
5
+ - audio-to-audio
6
+ language: en
7
+ datasets:
8
+ - wsj0_2mix
9
  license: cc-by-4.0
10
  ---
11
+
12
+ ## ESPnet2 ENH model
13
+
14
+ ### `espnet/Wangyou_Zhang_wsj0_2mix_train_enh_tse_td_speakerbeam_raw`
15
+
16
+ This model was trained by Wangyou Zhang using the wsj0_2mix recipe in [espnet](https://github.com/espnet/espnet/).
17
+
18
+ ### Demo: How to use in ESPnet2
19
+
20
+ Follow the [ESPnet installation instructions](https://espnet.github.io/espnet/installation.html)
21
+ if you haven't done that already.
22
+
23
+ ```bash
24
+ cd espnet
25
+
26
+ pip install -e .
27
+ cd egs2/wsj0_2mix/tse1
28
+ ./run.sh --skip_data_prep false --skip_train true --is_tse_task true --download_model espnet/Wangyou_Zhang_wsj0_2mix_train_enh_tse_td_speakerbeam_raw
29
+ ```
30
+
31
+ <!-- Generated by ./scripts/utils/show_enh_score.sh -->
32
+ # RESULTS
33
+ ## Environments
34
+ - date: `Sun Jul 9 09:23:16 CST 2023`
35
+ - python version: `3.8.16 (default, Mar 2 2023, 03:21:46) [GCC 11.2.0]`
36
+ - espnet version: `espnet 202301`
37
+ - pytorch version: `pytorch 2.0.1`
38
+ - Git hash: ``
39
+ - Commit date: ``
40
+
41
+
42
+ ## enh_train_enh_tse_td_speakerbeam_org_raw
43
+
44
+ config: conf/tuning/train_enh_tse_td_speakerbeam_org.yaml
45
+
46
+ |dataset|PESQ_NB|STOI|SAR|SDR|SIR|SI_SNR|
47
+ |---|---|---|---|---|---|---|
48
+ |enhanced_cv_min_8k|3.54|96.41|18.75|18.75|0.00|18.37|
49
+ |enhanced_tt_min_8k|3.46|96.35|17.51|17.51|0.00|17.11|
50
+
51
+ ## ENH config
52
+
53
+ <details><summary>expand</summary>
54
+
55
+ ```
56
+ config: conf/tuning/train_enh_tse_td_speakerbeam.yaml
57
+ print_config: false
58
+ log_level: INFO
59
+ dry_run: false
60
+ iterator_type: chunk
61
+ output_dir: exp/enh_train_enh_tse_td_speakerbeam_raw
62
+ ngpu: 1
63
+ seed: 0
64
+ num_workers: 4
65
+ num_att_plot: 3
66
+ dist_backend: nccl
67
+ dist_init_method: env://
68
+ dist_world_size: null
69
+ dist_rank: null
70
+ local_rank: 0
71
+ dist_master_addr: null
72
+ dist_master_port: null
73
+ dist_launcher: null
74
+ multiprocessing_distributed: false
75
+ unused_parameters: true
76
+ sharded_ddp: false
77
+ cudnn_enabled: true
78
+ cudnn_benchmark: false
79
+ cudnn_deterministic: true
80
+ collect_stats: false
81
+ write_collected_feats: false
82
+ skip_stats_npz: false
83
+ max_epoch: 100
84
+ patience: 20
85
+ val_scheduler_criterion:
86
+ - valid
87
+ - loss
88
+ early_stopping_criterion:
89
+ - valid
90
+ - loss
91
+ - min
92
+ best_model_criterion:
93
+ - - valid
94
+ - snr
95
+ - max
96
+ - - valid
97
+ - loss
98
+ - min
99
+ keep_nbest_models: 1
100
+ nbest_averaging_interval: 0
101
+ grad_clip: 5.0
102
+ grad_clip_type: 2.0
103
+ grad_noise: false
104
+ accum_grad: 1
105
+ no_forward_run: false
106
+ resume: true
107
+ train_dtype: float32
108
+ use_amp: false
109
+ log_interval: null
110
+ use_matplotlib: true
111
+ use_tensorboard: true
112
+ create_graph_in_tensorboard: false
113
+ use_wandb: false
114
+ wandb_project: null
115
+ wandb_id: null
116
+ wandb_entity: null
117
+ wandb_name: null
118
+ wandb_model_log_interval: -1
119
+ detect_anomaly: false
120
+ pretrain_path: null
121
+ init_param: []
122
+ ignore_init_mismatch: false
123
+ freeze_param: []
124
+ num_iters_per_epoch: null
125
+ batch_size: 4
126
+ valid_batch_size: null
127
+ batch_bins: 1000000
128
+ valid_batch_bins: null
129
+ train_shape_file:
130
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/train/speech_mix_shape
131
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/train/speech_ref1_shape
132
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/train/enroll_ref1_shape
133
+ valid_shape_file:
134
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/valid/speech_mix_shape
135
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/valid/speech_ref1_shape
136
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/valid/enroll_ref1_shape
137
+ batch_type: folded
138
+ valid_batch_type: null
139
+ fold_length:
140
+ - 80000
141
+ - 80000
142
+ - 80000
143
+ sort_in_batch: descending
144
+ sort_batch: descending
145
+ multiple_iterator: false
146
+ chunk_length: 32000
147
+ chunk_shift_ratio: 0.5
148
+ num_cache_chunks: 1024
149
+ chunk_excluded_key_prefixes:
150
+ - enroll_ref
151
+ train_data_path_and_name_and_type:
152
+ - - dump/raw/tr_min_8k/wav.scp
153
+ - speech_mix
154
+ - sound
155
+ - - dump/raw/tr_min_8k/spk1.scp
156
+ - speech_ref1
157
+ - sound
158
+ - - dump/raw/tr_min_8k/enroll_spk1.scp
159
+ - enroll_ref1
160
+ - text
161
+ valid_data_path_and_name_and_type:
162
+ - - dump/raw/cv_min_8k/wav.scp
163
+ - speech_mix
164
+ - sound
165
+ - - dump/raw/cv_min_8k/spk1.scp
166
+ - speech_ref1
167
+ - sound
168
+ - - dump/raw/cv_min_8k/enroll_spk1.scp
169
+ - enroll_ref1
170
+ - text
171
+ allow_variable_data_keys: false
172
+ max_cache_size: 0.0
173
+ max_cache_fd: 32
174
+ valid_max_cache_size: null
175
+ exclude_weight_decay: false
176
+ exclude_weight_decay_conf: {}
177
+ optim: adam
178
+ optim_conf:
179
+ lr: 0.001
180
+ eps: 1.0e-08
181
+ weight_decay: 0
182
+ scheduler: reducelronplateau
183
+ scheduler_conf:
184
+ mode: min
185
+ factor: 0.7
186
+ patience: 3
187
+ init: null
188
+ model_conf:
189
+ num_spk: 1
190
+ share_encoder: true
191
+ criterions:
192
+ - name: snr
193
+ conf:
194
+ eps: 1.0e-07
195
+ wrapper: fixed_order
196
+ wrapper_conf:
197
+ weight: 1.0
198
+ train_spk2enroll: null
199
+ enroll_segment: 16000
200
+ load_spk_embedding: false
201
+ load_all_speakers: false
202
+ rir_scp: null
203
+ rir_apply_prob: 1.0
204
+ noise_scp: null
205
+ noise_apply_prob: 1.0
206
+ noise_db_range: '13_15'
207
+ short_noise_thres: 0.5
208
+ speech_volume_normalize: null
209
+ use_reverberant_ref: false
210
+ num_spk: 1
211
+ num_noise_type: 1
212
+ sample_rate: 8000
213
+ force_single_channel: false
214
+ channel_reordering: false
215
+ categories: []
216
+ encoder: conv
217
+ encoder_conf:
218
+ channel: 256
219
+ kernel_size: 16
220
+ stride: 8
221
+ extractor: td_speakerbeam
222
+ extractor_conf:
223
+ layer: 8
224
+ stack: 4
225
+ bottleneck_dim: 256
226
+ hidden_dim: 512
227
+ skip_dim: 256
228
+ kernel: 3
229
+ causal: false
230
+ norm_type: gLN
231
+ nonlinear: relu
232
+ i_adapt_layer: 7
233
+ adapt_layer_type: mul
234
+ adapt_enroll_dim: 256
235
+ use_spk_emb: false
236
+ spk_emb_dim: 256
237
+ decoder: conv
238
+ decoder_conf:
239
+ channel: 256
240
+ kernel_size: 16
241
+ stride: 8
242
+ preprocessor: tse
243
+ preprocessor_conf: {}
244
+ required:
245
+ - output_dir
246
+ version: '202301'
247
+ distributed: false
248
+ ```
249
+
250
+ </details>
251
+
252
+
253
+
254
+ ### Citing ESPnet
255
+
256
+ ```BibTex
257
+ @inproceedings{watanabe2018espnet,
258
+ author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson Yalta and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
259
+ title={{ESPnet}: End-to-End Speech Processing Toolkit},
260
+ year={2018},
261
+ booktitle={Proceedings of Interspeech},
262
+ pages={2207--2211},
263
+ doi={10.21437/Interspeech.2018-1456},
264
+ url={http://dx.doi.org/10.21437/Interspeech.2018-1456}
265
+ }
266
+
267
+
268
+ @inproceedings{ESPnet-SE,
269
+ author = {Chenda Li and Jing Shi and Wangyou Zhang and Aswin Shanmugam Subramanian and Xuankai Chang and
270
+ Naoyuki Kamo and Moto Hira and Tomoki Hayashi and Christoph B{"{o}}ddeker and Zhuo Chen and Shinji Watanabe},
271
+ title = {ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for {ASR} Integration},
272
+ booktitle = {{IEEE} Spoken Language Technology Workshop, {SLT} 2021, Shenzhen, China, January 19-22, 2021},
273
+ pages = {785--792},
274
+ publisher = {{IEEE}},
275
+ year = {2021},
276
+ url = {https://doi.org/10.1109/SLT48900.2021.9383615},
277
+ doi = {10.1109/SLT48900.2021.9383615},
278
+ timestamp = {Mon, 12 Apr 2021 17:08:59 +0200},
279
+ biburl = {https://dblp.org/rec/conf/slt/Li0ZSCKHHBC021.bib},
280
+ bibsource = {dblp computer science bibliography, https://dblp.org}
281
+ }
282
+
283
+
284
+ ```
285
+
286
+ or arXiv:
287
+
288
+ ```bibtex
289
+ @misc{watanabe2018espnet,
290
+ title={ESPnet: End-to-End Speech Processing Toolkit},
291
+ author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson Yalta and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
292
+ year={2018},
293
+ eprint={1804.00015},
294
+ archivePrefix={arXiv},
295
+ primaryClass={cs.CL}
296
+ }
297
+ ```
exp/enh_train_enh_tse_td_speakerbeam_raw/100epoch.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3b8d3045a4ee5e281fc0b8909c04bbc69424f51248f715fe72d0b0350fa6af28
3
+ size 65006631
exp/enh_train_enh_tse_td_speakerbeam_raw/RESULTS.md ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ <!-- Generated by ./scripts/utils/show_enh_score.sh -->
2
+ # RESULTS
3
+ ## Environments
4
+ - date: `Sun Jul 9 09:23:16 CST 2023`
5
+ - python version: `3.8.16 (default, Mar 2 2023, 03:21:46) [GCC 11.2.0]`
6
+ - espnet version: `espnet 202301`
7
+ - pytorch version: `pytorch 2.0.1`
8
+ - Git hash: ``
9
+ - Commit date: ``
10
+
11
+
12
+ ## enh_train_enh_tse_td_speakerbeam_org_raw
13
+
14
+ config: conf/tuning/train_enh_tse_td_speakerbeam_org.yaml
15
+
16
+ |dataset|PESQ_NB|STOI|SAR|SDR|SIR|SI_SNR|
17
+ |---|---|---|---|---|---|---|
18
+ |enhanced_cv_min_8k|3.54|96.41|18.75|18.75|0.00|18.37|
19
+ |enhanced_tt_min_8k|3.46|96.35|17.51|17.51|0.00|17.11|
20
+
exp/enh_train_enh_tse_td_speakerbeam_raw/config.yaml ADDED
@@ -0,0 +1,192 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ config: conf/tuning/train_enh_tse_td_speakerbeam.yaml
2
+ print_config: false
3
+ log_level: INFO
4
+ dry_run: false
5
+ iterator_type: chunk
6
+ output_dir: exp/enh_train_enh_tse_td_speakerbeam_raw
7
+ ngpu: 1
8
+ seed: 0
9
+ num_workers: 4
10
+ num_att_plot: 3
11
+ dist_backend: nccl
12
+ dist_init_method: env://
13
+ dist_world_size: null
14
+ dist_rank: null
15
+ local_rank: 0
16
+ dist_master_addr: null
17
+ dist_master_port: null
18
+ dist_launcher: null
19
+ multiprocessing_distributed: false
20
+ unused_parameters: true
21
+ sharded_ddp: false
22
+ cudnn_enabled: true
23
+ cudnn_benchmark: false
24
+ cudnn_deterministic: true
25
+ collect_stats: false
26
+ write_collected_feats: false
27
+ skip_stats_npz: false
28
+ max_epoch: 100
29
+ patience: 20
30
+ val_scheduler_criterion:
31
+ - valid
32
+ - loss
33
+ early_stopping_criterion:
34
+ - valid
35
+ - loss
36
+ - min
37
+ best_model_criterion:
38
+ - - valid
39
+ - snr
40
+ - max
41
+ - - valid
42
+ - loss
43
+ - min
44
+ keep_nbest_models: 1
45
+ nbest_averaging_interval: 0
46
+ grad_clip: 5.0
47
+ grad_clip_type: 2.0
48
+ grad_noise: false
49
+ accum_grad: 1
50
+ no_forward_run: false
51
+ resume: true
52
+ train_dtype: float32
53
+ use_amp: false
54
+ log_interval: null
55
+ use_matplotlib: true
56
+ use_tensorboard: true
57
+ create_graph_in_tensorboard: false
58
+ use_wandb: false
59
+ wandb_project: null
60
+ wandb_id: null
61
+ wandb_entity: null
62
+ wandb_name: null
63
+ wandb_model_log_interval: -1
64
+ detect_anomaly: false
65
+ pretrain_path: null
66
+ init_param: []
67
+ ignore_init_mismatch: false
68
+ freeze_param: []
69
+ num_iters_per_epoch: null
70
+ batch_size: 4
71
+ valid_batch_size: null
72
+ batch_bins: 1000000
73
+ valid_batch_bins: null
74
+ train_shape_file:
75
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/train/speech_mix_shape
76
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/train/speech_ref1_shape
77
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/train/enroll_ref1_shape
78
+ valid_shape_file:
79
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/valid/speech_mix_shape
80
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/valid/speech_ref1_shape
81
+ - exp/enh_stats_tr_min_8k_cv_min_8k_8k/valid/enroll_ref1_shape
82
+ batch_type: folded
83
+ valid_batch_type: null
84
+ fold_length:
85
+ - 80000
86
+ - 80000
87
+ - 80000
88
+ sort_in_batch: descending
89
+ sort_batch: descending
90
+ multiple_iterator: false
91
+ chunk_length: 32000
92
+ chunk_shift_ratio: 0.5
93
+ num_cache_chunks: 1024
94
+ chunk_excluded_key_prefixes:
95
+ - enroll_ref
96
+ train_data_path_and_name_and_type:
97
+ - - dump/raw/tr_min_8k/wav.scp
98
+ - speech_mix
99
+ - sound
100
+ - - dump/raw/tr_min_8k/spk1.scp
101
+ - speech_ref1
102
+ - sound
103
+ - - dump/raw/tr_min_8k/enroll_spk1.scp
104
+ - enroll_ref1
105
+ - text
106
+ valid_data_path_and_name_and_type:
107
+ - - dump/raw/cv_min_8k/wav.scp
108
+ - speech_mix
109
+ - sound
110
+ - - dump/raw/cv_min_8k/spk1.scp
111
+ - speech_ref1
112
+ - sound
113
+ - - dump/raw/cv_min_8k/enroll_spk1.scp
114
+ - enroll_ref1
115
+ - text
116
+ allow_variable_data_keys: false
117
+ max_cache_size: 0.0
118
+ max_cache_fd: 32
119
+ valid_max_cache_size: null
120
+ exclude_weight_decay: false
121
+ exclude_weight_decay_conf: {}
122
+ optim: adam
123
+ optim_conf:
124
+ lr: 0.001
125
+ eps: 1.0e-08
126
+ weight_decay: 0
127
+ scheduler: reducelronplateau
128
+ scheduler_conf:
129
+ mode: min
130
+ factor: 0.7
131
+ patience: 3
132
+ init: null
133
+ model_conf:
134
+ num_spk: 1
135
+ share_encoder: true
136
+ criterions:
137
+ - name: snr
138
+ conf:
139
+ eps: 1.0e-07
140
+ wrapper: fixed_order
141
+ wrapper_conf:
142
+ weight: 1.0
143
+ train_spk2enroll: null
144
+ enroll_segment: 16000
145
+ load_spk_embedding: false
146
+ load_all_speakers: false
147
+ rir_scp: null
148
+ rir_apply_prob: 1.0
149
+ noise_scp: null
150
+ noise_apply_prob: 1.0
151
+ noise_db_range: '13_15'
152
+ short_noise_thres: 0.5
153
+ speech_volume_normalize: null
154
+ use_reverberant_ref: false
155
+ num_spk: 1
156
+ num_noise_type: 1
157
+ sample_rate: 8000
158
+ force_single_channel: false
159
+ channel_reordering: false
160
+ categories: []
161
+ encoder: conv
162
+ encoder_conf:
163
+ channel: 256
164
+ kernel_size: 16
165
+ stride: 8
166
+ extractor: td_speakerbeam
167
+ extractor_conf:
168
+ layer: 8
169
+ stack: 4
170
+ bottleneck_dim: 256
171
+ hidden_dim: 512
172
+ skip_dim: 256
173
+ kernel: 3
174
+ causal: false
175
+ norm_type: gLN
176
+ nonlinear: relu
177
+ i_adapt_layer: 7
178
+ adapt_layer_type: mul
179
+ adapt_enroll_dim: 256
180
+ use_spk_emb: false
181
+ spk_emb_dim: 256
182
+ decoder: conv
183
+ decoder_conf:
184
+ channel: 256
185
+ kernel_size: 16
186
+ stride: 8
187
+ preprocessor: tse
188
+ preprocessor_conf: {}
189
+ required:
190
+ - output_dir
191
+ version: '202301'
192
+ distributed: false
exp/enh_train_enh_tse_td_speakerbeam_raw/images/backward_time.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/images/clip.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/images/forward_time.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/images/gpu_max_cached_mem_GB.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/images/grad_norm.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/images/iter_time.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/images/loss.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/images/loss_scale.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/images/optim0_lr0.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/images/optim_step_time.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/images/snr_loss.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/images/train_time.png ADDED
exp/enh_train_enh_tse_td_speakerbeam_raw/valid.loss.ave.pth ADDED
@@ -0,0 +1 @@
 
 
1
+ 100epoch.pth
exp/enh_train_enh_tse_td_speakerbeam_raw/valid.loss.ave_1best.pth ADDED
@@ -0,0 +1 @@
 
 
1
+ 100epoch.pth
exp/enh_train_enh_tse_td_speakerbeam_raw/valid.loss.best.pth ADDED
@@ -0,0 +1 @@
 
 
1
+ 100epoch.pth
meta.yaml ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ espnet: '202301'
2
+ files:
3
+ model_file: exp/enh_train_enh_tse_td_speakerbeam_raw/100epoch.pth
4
+ python: "3.8.16 (default, Mar 2 2023, 03:21:46) \n[GCC 11.2.0]"
5
+ timestamp: 1688865949.200904
6
+ torch: 2.0.1
7
+ yaml_files:
8
+ train_config: exp/enh_train_enh_tse_td_speakerbeam_raw/config.yaml