Siddhant committed on
Commit
7bbad55
1 Parent(s): daabfe9

Import from zenodo

README.md ADDED
@@ -0,0 +1,43 @@
+ ---
+ tags:
+ - espnet
+ - audio
+ - automatic-speech-recognition
+ language: en
+ datasets:
+ - fsc
+ license: cc-by-4.0
+ ---
+ ## ESPnet2 ASR pretrained model
+ ### `siddhana/slurp_entity_asr_train_asr_conformer_raw_en_word_valid.acc.ave_10best`
+ ♻️ Imported from https://zenodo.org/record/5590204
+
+ This model was trained by siddhana using fsc/asr1 recipe in [espnet](https://github.com/espnet/espnet/).
+ ### Demo: How to use in ESPnet2
+ ```python
+ # coming soon
+ ```
+ ### Citing ESPnet
+ ```BibTex
+ @inproceedings{watanabe2018espnet,
+   author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson {Enrique Yalta Soplin} and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
+   title={{ESPnet}: End-to-End Speech Processing Toolkit},
+   year={2018},
+   booktitle={Proceedings of Interspeech},
+   pages={2207--2211},
+   doi={10.21437/Interspeech.2018-1456},
+   url={http://dx.doi.org/10.21437/Interspeech.2018-1456}
+ }
+
+ ```
+ or arXiv:
+ ```bibtex
+ @misc{watanabe2018espnet,
+   title={ESPnet: End-to-End Speech Processing Toolkit},
+   author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson Enrique Yalta Soplin and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
+   year={2018},
+   eprint={1804.00015},
+   archivePrefix={arXiv},
+   primaryClass={cs.CL}
+ }
+ ```
exp/asr_train_asr_conformer_raw_en_word/RESULTS.md ADDED
@@ -0,0 +1,29 @@
+ <!-- Generated by scripts/utils/show_asr_result.sh -->
+ # RESULTS
+ ## Environments
+ - date: `Wed Oct 6 23:32:47 EDT 2021`
+ - python version: `3.9.5 (default, Jun 4 2021, 12:28:51) [GCC 7.5.0]`
+ - espnet version: `espnet 0.10.3a2`
+ - pytorch version: `pytorch 1.8.1+cu102`
+ - Git hash: `8ef7bd675815ae2fbaba930a53c8ad4ae0ad19af`
+ - Commit date: `Sat Sep 11 10:05:59 2021 +0900`
+
+ ## asr_train_asr_conformer_raw_en_word
+ ### WER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |inference_asr_model_valid.acc.ave_10best/devel|8690|177293|80.1|8.9|10.9|3.1|23.0|59.7|
+ |inference_asr_model_valid.acc.ave_10best/test|13078|261300|79.3|9.1|11.6|2.8|23.5|58.7|
+
+ ### CER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |inference_asr_model_valid.acc.ave_10best/devel|8690|846635|87.8|3.5|8.7|3.7|16.0|59.7|
+ |inference_asr_model_valid.acc.ave_10best/test|13078|1244599|86.8|3.5|9.6|3.4|16.6|58.7|
+
+ ### TER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
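The WER and CER tables use the sclite-style layout, in which the error rate is the sum of the substitution, deletion, and insertion rates relative to the number of reference tokens. A minimal sketch that rechecks the `Err` column from the rounded percentages reported above; the `rows` dictionary, the helper names, and the 0.15 tolerance are illustrative choices, not part of ESPnet. Differences of 0.1 are expected because each column was rounded independently.

```python
# Recompute Err = Sub + Del + Ins from the rounded table percentages.
# Row labels, helper names, and the tolerance are illustrative only.
rows = {
    "WER devel": {"sub": 8.9, "del": 10.9, "ins": 3.1, "err": 23.0},
    "WER test": {"sub": 9.1, "del": 11.6, "ins": 2.8, "err": 23.5},
    "CER devel": {"sub": 3.5, "del": 8.7, "ins": 3.7, "err": 16.0},
    "CER test": {"sub": 3.5, "del": 9.6, "ins": 3.4, "err": 16.6},
}


def recomputed_error(row):
    """Sum the three error types, rounded to one decimal like the table."""
    return round(row["sub"] + row["del"] + row["ins"], 1)


def is_consistent(row, tolerance=0.15):
    """True when the reported Err matches the sum up to rounding noise."""
    return abs(recomputed_error(row) - row["err"]) <= tolerance


for name, row in rows.items():
    print(name, recomputed_error(row), row["err"], is_consistent(row))
```

All four rows agree with their reported `Err` values to within one rounding step.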
exp/asr_train_asr_conformer_raw_en_word/config.yaml ADDED
@@ -0,0 +1,793 @@
+ config: conf/tuning/train_asr_conformer.yaml
+ print_config: false
+ log_level: INFO
+ dry_run: false
+ iterator_type: sequence
+ output_dir: exp/asr_train_asr_conformer_raw_en_word
+ ngpu: 1
+ seed: 0
+ num_workers: 1
+ num_att_plot: 3
+ dist_backend: nccl
+ dist_init_method: env://
+ dist_world_size: null
+ dist_rank: null
+ local_rank: 0
+ dist_master_addr: null
+ dist_master_port: null
+ dist_launcher: null
+ multiprocessing_distributed: false
+ unused_parameters: false
+ sharded_ddp: false
+ cudnn_enabled: true
+ cudnn_benchmark: false
+ cudnn_deterministic: true
+ collect_stats: false
+ write_collected_feats: false
+ max_epoch: 40
+ patience: null
+ val_scheduler_criterion:
+ - valid
+ - loss
+ early_stopping_criterion:
+ - valid
+ - loss
+ - min
+ best_model_criterion:
+ -   - valid
+     - acc
+     - max
+ keep_nbest_models: 10
+ grad_clip: 5.0
+ grad_clip_type: 2.0
+ grad_noise: false
+ accum_grad: 1
+ no_forward_run: false
+ resume: true
+ train_dtype: float32
+ use_amp: false
+ log_interval: null
+ use_tensorboard: true
+ use_wandb: false
+ wandb_project: null
+ wandb_id: null
+ wandb_entity: null
+ wandb_name: null
+ wandb_model_log_interval: -1
+ detect_anomaly: false
+ pretrain_path: null
+ init_param: []
+ ignore_init_mismatch: false
+ freeze_param: []
+ num_iters_per_epoch: null
+ batch_size: 20
+ valid_batch_size: null
+ batch_bins: 1000000
+ valid_batch_bins: null
+ train_shape_file:
+ - exp/asr_stats_raw_en_word/train/speech_shape
+ - exp/asr_stats_raw_en_word/train/text_shape.word
+ valid_shape_file:
+ - exp/asr_stats_raw_en_word/valid/speech_shape
+ - exp/asr_stats_raw_en_word/valid/text_shape.word
+ batch_type: folded
+ valid_batch_type: null
+ fold_length:
+ - 80000
+ - 150
+ sort_in_batch: descending
+ sort_batch: descending
+ multiple_iterator: false
+ chunk_length: 500
+ chunk_shift_ratio: 0.5
+ num_cache_chunks: 1024
+ train_data_path_and_name_and_type:
+ -   - dump/raw/train/wav.scp
+     - speech
+     - sound
+ -   - dump/raw/train/text
+     - text
+     - text
+ valid_data_path_and_name_and_type:
+ -   - dump/raw/devel/wav.scp
+     - speech
+     - sound
+ -   - dump/raw/devel/text
+     - text
+     - text
+ allow_variable_data_keys: false
+ max_cache_size: 0.0
+ max_cache_fd: 32
+ valid_max_cache_size: null
+ optim: adam
+ optim_conf:
+     lr: 0.0002
+ scheduler: warmuplr
+ scheduler_conf:
+     warmup_steps: 25000
+ token_list:
+ - <blank>
+ - <unk>
+ - ▁SEP
+ - ▁FILL
+ - s
+ - ▁the
+ - ▁to
+ - a
+ - ▁i
+ - ▁me
+ - ▁s
+ - ▁a
+ - i
+ - e
+ - ▁you
+ - ▁what
+ - er
+ - u
+ - ▁is
+ - ''''
+ - o
+ - ing
+ - p
+ - ▁in
+ - ▁p
+ - y
+ - ▁my
+ - ▁please
+ - d
+ - m
+ - ▁b
+ - l
+ - ▁c
+ - date
+ - n
+ - ▁d
+ - c
+ - le
+ - b
+ - ▁m
+ - ▁for
+ - re
+ - st
+ - ▁on
+ - t
+ - en
+ - h
+ - 'on'
+ - ar
+ - person
+ - ▁re
+ - ▁f
+ - ▁g
+ - ▁of
+ - an
+ - ▁
+ - g
+ - ▁today
+ - ▁t
+ - or
+ - ▁it
+ - ▁this
+ - ▁h
+ - r
+ - f
+ - at
+ - ch
+ - ce
+ - place_name
+ - ▁email
+ - ▁do
+ - es
+ - ri
+ - ▁e
+ - ▁w
+ - ic
+ - in
+ - ▁that
+ - event_name
+ - ▁play
+ - ▁and
+ - al
+ - ▁n
+ - ▁can
+ - email_query
+ - ve
+ - ▁new
+ - day
+ - it
+ - ate
+ - ▁from
+ - ▁have
+ - k
+ - time
+ - ▁am
+ - media_type
+ - email_sendemail
+ - ent
+ - ▁olly
+ - qa_factoid
+ - v
+ - et
+ - ck
+ - ▁any
+ - calendar_set
+ - ly
+ - se
+ - th
+ - ▁how
+ - ▁meeting
+ - ed
+ - ▁tell
+ - ▁st
+ - x
+ - ur
+ - ▁at
+ - ro
+ - nd
+ - ▁list
+ - ou
+ - w
+ - ▁u
+ - ▁not
+ - ▁about
+ - ▁an
+ - ▁o
+ - general_negate
+ - ut
+ - ▁time
+ - ▁be
+ - ▁ch
+ - ▁are
+ - social_post
+ - business_name
+ - ty
+ - la
+ - play_music
+ - ot
+ - general_quirky
+ - ▁l
+ - ▁sh
+ - ▁tweet
+ - om
+ - ▁week
+ - um
+ - ▁one
+ - ter
+ - ▁he
+ - ▁up
+ - ▁com
+ - general_praise
+ - weather_query
+ - ▁next
+ - ▁th
+ - ▁check
+ - calendar_query
+ - ▁last
+ - ▁ro
+ - ad
+ - is
+ - ▁with
+ - ay
+ - ▁send
+ - pe
+ - ▁pm
+ - ▁tomorrow
+ - ▁j
+ - un
+ - ▁train
+ - general_explain
+ - ▁v
+ - one
+ - ▁r
+ - ra
+ - news_query
+ - ation
+ - ▁emails
+ - us
+ - if
+ - ct
+ - ▁co
+ - ▁add
+ - ▁will
+ - ▁se
+ - nt
+ - ▁was
+ - ine
+ - ▁de
+ - ▁set
+ - ▁ex
+ - ▁would
+ - ir
+ - ow
+ - ber
+ - general_repeat
+ - ight
+ - ook
+ - ▁again
+ - ▁song
+ - currency_name
+ - ll
+ - ▁go
+ - ▁ha
+ - relation
+ - te
+ - ion
+ - and
+ - ▁y
+ - ▁ye
+ - general_affirm
+ - general_confirm
+ - ery
+ - ▁po
+ - ff
+ - ▁we
+ - ▁turn
+ - ▁did
+ - ▁mar
+ - ▁alarm
+ - ▁like
+ - datetime_query
+ - ers
+ - ▁all
+ - ▁remind
+ - ▁so
+ - qa_definition
+ - ▁calendar
+ - end
+ - ▁said
+ - ci
+ - ▁off
+ - ▁john
+ - ss
+ - ▁day
+ - pla
+ - ume
+ - ▁get
+ - pp
+ - ail
+ - z
+ - ry
+ - am
+ - ▁need
+ - as
+ - ▁thank
+ - ▁wh
+ - ▁want
+ - ▁right
+ - ▁jo
+ - ▁facebook
+ - ▁k
+ - ge
+ - ld
+ - ▁fri
+ - ▁two
+ - general_dontcare
+ - ▁news
+ - ol
+ - oo
+ - ant
+ - ▁five
+ - ▁event
+ - ake
+ - definition_word
+ - transport_type
+ - ▁your
+ - vi
+ - op
+ - ▁weather
+ - ome
+ - ▁app
+ - ▁lo
+ - de
+ - ▁music
+ - weather_descriptor
+ - ak
+ - ke
+ - ▁there
+ - ▁si
+ - ▁lights
+ - ▁now
+ - ▁mo
+ - calendar_remove
+ - our
+ - ▁dollar
+ - food_type
+ - me
+ - ▁more
+ - ▁no
+ - ▁birthday
+ - orrect
+ - ▁rep
+ - ▁show
+ - play_radio
+ - ▁mon
+ - ▁does
+ - ood
+ - li
+ - ag
+ - ▁sto
+ - ▁contact
+ - cket
+ - email_querycontact
+ - ▁ev
+ - ▁could
+ - ▁just
+ - ange
+ - out
+ - ame
+ - .
+ - ▁ja
+ - ▁confirm
+ - qa_currency
+ - ▁man
+ - ▁some
+ - ▁late
+ - ▁think
+ - timeofday
+ - ▁bo
+ - qa_stock
+ - ong
+ - ▁start
+ - ▁work
+ - ▁ten
+ - int
+ - ▁command
+ - all
+ - ▁make
+ - ▁la
+ - j
+ - ▁answ
+ - ▁hour
+ - ▁cle
+ - ah
+ - ▁find
+ - ▁service
+ - ▁fa
+ - qu
+ - general_commandstop
+ - ai
+ - ▁when
+ - ard
+ - ▁te
+ - ▁by
+ - social_query
+ - ul
+ - ▁tw
+ - id
+ - ▁seven
+ - ▁where
+ - ▁much
+ - art
+ - ▁appointment
+ - ver
+ - artist_name
+ - el
+ - device_type
+ - ▁know
+ - ▁three
+ - ▁events
+ - ▁tr
+ - ▁li
+ - ork
+ - red
+ - ect
+ - ▁let
+ - ▁respon
+ - ▁par
+ - zz
+ - ▁give
+ - ▁twenty
+ - ac
+ - ▁morning
+ - ▁ti
+ - est
+ - ▁curre
+ - play_podcasts
+ - ▁radio
+ - cooking_recipe
+ - transport_query
+ - ▁con
+ - gh
+ - ▁le
+ - lists_query
+ - ▁rem
+ - recommendation_events
+ - house_place
+ - alarm_set
+ - play_audiobook
+ - ist
+ - ase
+ - music_genre
+ - ive
+ - ast
+ - ort
+ - player_setting
+ - lly
+ - news_topic
+ - list_name
+ - ▁playlist
+ - ▁ne
+ - business_type
+ - personal_info
+ - ind
+ - ust
+ - di
+ - ress
+ - recommendation_locations
+ - lists_createoradd
+ - iot_hue_lightoff
+ - lists_remove
+ - ord
+ - ▁light
+ - ere
+ - alarm_query
+ - audio_volume_mute
+ - music_query
+ - ▁audio
+ - rain
+ - ▁date
+ - ▁order
+ - audio_volume_up
+ - ▁ar
+ - ▁podcast
+ - transport_ticket
+ - mail
+ - iot_hue_lightchange
+ - iot_coffee
+ - radio_name
+ - ill
+ - ▁ri
+ - '@'
+ - takeaway_query
+ - song_name
+ - takeaway_order
+ - ▁ra
+ - email_addcontact
+ - play_game
+ - book
+ - transport_traffic
+ - ▁house
+ - music_likeness
+ - her
+ - transport_taxi
+ - iot_hue_lightdim
+ - ment
+ - ght
+ - fo
+ - order_type
+ - color_type
+ - '1'
+ - ven
+ - ould
+ - general_joke
+ - ess
+ - ain
+ - qa_maths
+ - ▁place
+ - ▁twe
+ - cast
+ - iot_cleaning
+ - ▁che
+ - ▁cont
+ - orn
+ - ith
+ - audiobook_name
+ - email_address
+ - game_name
+ - ▁cal
+ - general_frequency
+ - ▁tom
+ - ▁food
+ - act
+ - iot_hue_lightup
+ - '2'
+ - alarm_remove
+ - podcast_descriptor
+ - ▁definition
+ - audio_volume_down
+ - ▁media
+ - email_folder
+ - dia
+ - meal_type
+ - ▁mus
+ - recommendation_movies
+ - ▁ad
+ - ree
+ - pt
+ - now
+ - playlist_name
+ - ▁person
+ - change_amount
+ - ▁pla
+ - escri
+ - datetime_convert
+ - podcast_name
+ - ▁ab
+ - time_zone
+ - ▁def
+ - ting
+ - iot_wemo_on
+ - music_settings
+ - iot_wemo_off
+ - orre
+ - cy
+ - ank
+ - music_descriptor
+ - lar
+ - app_name
+ - row
+ - joke_type
+ - set
+ - xt
+ - of
+ - ition
+ - ▁meet
+ - ink
+ - ▁confir
+ - transport_agency
+ - general_greet
+ - ▁business
+ - ▁art
+ - ▁ag
+ - urn
+ - escript
+ - rom
+ - ▁rel
+ - ▁au
+ - ▁currency
+ - audio_volume_other
+ - iot_hue_lighton
+ - ▁artist
+ - ▁bus
+ - '?'
+ - cooking_type
+ - movie_name
+ - coffee_type
+ - ingredient
+ - music_dislikeness
+ - ather
+ - sp
+ - q
+ - ▁ser
+ - esc
+ - ▁bir
+ - ▁cur
+ - name
+ - ▁tran
+ - ▁hou
+ - ek
+ - uch
+ - ▁conf
+ - ▁face
+ - '9'
+ - ▁birth
+ - I
+ - sw
+ - transport_descriptor
+ - ▁comm
+ - lease
+ - transport_name
+ - aid
+ - movie_type
+ - ▁device
+ - alarm_type
+ - audiobook_author
+ - '5'
+ - drink_type
+ - ▁joh
+ - ▁defin
+ - word
+ - ▁curren
+ - order
+ - iness
+ - W
+ - cooking_query
+ - sport_type
+ - ▁relation
+ - oint
+ - H
+ - '8'
+ - '0'
+ - ▁dol
+ - vice
+ - ▁pers
+ - '&'
+ - T
+ - ▁appoint
+ - A
+ - '7'
+ - '3'
+ - '-'
+ - game_type
+ - ▁pod
+ - _
+ - list
+ - music_album
+ - dio
+ - ▁transport
+ - qa_query
+ - C
+ - O
+ - U
+ - query_detail
+ - spon
+ - <sos/eos>
+ init: null
+ input_size: null
+ ctc_conf:
+     dropout_rate: 0.0
+     ctc_type: builtin
+     reduce: true
+     ignore_nan_grad: true
+ model_conf:
+     ctc_weight: 0.3
+     lsm_weight: 0.1
+     length_normalized_loss: false
+     extract_feats_in_collect_stats: false
+ use_preprocessor: true
+ token_type: word
+ bpemodel: null
+ non_linguistic_symbols: null
+ cleaner: null
+ g2p: null
+ speech_volume_normalize: null
+ rir_scp: null
+ rir_apply_prob: 1.0
+ noise_scp: null
+ noise_apply_prob: 1.0
+ noise_db_range: '13_15'
+ frontend: default
+ frontend_conf:
+     fs: 16k
+ specaug: specaug
+ specaug_conf:
+     apply_time_warp: true
+     time_warp_window: 5
+     time_warp_mode: bicubic
+     apply_freq_mask: true
+     freq_mask_width_range:
+     - 0
+     - 30
+     num_freq_mask: 2
+     apply_time_mask: true
+     time_mask_width_range:
+     - 0
+     - 40
+     num_time_mask: 2
+ normalize: utterance_mvn
+ normalize_conf: {}
+ preencoder: null
+ preencoder_conf: {}
+ encoder: conformer
+ encoder_conf:
+     output_size: 512
+     attention_heads: 8
+     linear_units: 2048
+     num_blocks: 12
+     dropout_rate: 0.1
+     positional_dropout_rate: 0.1
+     attention_dropout_rate: 0.1
+     input_layer: conv2d
+     normalize_before: true
+     macaron_style: true
+     pos_enc_layer_type: rel_pos
+     selfattention_layer_type: rel_selfattn
+     activation_type: swish
+     use_cnn_module: true
+     cnn_module_kernel: 31
+ postencoder: null
+ postencoder_conf: {}
+ decoder: transformer
+ decoder_conf:
+     attention_heads: 8
+     linear_units: 2048
+     num_blocks: 6
+     dropout_rate: 0.1
+     positional_dropout_rate: 0.1
+     self_attention_dropout_rate: 0.1
+     src_attention_dropout_rate: 0.1
+ required:
+ - output_dir
+ - token_list
+ version: 0.10.3a2
+ distributed: false
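The config pairs `optim: adam` (`lr: 0.0002`) with `scheduler: warmuplr` (`warmup_steps: 25000`). A minimal sketch of the learning-rate curve this implies, assuming the Noam-style rule that ESPnet2's `WarmupLR` scheduler documents: `lr * warmup_steps**0.5 * min(step**-0.5, step * warmup_steps**-1.5)`. The function name `warmup_lr` is hypothetical, and the snippet does not import ESPnet or PyTorch.

```python
def warmup_lr(step, base_lr=0.0002, warmup_steps=25000):
    """Learning rate at a 1-based optimizer step under the assumed rule.

    Defaults are copied from optim_conf.lr and scheduler_conf.warmup_steps.
    """
    if step < 1:
        raise ValueError("step must be >= 1")
    return base_lr * warmup_steps ** 0.5 * min(
        step ** -0.5, step * warmup_steps ** -1.5
    )


# Print the rate during warmup, at the peak, and during decay.
for step in (1, 12500, 25000, 100000):
    print(step, warmup_lr(step))
```

Under this rule the rate rises linearly over the first 25000 steps, peaks at the configured `lr`, and then decays with the inverse square root of the step count.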
exp/asr_train_asr_conformer_raw_en_word/images/acc.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/backward_time.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/cer.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/cer_ctc.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/forward_time.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/gpu_max_cached_mem_GB.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/iter_time.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/loss.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/loss_att.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/loss_ctc.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/optim0_lr0.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/optim_step_time.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/train_time.png ADDED
exp/asr_train_asr_conformer_raw_en_word/images/wer.png ADDED
exp/asr_train_asr_conformer_raw_en_word/valid.acc.ave_10best.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f85bd4f2c885c262e6fb1e3eb214e000eb6f69cf4846e7d530635c71c7783646
+ size 437939867
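The file name `valid.acc.ave_10best.pth`, read together with `best_model_criterion` (valid, acc, max) and `keep_nbest_models: 10` in the config, suggests that this checkpoint is an element-wise average of the ten checkpoints with the highest validation accuracy. A minimal sketch of that idea on plain Python lists; the helper `average_checkpoints` and the toy parameter name are hypothetical, and ESPnet's own averaging works on PyTorch state dicts rather than lists.

```python
def average_checkpoints(checkpoints):
    """Element-wise mean of parameter vectors that share the same keys."""
    if not checkpoints:
        raise ValueError("need at least one checkpoint")
    averaged = {}
    for key in checkpoints[0].keys():
        vectors = [ckpt[key] for ckpt in checkpoints]
        # zip(*vectors) walks the same position across every checkpoint.
        averaged[key] = [sum(vals) / len(vals) for vals in zip(*vectors)]
    return averaged


# Toy stand-ins for the n best checkpoints (two here, for brevity).
best = [
    {"encoder.weight": [1.0, 2.0]},
    {"encoder.weight": [3.0, 6.0]},
]
print(average_checkpoints(best))
```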
meta.yaml ADDED
@@ -0,0 +1,8 @@
+ espnet: 0.10.3a2
+ files:
+   asr_model_file: exp/asr_train_asr_conformer_raw_en_word/valid.acc.ave_10best.pth
+ python: "3.9.5 (default, Jun 4 2021, 12:28:51) \n[GCC 7.5.0]"
+ timestamp: 1636234622.134315
+ torch: 1.8.1+cu102
+ yaml_files:
+   asr_train_config: exp/asr_train_asr_conformer_raw_en_word/config.yaml
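The `timestamp` field in `meta.yaml` looks like a Unix epoch time in seconds. That reading is an assumption, since the file does not state the unit. Under it, the value can be decoded with the standard library (the variable names are illustrative):

```python
from datetime import datetime, timezone

timestamp = 1636234622.134315  # value copied from meta.yaml

# Interpret the value as seconds since 1970-01-01 00:00:00 UTC.
packed_at = datetime.fromtimestamp(timestamp, tz=timezone.utc)
print(packed_at.strftime("%Y-%m-%d %H:%M:%S %Z"))
```

Read this way, the metadata was written on 6 November 2021 (UTC), about a month after the `Wed Oct 6 ... 2021` date recorded in `RESULTS.md`.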