root commited on
Commit
2896e2b
·
1 Parent(s): cafae72

first commit

Browse files
.gitattributes CHANGED
@@ -32,3 +32,7 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
32
  *.zip filter=lfs diff=lfs merge=lfs -text
33
  *.zst filter=lfs diff=lfs merge=lfs -text
34
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
32
  *.zip filter=lfs diff=lfs merge=lfs -text
33
  *.zst filter=lfs diff=lfs merge=lfs -text
34
  *tfevents* filter=lfs diff=lfs merge=lfs -text
35
+ data filter=lfs diff=lfs merge=lfs -text
36
+ exp filter=lfs diff=lfs merge=lfs -text
37
+ decoding-result filter=lfs diff=lfs merge=lfs -text
38
+ test_wavs filter=lfs diff=lfs merge=lfs -text
data/lang_bpe_500/L.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:90ba23c27471509a03c982f2ceb55c2622a84d2d7d31162cffada8efac16c945
3
+ size 330023
data/lang_bpe_500/LG.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8857024d33fc2f06133c02bb3694ec6e716a06f3853eab1d9bfac1fdfac5f62c
3
+ size 64875683
data/lang_bpe_500/Linv.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ccca9f0cced910c93d7fe8e3b5fc34c85d2f440fc2d6d2cfe1f50e7dac5e9acf
3
+ size 330023
data/lang_bpe_500/bpe.model ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4c029f22a5cdd87967db44988bf93f4f33452bdaa4c6fdd8ec5f4deca1b9bf96
3
+ size 247505
data/lang_bpe_500/tokens.txt ADDED
@@ -0,0 +1,503 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ <blk> 0
2
+ <sos/eos> 1
3
+ <unk> 2
4
+ ▁ 3
5
+ ▁དང 4
6
+ ས 5
7
+ ▁པ 6
8
+ ར 7
9
+ ང 8
10
+ ན 9
11
+ ད 10
12
+ ▁བ 11
13
+ ལ 12
14
+ ▁ལ 13
15
+ ག 14
16
+ ▁པའི 15
17
+ འི 16
18
+ ▁ཀྱི 17
19
+ ▁མ 18
20
+ ▁མི 19
21
+ ▁གི 20
22
+ ▁བྱེད 21
23
+ ▁དུ 22
24
+ ▁རྒྱུ 23
25
+ ▁དེ 24
26
+ ▁བྱ 25
27
+ ▁ཡིན 26
28
+ ▁དགོས 27
29
+ ▁ན 28
30
+ ▁ཡོད 29
31
+ ▁རྒྱ 30
32
+ ▁ལས 31
33
+ ▁བའི 32
34
+ ▁པོ 33
35
+ ེ 34
36
+ ▁གྱི 35
37
+ ▁ས 36
38
+ མ 37
39
+ ▁ང 38
40
+ བ 39
41
+ ▁ནི 40
42
+ ▁ག 41
43
+ ▁ནས 42
44
+ ▁རིགས 43
45
+ ▁ལུགས 44
46
+ ▁པར 45
47
+ ▁རང 46
48
+ ོས 47
49
+ ▁ཚོགས 48
50
+ ▁ད 49
51
+ ▁ཆེ 50
52
+ ▁ཚོ 51
53
+ ི 52
54
+ ▁བར 53
55
+ ▁ཞིག 54
56
+ ུ 55
57
+ ▁གཏོང 56
58
+ ▁སྤྱི 57
59
+ ▁དོན 58
60
+ ▁ཏ 59
61
+ ▁བྱས 60
62
+ ▁དག 61
63
+ ▁ཆ 62
64
+ ▁རེད 63
65
+ ▁ཁ 64
66
+ ▁སུ 65
67
+ ▁མེད 66
68
+ ▁གཅི 67
69
+ ▁གནས 68
70
+ ▁ར 69
71
+ ▁མང 70
72
+ ▁བྱུང 71
73
+ ▁ཆེན 72
74
+ ▁འདི 73
75
+ ▁ཡང 74
76
+ ▁རིང 75
77
+ ▁འགྱུར 76
78
+ ▁ནང 77
79
+ ▁འ 78
80
+ ▁ཏུ 79
81
+ ▁ཐུབ 80
82
+ ▁སྐྱེ 81
83
+ གས 82
84
+ ▁རྩ 83
85
+ ▁སྲིད 84
86
+ ▁གས 85
87
+ ོ 86
88
+ ▁ཕྱོགས 87
89
+ ▁ཤུགས 88
90
+ ▁མོ 89
91
+ ▁དམངས 90
92
+ ▁ཆོས 91
93
+ ▁གང 92
94
+ ▁དཔ 93
95
+ ▁སྤེལ 94
96
+ ▁གིས 95
97
+ ▁དུས 96
98
+ ▁གཞི 97
99
+ བས 98
100
+ ▁འཕེལ 99
101
+ ▁དབ 100
102
+ ▁ལྟ 101
103
+ ▁རིམ 102
104
+ ▁བས 103
105
+ ▁གོ 104
106
+ ▁ཐོག 105
107
+ ▁དམ 106
108
+ ▁བོ 107
109
+ ▁སྒོ 108
110
+ ▁མཐུན 109
111
+ ▁ཚང 110
112
+ ▁འདྲ 111
113
+ ▁ལོ 112
114
+ ▁བཞིན 113
115
+ ▁ཡ 114
116
+ ▁ཀྲུ 115
117
+ ▁ཡོང 116
118
+ ▁སློ 117
119
+ ▁ལམ 118
120
+ ▁ལག 119
121
+ ▁ཁབ 120
122
+ ོད 121
123
+ ▁ཀྱིས 122
124
+ འ 123
125
+ ▁ངེས 124
126
+ ▁ཐོན 125
127
+ ▁འཛུགས 126
128
+ ངས 127
129
+ ▁ཚད 128
130
+ ▁འཛིན 129
131
+ ▁རིག 130
132
+ ▁གོང 131
133
+ ོག 132
134
+ ▁ཀྱང 133
135
+ ▁རེ 134
136
+ ▁གཞན 135
137
+ ▁ཞིང 136
138
+ ▁ཚུལ 137
139
+ ▁ལེགས 138
140
+ ▁རུ 139
141
+ ▁སྐྱོང 140
142
+ ▁འབྲེལ 141
143
+ ▁རྐྱེན 142
144
+ ▁ལྡན 143
145
+ ▁ཁག 144
146
+ ▁ཀ 145
147
+ ▁ཅན 146
148
+ ▁ནུས 147
149
+ ▁རྒྱས 148
150
+ ▁ཧ 149
151
+ ▁གཉིས 150
152
+ ▁ཁོང 151
153
+ ུང 152
154
+ ▁ཚན 153
155
+ ▁གྱིས 154
156
+ ▁ཞི 155
157
+ ོན 156
158
+ ▁ལེན 157
159
+ ▁སྤྱོད 158
160
+ ▁ཐག 159
161
+ ▁གསོ 160
162
+ ▁སྔ 161
163
+ ▁ཕན 162
164
+ ▁འཚོ 163
165
+ ▁ཟ 164
166
+ ▁དོ 165
167
+ ▁གནད 166
168
+ ▁བསམ 167
169
+ ▁བཏང 168
170
+ ▁སོ 169
171
+ ▁ཡོངས 170
172
+ ▁ཁྱོ 171
173
+ ▁འགོ 172
174
+ ▁ངོ 173
175
+ ▁ཏེ 174
176
+ ▁བཟོ 175
177
+ ▁ཤེས 176
178
+ ▁ཐ 177
179
+ ▁བློ 178
180
+ ▁ཉི 179
181
+ ▁བདེ 180
182
+ ▁ཁེ 181
183
+ ▁ཤ 182
184
+ ▁འགྲོ 183
185
+ ▁འབྱོ 184
186
+ ▁བུ 185
187
+ ▁ཅ 186
188
+ ▁སྣ 187
189
+ ▁ཙ 188
190
+ ▁ཁོ 189
191
+ ▁སྒྲིག 190
192
+ ▁གཏ 191
193
+ ▁གཞུང 192
194
+ ▁སྐུལ 193
195
+ ུག 194
196
+ ▁ཁྲིམས 195
197
+ ▁ཆོག 196
198
+ ▁གྲོ 197
199
+ ▁ཁྱ 198
200
+ ▁སྲུང 199
201
+ ▁ལྟར 200
202
+ ▁རབས 201
203
+ ▁འགན 202
204
+ ▁ཁུ 203
205
+ ▁ཕྱི 204
206
+ ▁སྐབས 205
207
+ ▁བཅོས 206
208
+ ▁གཙོ 207
209
+ ▁ཐབས 208
210
+ ▁སྐ 209
211
+ ▁སྔོན 210
212
+ ▁གླིང 211
213
+ ▁འཛ 212
214
+ ▁བཅ 213
215
+ ▁ཨེ 214
216
+ ▁གྱུར 215
217
+ ▁ཕ 216
218
+ ▁དབྱི 217
219
+ ▁འབ 218
220
+ ▁བྲ 219
221
+ ▁ཆགས 220
222
+ ▁མཐའ 221
223
+ དག 222
224
+ ▁ཟིག 223
225
+ ▁འོ 224
226
+ ▁ཁང 225
227
+ ▁མཐོ 226
228
+ ▁གཅོད 227
229
+ ▁སྐྲུན 228
230
+ ▁འཐབ 229
231
+ ▁ཤས 230
232
+ མས 231
233
+ ཉམ 232
234
+ ▁གྲུབ 233
235
+ ▁སྡེ 234
236
+ ▁འཐུས 235
237
+ ▁ཁྲིད 236
238
+ ▁ཚ 237
239
+ ▁འཇུག 238
240
+ ▁རྩལ 239
241
+ ▁མཁན 240
242
+ ▁ཡི 241
243
+ ▁དེའི 242
244
+ ▁རྣམས 243
245
+ ▁གསལ 244
246
+ ▁བཤད 245
247
+ ▁སྟོབས 246
248
+ ▁ཆུང 247
249
+ ▁བཀ 248
250
+ ▁ཇུས 249
251
+ ▁ཅི 250
252
+ ▁སོང 251
253
+ ▁ཤིང 252
254
+ ▁ཉེ 253
255
+ ▁གསུམ 254
256
+ ▁ཆུ 255
257
+ ཇེ 256
258
+ ▁དུང 257
259
+ ▁ཟད 258
260
+ ▁ཆད 259
261
+ ▁རྣམ 260
262
+ ▁གྲ 261
263
+ ▁འབྱུང 262
264
+ ▁མཐོང 263
265
+ ▁ཐད 264
266
+ ▁མེ 265
267
+ ▁བརྟ 266
268
+ ྭ 267
269
+ ུལ 268
270
+ ▁འགྲ 269
271
+ ▁ཡུ 270
272
+ ▁མིན 271
273
+ འུ 272
274
+ ▁ཐུ 273
275
+ ▁རི 274
276
+ ▁མྱོང 275
277
+ ▁འཁྱོངས 276
278
+ ▁འཇོག 277
279
+ ▁སྨ 278
280
+ ▁ཨ 279
281
+ ▁སྟེང 280
282
+ ▁ཡུལ 281
283
+ ▁མཚན 282
284
+ ▁ཡོན 283
285
+ ▁ལྷག 284
286
+ ▁ཤི 285
287
+ ▁ནའང 286
288
+ ▁རྗེས 287
289
+ ྒ 288
290
+ ▁གྲངས 289
291
+ ▁འགོག 290
292
+ ▁དགོ 291
293
+ ▁འད 292
294
+ ▁ཕུ 293
295
+ ▁ཁྱབ 294
296
+ ▁ཅིག 295
297
+ ▁ཐང 296
298
+ ▁བསྒྱུར 297
299
+ ▁ཆི 298
300
+ ▁ཕྲ 299
301
+ ▁གན 300
302
+ ▁ཐོབ 301
303
+ ▁ཁྲ 302
304
+ ▁ཉུང 303
305
+ ▁རྩི 304
306
+ ▁མངོན 305
307
+ ▁འོག 306
308
+ ▁འདོན 307
309
+ ེམས 308
310
+ ེད 309
311
+ ▁ཁྲོད 310
312
+ ▁རུང 311
313
+ ▁འབྲས 312
314
+ ▁ཁུངས 313
315
+ ▁པས 314
316
+ ▁བཙ 315
317
+ ▁སྐོར 316
318
+ འོ 317
319
+ ▁ལྡོག 318
320
+ ▁དཀའ 319
321
+ ▁དམིགས 320
322
+ ▁ཙི 321
323
+ ▁སྲོ 322
324
+ ▁མཚུངས 323
325
+ ▁ཕྱིར 324
326
+ ▁བཅུ 325
327
+ ▁སྒྲུབ 326
328
+ ▁ཚེ 327
329
+ ▁སྟེ 328
330
+ ▁སྐྱེད 329
331
+ ▁བསྟར 330
332
+ ▁རོ 331
333
+ ▁བརྩ 332
334
+ ▁རྨ 333
335
+ ▁ཆབ 334
336
+ ▁སྲོལ 335
337
+ ▁དངུལ 336
338
+ ▁བརྒྱུད 337
339
+ ▁མཐུ 338
340
+ ▁སྤྲོ 339
341
+ ▁བཀོད 340
342
+ ▁ཞུ 341
343
+ ▁སྲ 342
344
+ ▁རྒྱག 343
345
+ ▁སྣང 344
346
+ ▁འདུ 345
347
+ ▁ཤིག 346
348
+ ▁ཞུགས 347
349
+ ▁མཚ 348
350
+ ▁འབུ 349
351
+ ▁འདུག 350
352
+ ▁སྣོན 351
353
+ ▁བརྟེན 352
354
+ ▁མཚོན 353
355
+ ▁བཟུང 354
356
+ ▁དཔེ 355
357
+ ▁འཆར 356
358
+ ▁ཁྱི 357
359
+ ▁ཚོང 358
360
+ ▁བཟང 359
361
+ ▁སྦྲ 360
362
+ ▁འཐ 361
363
+ ▁མགྱོགས 362
364
+ ▁རྟ 363
365
+ ▁བཞག 364
366
+ ▁བབ 365
367
+ ▁ནོ 366
368
+ ▁རྩོད 367
369
+ ▁འཕྲ 368
370
+ ▁གཏོད 369
371
+ ▁བསལ 370
372
+ ▁འདེམས 371
373
+ ▁ཚིག 372
374
+ བྱེད 373
375
+ ▁ཟབ 374
376
+ ▁སོགས 375
377
+ ▁གཤིས 376
378
+ ▁གཟུགས 377
379
+ ▁སྡོ 378
380
+ ▁བརྩི 379
381
+ ▁ཧྲ 380
382
+ ▁བརྒྱ 381
383
+ ▁རྩོ 382
384
+ ▁མུ 383
385
+ ▁འདེད 384
386
+ ▁སྦྱོང 385
387
+ ▁འགུ 386
388
+ ▁དྲག 387
389
+ ▁བསྐྱ 388
390
+ ▁རྒོལ 389
391
+ ▁བལྟ 390
392
+ ▁ཕྱིན 391
393
+ ▁རླ 392
394
+ ▁འགྱོ 393
395
+ ▁ཉམས 394
396
+ ▁གྲྭ 395
397
+ ▁ཡིག 396
398
+ ▁འགལ 397
399
+ ▁དབྱང 398
400
+ ▁སྟངས 399
401
+ ▁རྟག 400
402
+ ▁གུ 401
403
+ ▁སྟོན 402
404
+ ུབ 403
405
+ ▁ལུས 404
406
+ ▁གནོད 405
407
+ ▁སྙི 406
408
+ ▁འཇིག 407
409
+ ▁ཨུ 408
410
+ ▁ཕོ 409
411
+ ▁ཐེབས 410
412
+ ▁བླང 411
413
+ ▁སི 412
414
+ ▁མཚོ 413
415
+ ▁དད 414
416
+ ▁ཁྱེ 415
417
+ ▁ཅུ 416
418
+ ▁རྒྱུས 417
419
+ ▁ཞེ 418
420
+ ▁འདྲེ 419
421
+ ▁དགའ 420
422
+ ▁ཆེད 421
423
+ ▁ནོར 422
424
+ ▁མཐར 423
425
+ ▁སྤྱད 424
426
+ ▁ཚུན 425
427
+ ▁རོག 426
428
+ ▁སྒྲིལ 427
429
+ ▁བཞི 428
430
+ ▁འགའ 429
431
+ ▁རྫས 430
432
+ ▁ཟེར 431
433
+ ▁ཇི 432
434
+ ▁ནམ 433
435
+ ▁འགོས 434
436
+ ▁སྡུ 435
437
+ ▁སྐྱོ 436
438
+ ▁ཧུ 437
439
+ ▁སྐྱོན 438
440
+ ▁བརྗེ 439
441
+ ཏིང 440
442
+ ▁ཐེང 441
443
+ ▁ཡག 442
444
+ ྤ 443
445
+ ▁གོམ 444
446
+ ▁འཁོར 445
447
+ ▁སྤོ 446
448
+ ▁ཐོ 447
449
+ ▁ཟིན 448
450
+ ▁བཏོན 449
451
+ ▁སྦྱོ 450
452
+ ▁འདོད 451
453
+ ▁གཞག 452
454
+ ▁བསྒྲུབ 453
455
+ ▁མདུ 454
456
+ ▁ཐུག 455
457
+ ▁ཅིང 456
458
+ ▁གཞོན 457
459
+ ཱ 458
460
+ ཕ 459
461
+ ྟ 460
462
+ ༄ 461
463
+ ༅ 462
464
+ ཎ 463
465
+ ྥ 464
466
+ ཝ 465
467
+ ྕ 466
468
+ ྫ 467
469
+ ྨ 468
470
+ ྙ 469
471
+ ྷ 470
472
+ ྦ 471
473
+ ྗ 472
474
+ ཨ 473
475
+ ཧ 474
476
+ ྔ 475
477
+ ྣ 476
478
+ ཙ 477
479
+ ཇ 478
480
+ ཛ 479
481
+ ྡ 480
482
+ ྩ 481
483
+ ཉ 482
484
+ ླ 483
485
+ ཟ 484
486
+ ཤ 485
487
+ ཅ 486
488
+ ྐ 487
489
+ ཏ 488
490
+ ཞ 489
491
+ ཆ 490
492
+ ཀ 491
493
+ ཐ 492
494
+ ཚ 493
495
+ ཁ 494
496
+ ཡ 495
497
+ ྲ 496
498
+ པ 497
499
+ ྰ 498
500
+ ྱ 499
501
+ #0 500
502
+ #1 501
503
+ #2 502
data/lang_bpe_500/words.txt ADDED
The diff for this file is too large to render. See raw diff
 
decoding-result/beam_search/log-decode-epoch-23-avg-11-beam_search-beam-size-4-use-averaged-model-2022-12-02-13-41-06 ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2022-12-02 13:41:06,396 INFO [decode.py:682] Decoding started
2
+ 2022-12-02 13:41:06,397 INFO [decode.py:688] Device: cuda:0
3
+ 2022-12-02 13:41:06,399 INFO [decode.py:703] {'best_train_loss': inf, 'best_valid_loss': inf, 'best_train_epoch': -1, 'best_valid_epoch': -1, 'batch_idx_train': 0, 'log_interval': 50, 'reset_interval': 200, 'valid_interval': 3000, 'feature_dim': 80, 'subsampling_factor': 4, 'warm_step': 2000, 'env_info': {'k2-version': '1.22', 'k2-build-type': 'Release', 'k2-with-cuda': True, 'k2-git-sha1': '', 'k2-git-date': '', 'lhotse-version': '1.10.0', 'torch-version': '1.12.1', 'torch-cuda-available': True, 'torch-cuda-version': '11.6', 'python-version': '3.9', 'icefall-git-branch': 'master', 'icefall-git-sha1': 'e5d9426-dirty', 'icefall-git-date': 'Tue Nov 22 11:45:03 2022', 'icefall-path': '/root/workspace/icefall', 'k2-path': '/root/workspace/k2/k2/python/k2/__init__.py', 'lhotse-path': '/root/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py', 'hostname': 'VM-0-13-centos', 'IP address': '127.0.0.1'}, 'epoch': 23, 'iter': 0, 'avg': 11, 'use_averaged_model': True, 'exp_dir': PosixPath('pruned_transducer_stateless7/exp'), 'bpe_model': 'data/lang_bpe_500/bpe.model', 'lang_dir': PosixPath('data/lang_bpe_500'), 'decoding_method': 'beam_search', 'beam_size': 4, 'beam': 20.0, 'ngram_lm_scale': 0.01, 'max_contexts': 8, 'max_states': 64, 'context_size': 2, 'max_sym_per_frame': 1, 'num_paths': 200, 'nbest_scale': 0.5, 'simulate_streaming': False, 'decode_chunk_size': 16, 'left_context': 64, 'num_encoder_layers': '2,4,3,2,4', 'feedforward_dims': '1024,1024,2048,2048,1024', 'nhead': '8,8,8,8,8', 'encoder_dims': '384,384,384,384,384', 'attention_dims': '192,192,192,192,192', 'encoder_unmasked_dims': '256,256,256,256,256', 'zipformer_downsampling_factors': '1,2,4,8,2', 'cnn_module_kernels': '31,31,31,31,31', 'decoder_dim': 512, 'joiner_dim': 512, 'manifest_dir': PosixPath('data/fbank'), 'max_duration': 600, 'bucketing_sampler': True, 'num_buckets': 30, 'concatenate_cuts': False, 'duration_factor': 1.0, 'gap': 1.0, 'on_the_fly_feats': False, 'shuffle': True, 'drop_last': True, 'return_cuts': True, 'num_workers': 2, 'enable_spec_aug': True, 'spec_aug_time_warp_factor': 80, 'enable_musan': True, 'input_strategy': 'PrecomputedFeatures', 'res_dir': PosixPath('pruned_transducer_stateless7/exp/beam_search'), 'suffix': 'epoch-23-avg-11-beam_search-beam-size-4-use-averaged-model', 'blank_id': 0, 'unk_id': 2, 'vocab_size': 500}
4
+ 2022-12-02 13:41:06,399 INFO [decode.py:705] About to create model
5
+ 2022-12-02 13:41:06,854 INFO [zipformer.py:179] At encoder stack 4, which has downsampling_factor=2, we will combine the outputs of layers 1 and 3, with downsampling_factors=2 and 8.
6
+ 2022-12-02 13:41:06,864 INFO [decode.py:772] Calculating the averaged model over epoch range from 12 (excluded) to 23
7
+ 2022-12-02 13:41:10,607 INFO [decode.py:806] Number of model parameters: 70369391
8
+ 2022-12-02 13:41:10,607 INFO [asr_datamodule.py:408] About to get test cuts from data/fbank/xbmu_amdo31_cuts_test.jsonl.gz
9
+ 2022-12-02 13:42:14,866 INFO [decode.py:585] batch 0/?, cuts processed until now is 99
10
+ 2022-12-02 13:58:36,011 INFO [decode.py:585] batch 20/?, cuts processed until now is 2000
11
+ 2022-12-02 13:59:08,068 INFO [decode.py:601] The transcripts are stored in pruned_transducer_stateless7/exp/beam_search/recogs-test-beam_size_4-epoch-23-avg-11-beam_search-beam-size-4-use-averaged-model.txt
12
+ 2022-12-02 13:59:08,122 INFO [utils.py:530] [test-beam_size_4] %WER 9.77% [3285 / 33628, 298 ins, 292 del, 2695 sub ]
13
+ 2022-12-02 13:59:08,227 INFO [decode.py:614] Wrote detailed error stats to pruned_transducer_stateless7/exp/beam_search/errs-test-beam_size_4-epoch-23-avg-11-beam_search-beam-size-4-use-averaged-model.txt
14
+ 2022-12-02 13:59:08,228 INFO [decode.py:630]
15
+ For test, WER of different settings are:
16
+ beam_size_4 9.77 best for test
17
+
18
+ 2022-12-02 13:59:08,228 INFO [decode.py:835] Done!
decoding-result/greedy_search/log-decode-epoch-23-avg-11-context-2-max-sym-per-frame-1-use-averaged-model-2022-12-02-11-43-34 ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2022-12-02 11:43:34,585 INFO [decode.py:682] Decoding started
2
+ 2022-12-02 11:43:34,585 INFO [decode.py:688] Device: cuda:0
3
+ 2022-12-02 11:43:34,587 INFO [decode.py:703] {'best_train_loss': inf, 'best_valid_loss': inf, 'best_train_epoch': -1, 'best_valid_epoch': -1, 'batch_idx_train': 0, 'log_interval': 50, 'reset_interval': 200, 'valid_interval': 3000, 'feature_dim': 80, 'subsampling_factor': 4, 'warm_step': 2000, 'env_info': {'k2-version': '1.22', 'k2-build-type': 'Release', 'k2-with-cuda': True, 'k2-git-sha1': '', 'k2-git-date': '', 'lhotse-version': '1.10.0', 'torch-version': '1.12.1', 'torch-cuda-available': True, 'torch-cuda-version': '11.6', 'python-version': '3.9', 'icefall-git-branch': 'master', 'icefall-git-sha1': 'e5d9426-dirty', 'icefall-git-date': 'Tue Nov 22 11:45:03 2022', 'icefall-path': '/root/workspace/icefall', 'k2-path': '/root/workspace/k2/k2/python/k2/__init__.py', 'lhotse-path': '/root/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py', 'hostname': 'VM-0-13-centos', 'IP address': '127.0.0.1'}, 'epoch': 23, 'iter': 0, 'avg': 11, 'use_averaged_model': True, 'exp_dir': PosixPath('pruned_transducer_stateless7/exp'), 'bpe_model': 'data/lang_bpe_500/bpe.model', 'lang_dir': PosixPath('data/lang_bpe_500'), 'decoding_method': 'greedy_search', 'beam_size': 4, 'beam': 20.0, 'ngram_lm_scale': 0.01, 'max_contexts': 8, 'max_states': 64, 'context_size': 2, 'max_sym_per_frame': 1, 'num_paths': 200, 'nbest_scale': 0.5, 'simulate_streaming': False, 'decode_chunk_size': 16, 'left_context': 64, 'num_encoder_layers': '2,4,3,2,4', 'feedforward_dims': '1024,1024,2048,2048,1024', 'nhead': '8,8,8,8,8', 'encoder_dims': '384,384,384,384,384', 'attention_dims': '192,192,192,192,192', 'encoder_unmasked_dims': '256,256,256,256,256', 'zipformer_downsampling_factors': '1,2,4,8,2', 'cnn_module_kernels': '31,31,31,31,31', 'decoder_dim': 512, 'joiner_dim': 512, 'manifest_dir': PosixPath('data/fbank'), 'max_duration': 600, 'bucketing_sampler': True, 'num_buckets': 30, 'concatenate_cuts': False, 'duration_factor': 1.0, 'gap': 1.0, 'on_the_fly_feats': False, 'shuffle': True, 'drop_last': True, 'return_cuts': True, 'num_workers': 2, 'enable_spec_aug': True, 'spec_aug_time_warp_factor': 80, 'enable_musan': True, 'input_strategy': 'PrecomputedFeatures', 'res_dir': PosixPath('pruned_transducer_stateless7/exp/greedy_search'), 'suffix': 'epoch-23-avg-11-context-2-max-sym-per-frame-1-use-averaged-model', 'blank_id': 0, 'unk_id': 2, 'vocab_size': 500}
4
+ 2022-12-02 11:43:34,588 INFO [decode.py:705] About to create model
5
+ 2022-12-02 11:43:35,054 INFO [zipformer.py:179] At encoder stack 4, which has downsampling_factor=2, we will combine the outputs of layers 1 and 3, with downsampling_factors=2 and 8.
6
+ 2022-12-02 11:43:35,064 INFO [decode.py:772] Calculating the averaged model over epoch range from 12 (excluded) to 23
7
+ 2022-12-02 11:43:38,758 INFO [decode.py:806] Number of model parameters: 70369391
8
+ 2022-12-02 11:43:38,758 INFO [asr_datamodule.py:408] About to get test cuts from data/fbank/xbmu_amdo31_cuts_test.jsonl.gz
9
+ 2022-12-02 11:43:42,144 INFO [decode.py:585] batch 0/?, cuts processed until now is 99
10
+ 2022-12-02 11:43:59,551 INFO [decode.py:601] The transcripts are stored in pruned_transducer_stateless7/exp/greedy_search/recogs-test-greedy_search-epoch-23-avg-11-context-2-max-sym-per-frame-1-use-averaged-model.txt
11
+ 2022-12-02 11:43:59,603 INFO [utils.py:530] [test-greedy_search] %WER 10.13% [3405 / 33628, 260 ins, 396 del, 2749 sub ]
12
+ 2022-12-02 11:43:59,709 INFO [decode.py:614] Wrote detailed error stats to pruned_transducer_stateless7/exp/greedy_search/errs-test-greedy_search-epoch-23-avg-11-context-2-max-sym-per-frame-1-use-averaged-model.txt
13
+ 2022-12-02 11:43:59,709 INFO [decode.py:630]
14
+ For test, WER of different settings are:
15
+ greedy_search 10.13 best for test
16
+
17
+ 2022-12-02 11:43:59,709 INFO [decode.py:835] Done!
decoding-result/modified_beam_search/log-decode-epoch-23-avg-11-modified_beam_search-beam-size-4-use-averaged-model-2022-12-02-12-39-55 ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2022-12-02 12:39:55,315 INFO [decode.py:682] Decoding started
2
+ 2022-12-02 12:39:55,315 INFO [decode.py:688] Device: cuda:0
3
+ 2022-12-02 12:39:55,317 INFO [decode.py:703] {'best_train_loss': inf, 'best_valid_loss': inf, 'best_train_epoch': -1, 'best_valid_epoch': -1, 'batch_idx_train': 0, 'log_interval': 50, 'reset_interval': 200, 'valid_interval': 3000, 'feature_dim': 80, 'subsampling_factor': 4, 'warm_step': 2000, 'env_info': {'k2-version': '1.22', 'k2-build-type': 'Release', 'k2-with-cuda': True, 'k2-git-sha1': '', 'k2-git-date': '', 'lhotse-version': '1.10.0', 'torch-version': '1.12.1', 'torch-cuda-available': True, 'torch-cuda-version': '11.6', 'python-version': '3.9', 'icefall-git-branch': 'master', 'icefall-git-sha1': 'e5d9426-dirty', 'icefall-git-date': 'Tue Nov 22 11:45:03 2022', 'icefall-path': '/root/workspace/icefall', 'k2-path': '/root/workspace/k2/k2/python/k2/__init__.py', 'lhotse-path': '/root/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py', 'hostname': 'VM-0-13-centos', 'IP address': '127.0.0.1'}, 'epoch': 23, 'iter': 0, 'avg': 11, 'use_averaged_model': True, 'exp_dir': PosixPath('pruned_transducer_stateless7/exp'), 'bpe_model': 'data/lang_bpe_500/bpe.model', 'lang_dir': PosixPath('data/lang_bpe_500'), 'decoding_method': 'modified_beam_search', 'beam_size': 4, 'beam': 20.0, 'ngram_lm_scale': 0.01, 'max_contexts': 8, 'max_states': 64, 'context_size': 2, 'max_sym_per_frame': 1, 'num_paths': 200, 'nbest_scale': 0.5, 'simulate_streaming': False, 'decode_chunk_size': 16, 'left_context': 64, 'num_encoder_layers': '2,4,3,2,4', 'feedforward_dims': '1024,1024,2048,2048,1024', 'nhead': '8,8,8,8,8', 'encoder_dims': '384,384,384,384,384', 'attention_dims': '192,192,192,192,192', 'encoder_unmasked_dims': '256,256,256,256,256', 'zipformer_downsampling_factors': '1,2,4,8,2', 'cnn_module_kernels': '31,31,31,31,31', 'decoder_dim': 512, 'joiner_dim': 512, 'manifest_dir': PosixPath('data/fbank'), 'max_duration': 600, 'bucketing_sampler': True, 'num_buckets': 30, 'concatenate_cuts': False, 'duration_factor': 1.0, 'gap': 1.0, 'on_the_fly_feats': False, 'shuffle': True, 'drop_last': True, 'return_cuts': True, 'num_workers': 2, 'enable_spec_aug': True, 'spec_aug_time_warp_factor': 80, 'enable_musan': True, 'input_strategy': 'PrecomputedFeatures', 'res_dir': PosixPath('pruned_transducer_stateless7/exp/modified_beam_search'), 'suffix': 'epoch-23-avg-11-modified_beam_search-beam-size-4-use-averaged-model', 'blank_id': 0, 'unk_id': 2, 'vocab_size': 500}
4
+ 2022-12-02 12:39:55,318 INFO [decode.py:705] About to create model
5
+ 2022-12-02 12:39:55,768 INFO [zipformer.py:179] At encoder stack 4, which has downsampling_factor=2, we will combine the outputs of layers 1 and 3, with downsampling_factors=2 and 8.
6
+ 2022-12-02 12:39:55,778 INFO [decode.py:772] Calculating the averaged model over epoch range from 12 (excluded) to 23
7
+ 2022-12-02 12:39:59,502 INFO [decode.py:806] Number of model parameters: 70369391
8
+ 2022-12-02 12:39:59,502 INFO [asr_datamodule.py:408] About to get test cuts from data/fbank/xbmu_amdo31_cuts_test.jsonl.gz
9
+ 2022-12-02 12:40:05,457 INFO [decode.py:585] batch 0/?, cuts processed until now is 99
10
+ 2022-12-02 12:41:03,134 INFO [decode.py:585] batch 20/?, cuts processed until now is 2000
11
+ 2022-12-02 12:41:05,419 INFO [decode.py:601] The transcripts are stored in pruned_transducer_stateless7/exp/modified_beam_search/recogs-test-beam_size_4-epoch-23-avg-11-modified_beam_search-beam-size-4-use-averaged-model.txt
12
+ 2022-12-02 12:41:05,471 INFO [utils.py:530] [test-beam_size_4] %WER 9.70% [3262 / 33628, 283 ins, 292 del, 2687 sub ]
13
+ 2022-12-02 12:41:05,578 INFO [decode.py:614] Wrote detailed error stats to pruned_transducer_stateless7/exp/modified_beam_search/errs-test-beam_size_4-epoch-23-avg-11-modified_beam_search-beam-size-4-use-averaged-model.txt
14
+ 2022-12-02 12:41:05,578 INFO [decode.py:630]
15
+ For test, WER of different settings are:
16
+ beam_size_4 9.7 best for test
17
+
18
+ 2022-12-02 12:41:05,578 INFO [decode.py:835] Done!
exp/cpu_jit.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:95bb54eaab329da342450f3248a038d5c1259a1c0c94857b42047bbb78cb4b6f
3
+ size 281740798
exp/epoch-21.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:319cd1be0acf7a8a19ea06156db5dbbda5c0d61fa5bdddaf12ff862a20fc8e27
3
+ size 1126566559
exp/epoch-22.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c0fd0dad1e34afd1bfa57bda22b58c6a7cc8a4fac48143d07c09c085981e3198
3
+ size 1126566623
exp/epoch-23.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f112ea17965749d3e6d616ab2dce27129709f10adc1591975bef31bd386fb247
3
+ size 1126566623
exp/log/log-train-2022-12-01-19-18-32 ADDED
The diff for this file is too large to render. See raw diff
 
exp/pretrained.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9906a4ee24e4497d57b16a2405888b81cb163d941cf6fb77b759af3a0b7acee1
3
+ size 281766253
exp/tensorboard/events.out.tfevents.1669893512.VM-0-13-centos.31587.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eae516765c8ebc2428e7cc76179701315be5f24b0c72499c9dd866f6ab6454dd
3
+ size 372641
test_wavs/a_0_cacm-A70_31116.wav ADDED
Binary file (97.4 kB). View file
 
test_wavs/a_0_cacm-A70_31117.wav ADDED
Binary file (128 kB). View file
 
test_wavs/a_0_cacm-A70_31118.wav ADDED
Binary file (87.1 kB). View file
 
test_wavs/trans.txt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ a_0_cacm-A70_31116.wav ལོ བཅུ ཙམ མ འདང བའི དུས སྐབས ནང
2
+ a_0_cacm-A70_31117.wav དྲག པོའི ངོ ལོག ཟིང འཁྲུག སྒྲིག འཛུགས དང ངན བཀོད བྱས ཡོད
3
+ a_0_cacm-A70_31118.wav གནས བབ འདིའི རིགས གང མགྱོགས འགྱུར བ གཏོང དགོས