icefall-asr-librispeech-100h-transducer-stateless-multi-datasets-bpe-500-2022-02-21 / log /log-decode-epoch-57-avg-17-context-2-max-sym-per-frame-2-2022-02-21-11-31-01
csukuangfj's picture
First commit.
d3c37c8
2022-02-21 11:31:01,353 INFO [decode.py:421] Decoding started
2022-02-21 11:31:01,353 INFO [decode.py:427] Device: cuda:0
2022-02-21 11:31:01,360 INFO [decode.py:436] {'feature_dim': 80, 'encoder_out_dim': 512, 'subsampling_factor': 4, 'attention_dim': 512, 'nhead': 8, 'dim_feedforward': 2048, 'num_encoder_layers': 12, 'vgg_frontend': False, 'env_info': {'k2-version': '1.13', 'k2-build-type': 'Release', 'k2-with-cuda': True, 'k2-git-sha1': 'f4fefe4882bc0ae59af951da3f47335d5495ef71', 'k2-git-date': 'Thu Feb 10 15:16:02 2022', 'lhotse-version': '1.0.0.dev+missing.version.file', 'torch-cuda-available': True, 'torch-cuda-version': '10.2', 'python-version': '3.8', 'icefall-git-branch': 'multiple-datasets', 'icefall-git-sha1': '61b0019-dirty', 'icefall-git-date': 'Thu Feb 17 18:34:48 2022', 'icefall-path': '/ceph-fj/fangjun/open-source-2/icefall-multi-datasets', 'k2-path': '/ceph-fj/fangjun/open-source-2/k2-multi-datasets/k2/python/k2/__init__.py', 'lhotse-path': '/ceph-fj/fangjun/open-source-2/lhotse-multi-datasets/lhotse/__init__.py', 'hostname': 'de-74279-k2-dev-1-0203142625-9776c46db-pk7w6', 'IP address': '10.177.22.139'}, 'epoch': 57, 'avg': 17, 'exp_dir': PosixPath('transducer_stateless_multi_datasets/exp-100-2'), 'bpe_model': './data/lang_bpe_500/bpe.model', 'decoding_method': 'greedy_search', 'beam_size': 4, 'context_size': 2, 'max_sym_per_frame': 2, 'max_duration': 100, 'bucketing_sampler': True, 'num_buckets': 30, 'shuffle': True, 'return_cuts': True, 'num_workers': 2, 'enable_spec_aug': True, 'spec_aug_time_warp_factor': 80, 'enable_musan': True, 'manifest_dir': PosixPath('data/fbank'), 'on_the_fly_feats': False, 'res_dir': PosixPath('transducer_stateless_multi_datasets/exp-100-2/greedy_search'), 'suffix': 'epoch-57-avg-17-context-2-max-sym-per-frame-2', 'blank_id': 0, 'vocab_size': 500}
2022-02-21 11:31:01,360 INFO [decode.py:438] About to create model
2022-02-21 11:31:01,990 INFO [decode.py:449] averaging ['transducer_stateless_multi_datasets/exp-100-2/epoch-41.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-42.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-43.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-44.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-45.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-46.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-47.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-48.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-49.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-50.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-51.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-52.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-53.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-54.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-55.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-56.pt', 'transducer_stateless_multi_datasets/exp-100-2/epoch-57.pt']
2022-02-21 11:32:44,770 INFO [decode.py:460] Number of model parameters: 84521448
2022-02-21 11:32:44,770 INFO [librispeech.py:58] About to get test-clean cuts from data/fbank/cuts_test-clean.json.gz
2022-02-21 11:32:44,994 INFO [librispeech.py:63] About to get test-other cuts from data/fbank/cuts_test-other.json.gz
2022-02-21 11:32:47,120 INFO [decode.py:346] batch 0/?, cuts processed until now is 20
2022-02-21 11:35:12,670 INFO [decode.py:346] batch 100/?, cuts processed until now is 1406
2022-02-21 11:37:35,107 INFO [decode.py:346] batch 200/?, cuts processed until now is 2563
2022-02-21 11:37:48,512 INFO [decode.py:363] The transcripts are stored in transducer_stateless_multi_datasets/exp-100-2/greedy_search/recogs-test-clean-greedy_search-epoch-57-avg-17-context-2-max-sym-per-frame-2.txt
2022-02-21 11:37:48,609 INFO [utils.py:404] [test-clean-greedy_search] %WER 6.34% [3331 / 52576, 352 ins, 328 del, 2651 sub ]
2022-02-21 11:37:48,921 INFO [decode.py:376] Wrote detailed error stats to transducer_stateless_multi_datasets/exp-100-2/greedy_search/errs-test-clean-greedy_search-epoch-57-avg-17-context-2-max-sym-per-frame-2.txt
2022-02-21 11:37:48,922 INFO [decode.py:393]
For test-clean, WER of different settings are:
greedy_search 6.34 best for test-clean
2022-02-21 11:37:50,838 INFO [decode.py:346] batch 0/?, cuts processed until now is 23
2022-02-21 11:40:17,151 INFO [decode.py:346] batch 100/?, cuts processed until now is 1614
2022-02-21 11:42:30,546 INFO [decode.py:346] batch 200/?, cuts processed until now is 2899
2022-02-21 11:42:38,702 INFO [decode.py:363] The transcripts are stored in transducer_stateless_multi_datasets/exp-100-2/greedy_search/recogs-test-other-greedy_search-epoch-57-avg-17-context-2-max-sym-per-frame-2.txt
2022-02-21 11:42:38,808 INFO [utils.py:404] [test-other-greedy_search] %WER 16.70% [8740 / 52343, 841 ins, 1064 del, 6835 sub ]
2022-02-21 11:42:39,142 INFO [decode.py:376] Wrote detailed error stats to transducer_stateless_multi_datasets/exp-100-2/greedy_search/errs-test-other-greedy_search-epoch-57-avg-17-context-2-max-sym-per-frame-2.txt
2022-02-21 11:42:39,144 INFO [decode.py:393]
For test-other, WER of different settings are:
greedy_search 16.7 best for test-other
2022-02-21 11:42:39,144 INFO [decode.py:488] Done!