add pretrained files for pruned-rnnt2 wenetspeech
ffa2a91
(k2-python) luomingshuang@de-74279-k2-train-1-0307195509-54c966b95f-rtpfq:~/icefall/egs/wenetspeech/ASR$ CUDA_VISIBLE_DEVICES='5' python pruned_transducer_stateless2/pretrained.py --checkpoint icefall_asr_wenetspeech_pruned_transducer_stateless2/exp/pretrained_epoch_10_avg_2.pt --lang-dir icefall_asr_wenetspeech_pruned_transducer_stateless2/data/lang_char --decoding-method greedy_search --sample-rate 48000 icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000000.opus icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000001.opus icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000002.opus
2022-05-20 00:13:05,888 INFO [lexicon.py:176] Loading pre-compiled icefall_asr_wenetspeech_pruned_transducer_stateless2/data/lang_char/Linv.pt
2022-05-20 00:13:05,985 INFO [pretrained.py:213] {'best_train_loss': inf, 'best_valid_loss': inf, 'best_train_epoch': -1, 'best_valid_epoch': -1, 'batch_idx_train': 10, 'log_interval': 1, 'reset_interval': 200, 'feature_dim': 80, 'subsampling_factor': 4, 'encoder_dim': 512, 'nhead': 8, 'dim_feedforward': 2048, 'num_encoder_layers': 12, 'decoder_dim': 512, 'joiner_dim': 512, 'env_info': {'k2-version': '1.15.1', 'k2-build-type': 'Release', 'k2-with-cuda': True, 'k2-git-sha1': 'f8d2dba06c000ffee36aab5b66f24e7c9809f116', 'k2-git-date': 'Thu Apr 21 12:20:34 2022', 'lhotse-version': '1.2.0.dev+git.de75634.dirty', 'torch-version': '1.11.0', 'torch-cuda-available': True, 'torch-cuda-version': '10.2', 'python-version': '3.8', 'icefall-git-branch': 'wenetspeech-pruned-transducer-stateless2', 'icefall-git-sha1': '4b567e4-dirty', 'icefall-git-date': 'Wed Apr 27 13:43:54 2022', 'icefall-path': '/ceph-meixu/luomingshuang/icefall', 'k2-path': '/ceph-ms/luomingshuang/k2_latest/k2/python/k2/__init__.py', 'lhotse-path': '/ceph-meixu/luomingshuang/anaconda3/envs/k2-python/lib/python3.8/site-packages/lhotse-1.2.0.dev0+git.de75634.dirty-py3.8.egg/lhotse/__init__.py', 'hostname': 'de-74279-k2-train-1-0307195509-54c966b95f-rtpfq', 'IP address': '10.177.22.9'}, 'checkpoint': 'icefall_asr_wenetspeech_pruned_transducer_stateless2/exp/pretrained_epoch_10_avg_2.pt', 'lang_dir': 'icefall_asr_wenetspeech_pruned_transducer_stateless2/data/lang_char', 'decoding_method': 'greedy_search', 'sound_files': ['icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000000.opus', 'icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000001.opus', 'icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000002.opus'], 'sample_rate': 48000, 'beam_size': 4, 'beam': 4, 'max_contexts': 4, 'max_states': 8, 'context_size': 2, 'max_sym_per_frame': 1, 'blank_id': 0, 'vocab_size': 5537}
2022-05-20 00:13:05,985 INFO [pretrained.py:219] device: cuda:0
2022-05-20 00:13:05,985 INFO [pretrained.py:221] Creating model
2022-05-20 00:13:10,099 INFO [pretrained.py:235] Constructing Fbank computer
2022-05-20 00:13:10,102 INFO [pretrained.py:245] Reading sound files: ['icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000000.opus', 'icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000001.opus', 'icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000002.opus']
2022-05-20 00:13:10,262 INFO [pretrained.py:251] Decoding started
2022-05-20 00:13:11,015 INFO [pretrained.py:268] Using greedy_search
2022-05-20 00:13:11,306 INFO [pretrained.py:331]
icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000000.opus:
对 我 做 了 介 绍 那 么 我 想 说 的 是 呢 大 家 如 果 对 我 的 研 究 感 兴 趣 呢
icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000001.opus:
动 点 的 小 盘 三 个 问 题 首 先 呢 就 是 这 一 轮 全 球 金 融 动 荡 的 表 现
icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000002.opus:
收 入 了 分 析 师 这 一 次 全 球 金 融 动 荡 背 后 的 根 源
2022-05-20 00:13:11,306 INFO [pretrained.py:333] Decoding Done
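
The hypotheses above are printed as character tokens separated by single spaces. A minimal sketch of turning such a line back into running text is shown below. The helper name `join_chars` is hypothetical and is not part of icefall; it assumes every token is a single character with no meaningful internal spaces, which would not hold for mixed Chinese/English output.

```python
def join_chars(hyp: str) -> str:
    """Remove the whitespace that separates per-character tokens.

    Hypothetical helper, not an icefall function. Assumes the hypothesis
    is a sequence of single-character tokens, as in the log above.
    """
    return "".join(hyp.split())


# Third hypothesis copied from the log above (DEV_T0000000002.opus).
hyp = "收 入 了 分 析 师 这 一 次 全 球 金 融 动 荡 背 后 的 根 源"
print(join_chars(hyp))
```

Splitting on whitespace rather than replacing a single space also tolerates leading, trailing, or repeated spaces in the printed line.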