Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Pre-trained AudioNet-CTC models for the GRID audio dataset with icefall.

The model was trained on full GRID with the scripts in icefall.
See (https://github.com/k2-fsa/icefall/tree/master/egs/grid/AVSR/audionet_ctc_asr) for more details of this model.

How to use

See (https://github.com/k2-fsa/icefall/blob/master/egs/grid/AVSR/audionet_ctc_asr/Pre-trained.md)

Training procedure

The main repositories are list below, we will update the training and decoding scripts with the update of version.
k2: https://github.com/k2-fsa/k2 icefall: https://github.com/k2-fsa/icefall

git clone https://github.com/k2-fsa/icefall
cd icefall
  • Preparing data.
cd egs/grid/AVSR
bash ./prepare.sh
  • Training
export CUDA_VISIBLE_DEVICES="0"
python audionet_ctc_asr/train.py --world-size 1

Evaluation results

The best decoding results (WER) on GRID TEST are listed below, we got this result by averaging models from epoch 25 to 29, the decoding method is 1best.

TEST
WER 2.35%
Downloads last month
0
Unable to determine this model's library. Check the docs .