Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Pre-trained VisualNet-CTC models for the GRID visual dataset with icefall.

The model was trained on full GRID with the scripts in icefall.
See (https://github.com/k2-fsa/icefall/tree/master/egs/grid/AVSR/visualnet_ctc_asr) for more details of this model.

How to use

See (https://github.com/k2-fsa/icefall/blob/master/egs/grid/AVSR/visualnet_ctc_asr/Pre-trained.md)

Training procedure

The main repositories are list below, we will update the training and decoding scripts with the update of version.
k2: https://github.com/k2-fsa/k2 icefall: https://github.com/k2-fsa/icefall

git clone https://github.com/k2-fsa/icefall
cd icefall
  • Preparing data.
cd egs/grid/AVSR
bash ./prepare.sh
  • Training
export CUDA_VISIBLE_DEVICES="0"
python visualnet_ctc_asr/train.py --world-size 1

Evaluation results

The best decoding results (WER) on GRID TEST are listed below, we got this result by averaging models from epoch 16 to 25, the decoding method is 1best.

TEST
WER 15.68%
Downloads last month
0
Unable to determine this model's library. Check the docs .