Loading model

by rjflynn2 - opened Nov 27, 2023

Nov 27, 2023

How should this model be loaded via the transformers API? Loading with AutoModel and from_flax=True, or using the Flax Wav2Vec2 classes, results in a weight mismatch.

sanchit-gandhi

End-to-End Speech Benchmark org Nov 27, 2023

Hey @rjflynn2, you should be able to do:

from transformers import AutoProcessor, AutoModelForCTC

processor = AutoProcessor.from_pretrained("esb/wav2vec2-ctc-earnings22")
model = AutoModelForCTC.from_pretrained("esb/wav2vec2-ctc-earnings22", from_flax=True)

rjflynn2

Nov 28, 2023

This leads to a shape mismatch when loading the model, and the weights aren't loaded properly. See this notebook: https://colab.research.google.com/drive/1NP_yCyBrVHEOTUrujKk_rxIocSCbsbIG?usp=sharing
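
One way to narrow down a shape mismatch like this is to compare the parameter shapes the model class expects with the shapes stored in the checkpoint. The following is only a minimal sketch: `diff_shapes` is a hypothetical helper, not part of transformers, and the shapes in the example are invented for illustration rather than taken from this checkpoint.

```python
def diff_shapes(expected, loaded):
    """Return (name, expected_shape, loaded_shape) for every disagreement.

    Both arguments map parameter names to shape tuples. A shape of None in
    the result means the parameter is absent on that side.
    """
    report = []
    for name in sorted(set(expected) | set(loaded)):
        exp = expected.get(name)
        got = loaded.get(name)
        if exp != got:
            report.append((name, exp, got))
    return report


# Toy shapes, invented for illustration only (not read from the checkpoint).
expected = {"lm_head.weight": (32, 1024), "lm_head.bias": (32,)}
loaded = {"lm_head.weight": (40, 1024), "lm_head.bias": (40,)}

for name, exp, got in diff_shapes(expected, loaded):
    print(f"{name}: model expects {exp}, checkpoint has {got}")
```

For a PyTorch model, the expected side could be built with `{k: tuple(v.shape) for k, v in model.state_dict().items()}`. The loaded side would come from the converted checkpoint, and parameter names may first need mapping between the Flax and PyTorch conventions.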
