Original repos (many thanks!):
- Used for training, with minor modifications to train on our own datasets.
Note: As with the original, this model does not have a tokenizer because it was pretrained on audio alone. To use this model for speech recognition, a tokenizer should be created and the model should be fine-tuned on labeled text data. Check out this blog for a more detailed explanation of how to fine-tune the model.
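As a rough illustration of the "create a tokenizer" step, here is a minimal sketch of building a character-level vocabulary for CTC fine-tuning. The example transcripts, the `build_ctc_vocab` helper, the `vocab.json` file name, and the `[UNK]` / `[PAD]` / `|` token choices are assumptions for illustration, not something this repo ships.

```python
import json


def build_ctc_vocab(transcripts):
    """Map every character seen in the transcripts to an integer id."""
    chars = sorted(set("".join(transcripts)))
    vocab = {ch: idx for idx, ch in enumerate(chars)}
    # Use "|" as a visible word delimiter in place of the space character.
    if " " in vocab:
        vocab["|"] = vocab.pop(" ")
    # Reserve ids for unknown characters and for padding (used as the CTC blank).
    vocab["[UNK]"] = len(vocab)
    vocab["[PAD]"] = len(vocab)
    return vocab


# Hypothetical labeled text; replace with the transcripts of your own dataset.
transcripts = ["hello world", "good morning"]
vocab = build_ctc_vocab(transcripts)

# Save the vocabulary so a tokenizer can be built from it later.
with open("vocab.json", "w", encoding="utf-8") as f:
    json.dump(vocab, f, ensure_ascii=False)
```

If the `transformers` library is installed, a file like this can be loaded with `Wav2Vec2CTCTokenizer("vocab.json", unk_token="[UNK]", pad_token="[PAD]", word_delimiter_token="|")`. The vocabulary should be built from the full training transcripts so that every character the model must predict has an id.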
See this blog for more information on how to fine-tune the model. Note that the class
Wav2Vec2ForCTC has to be replaced by
Note: This is not the best checkpoint; I expect it to become more accurate with continued training. I'll try to continue when I have time.