Important...!!!!!

by trysem - opened 16 days ago

Thanks for this particular model ♥️
But it still leaves some lacuna, mispronounced words are there still..
Also the model is very high in size which needs lots of computing and skill to make it run for common people..
Still after all these developments, malayalam still lacks good ASR model.
All are proprietary handled by big firms, open ones are still far behind the game. People are still struggling. Malayalam in all sector is far behind(In TTS , LLM..etc) Just beacuse there are no Good Open ASR.
If one is decided to make a good model, still computing hits hard, training costs are high. Please make small one, one which is able to run via browser via ONNX, i suggest a below 800mb model. If its below 500mb, all welcome. I have seen proprietary firms having SOTA models below 450mb. For complex scripts like malayalam, it seems fastConformer TDT works well, try fastConformer-TDT-CTC hybrid for SOTA. Or fastConformerRNNT some decent production level model. CTC alone not working well.
If you are really into helping the people for everyday tasks, consider this suggestion, you'll be remembered...
Release in open type licence

a unfancy researcher guy

SujithPulikodan

ARTPARK org 11 days ago

Thanks for the feedback, the current model is 430 million parameter model with hybrid CTC -TDT decoder. Could you please share performance numbers(WER,CER) on your evaluation data for both opensource models as well proprietary models? we would like to see the performance gap.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment