https://hf.co/vinai/PhoWhisper-large with ONNX weights to be compatible with Transformers.js.

Please check out this demo using this model: Open in Spaces

PhoWhisper: Automatic Speech Recognition for Vietnamese

We introduce PhoWhisper in five versions for Vietnamese automatic speech recognition. PhoWhisper's robustness is achieved through fine-tuning the multilingual Whisper on an 844-hour dataset that encompasses diverse Vietnamese accents. Our experimental study demonstrates state-of-the-art performances of PhoWhisper on benchmark Vietnamese ASR datasets. Please cite our PhoWhisper paper when it is used to help produce published results or is incorporated into other software:

@inproceedings{PhoWhisper,
  title     = {{PhoWhisper: Automatic Speech Recognition for Vietnamese}},
  author    = {Thanh-Thien Le and Linh The Nguyen and Dat Quoc Nguyen},
  booktitle = {Proceedings of the ICLR 2024 Tiny Papers track},
  year      = {2024}
}

For further information or requests, please go to PhoWhisper's homepage!

Downloads last month
7
Inference Examples
Inference API (serverless) does not yet support transformers.js models for this pipeline type.

Space using huuquyet/PhoWhisper-large 1