metadata
language: id
license: apache-2.0
tags:
- icefall
- sherpa-onnx
- phoneme-recognition
- automatic-speech-recognition
datasets:
- mozilla-foundation/common_voice_13_0
- indonesian-nlp/librivox-indonesia
- google/fleurs
Sherpa-onnx Pruned Stateless Zipformer RNN-T Streaming ID
Sherpa-onnx Pruned Stateless Zipformer RNN-T Streaming ID is an automatic speech recognition model trained on the following datasets:
Instead of being trained to predict sequences of words, this model was trained to predict sequence of phonemes, e.g. ['p', 'ə', 'r', 'b', 'u', 'a', 't', 'a', 'n', 'ɲ', 'a']
. Therefore, the model's vocabulary contains the different IPA phonemes found in g2p ID.
This model was converted from the TorchScript version of Pruned Stateless Zipformer RNN-T Streaming ID to ONNX format.
Converting from TorchScript
Refer to the official instructions for conversion to ONNX.