w11wo's picture
Added Model
44390d5
metadata
language: id
license: apache-2.0
tags:
  - icefall
  - sherpa-onnx
  - phoneme-recognition
  - automatic-speech-recognition
datasets:
  - mozilla-foundation/common_voice_13_0
  - indonesian-nlp/librivox-indonesia
  - google/fleurs

Sherpa-onnx Pruned Stateless Zipformer RNN-T Streaming ID

Sherpa-onnx Pruned Stateless Zipformer RNN-T Streaming ID is an automatic speech recognition model trained on the following datasets:

Instead of being trained to predict sequences of words, this model was trained to predict sequence of phonemes, e.g. ['p', 'ə', 'r', 'b', 'u', 'a', 't', 'a', 'n', 'ɲ', 'a']. Therefore, the model's vocabulary contains the different IPA phonemes found in g2p ID.

This model was converted from the TorchScript version of Pruned Stateless Zipformer RNN-T Streaming ID to ONNX format.

Converting from TorchScript

Refer to the official instructions for conversion to ONNX.

Frameworks