w11wo commited on
Commit
44390d5
1 Parent(s): 26a3ba9

Added Model

Browse files
README.md CHANGED
@@ -1,3 +1,38 @@
1
  ---
 
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language: id
3
  license: apache-2.0
4
+ tags:
5
+ - icefall
6
+ - sherpa-onnx
7
+ - phoneme-recognition
8
+ - automatic-speech-recognition
9
+ datasets:
10
+ - mozilla-foundation/common_voice_13_0
11
+ - indonesian-nlp/librivox-indonesia
12
+ - google/fleurs
13
  ---
14
+
15
+ # Sherpa-onnx Pruned Stateless Zipformer RNN-T Streaming ID
16
+
17
+ Sherpa-onnx Pruned Stateless Zipformer RNN-T Streaming ID is an automatic speech recognition model trained on the following datasets:
18
+
19
+ - [Common Voice ID](https://huggingface.co/datasets/mozilla-foundation/common_voice_13_0)
20
+ - [LibriVox Indonesia](https://huggingface.co/datasets/indonesian-nlp/librivox-indonesia)
21
+ - [FLEURS ID](https://huggingface.co/datasets/google/fleurs)
22
+
23
+ Instead of being trained to predict sequences of words, this model was trained to predict sequence of phonemes, e.g. `['p', 'ə', 'r', 'b', 'u', 'a', 't', 'a', 'n', 'ɲ', 'a']`. Therefore, the model's [vocabulary](https://huggingface.co/bookbot/pruned-transducer-stateless7-streaming-id/blob/main/data/lang_phone/tokens.txt) contains the different IPA phonemes found in [g2p ID](https://github.com/bookbot-kids/g2p_id).
24
+
25
+ This model was converted from the TorchScript version of [Pruned Stateless Zipformer RNN-T Streaming ID](https://huggingface.co/bookbot/pruned-transducer-stateless7-streaming-id) to ONNX format.
26
+
27
+ ## Converting from TorchScript
28
+
29
+ Refer to the [official instructions](https://icefall.readthedocs.io/en/latest/model-export/export-onnx.html#export-the-model-to-onnx) for conversion to ONNX.
30
+
31
+ ## Frameworks
32
+
33
+ - [k2](https://github.com/k2-fsa/k2)
34
+ - [icefall](https://github.com/bookbot-hive/icefall)
35
+ - [lhotse](https://github.com/bookbot-hive/lhotse)
36
+ - [sherpa-onnx](https://github.com/k2-fsa/sherpa-onnx)
37
+ - [onnx](https://github.com/onnx/onnx)
38
+ - [onnxruntime](https://github.com/microsoft/onnxruntime)
decoder-epoch-30-avg-9-with-averaged-model.int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e32cb407c0101d118eb1ab3819586021355f0cb2b49ce8886f016a29ea9c74e7
3
+ size 350745
decoder-epoch-30-avg-9-with-averaged-model.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2ff401e8ecd799cfcc9cca5ec331c5015f218f4e93ab89a536ad202bf7c3fb77
3
+ size 1135854
encoder-epoch-30-avg-9-with-averaged-model.int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e2bd8ce21dc70bf2e24b536cbbfa4201e91c6ed7fcbb47e7407ddd97b3658a7b
3
+ size 126764049
encoder-epoch-30-avg-9-with-averaged-model.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5284285770f8f45e5a5ff32cd1a6fe00eeca53d74f686bc87449ef1dc3b0c6d8
3
+ size 292543537
joiner-epoch-30-avg-9-with-averaged-model.int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:799a3b2899f968700e9180ed83175cae03192f4612a1ec1db2aabb98becf07d8
3
+ size 18597
joiner-epoch-30-avg-9-with-averaged-model.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5c893621ea7d778f4537726a492b2fd379ec59c911fe5f1fce967b24062d90dd
3
+ size 68175
tokens.txt ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ <eps> 0
2
+ ɡ 1
3
+ o 2
4
+ d 3
5
+ ʃ 4
6
+ v 5
7
+ t 6
8
+ <UNK> 7
9
+ x 8
10
+ r 9
11
+ ʔ 10
12
+ b 11
13
+ s 12
14
+ p 13
15
+ i 14
16
+ dʒ 15
17
+ | 16
18
+ ə 17
19
+ z 18
20
+ f 19
21
+ n 20
22
+ m 21
23
+ ɲ 22
24
+ tʃ 23
25
+ ŋ 24
26
+ k 25
27
+ j 26
28
+ l 27
29
+ h 28
30
+ w 29
31
+ a 30
32
+ u 31
33
+ e 32
34
+ #0 33