mesolitica
/

wav2vec2-xls-r-300m-mixed

Automatic Speech Recognition

generated_from_keras_callback

Inference Endpoints

Model card Files Files and versions Community

huseinzol05 commited on Jun 2, 2022

Commit

2d35b49

•

1 Parent(s): 276857a

add model

Files changed (2) hide show

README.md +19 -43
tf_model.h5 +1 -1

README.md CHANGED Viewed

@@ -11,61 +11,37 @@ probably proofread and complete it, then remove this comment. -->
 # wav2vec2-xls-r-300m-mixed
-Finetuned https://huggingface.co/facebook/wav2vec2-xls-r-300m on https://github.com/huseinzol05/malaya-speech/tree/master/data/mixed-stt
-This model was finetuned on 3 languages,
-1. Malay
-2. Singlish
-3. Mandarin
-**This model trained on a single RTX 3090 Ti 24GB VRAM, provided by https://mesolitica.com/**.
-## Evaluation set
-Evaluation set from https://github.com/huseinzol05/malaya-speech/tree/master/pretrained-model/prepare-stt with sizes,
-```
-len(malay), len(singlish), len(mandarin)
--> (765, 3579, 614)
-```
-It achieves the following results on the evaluation set based on [evaluate-wav2vec2-xls-r-300m-mixed.ipynb](evaluate-wav2vec2-xls-r-300m-mixed.ipynb):
-Mixed evaluation,
-```
-CER: 0.0481054244857041
-WER: 0.1322198446007387
-CER with LM: 0.041196586938584696
-WER with LM: 0.09880169127621556
-```
-Malay evaluation,
-```
-CER: 0.051636391937588406
-WER: 0.19561999547293663
-CER with LM: 0.03917689630621449
-WER with LM: 0.12710746406824835
-```
-Singlish evaluation,
-```
-CER: 0.0494915200071987
-WER: 0.12763802881676573
-CER with LM: 0.04271234986432335
-WER with LM: 0.09677160640413336
-```
-Mandarin evaluation,
-```
-CER: 0.035626554824269824
-WER: 0.07993515937860181
-CER with LM: 0.03487760945087219
-WER with LM: 0.07536807168546154
-```
-Language model from https://huggingface.co/huseinzol05/language-model-bahasa-manglish-combined

 # wav2vec2-xls-r-300m-mixed
+This model was trained from scratch on an unknown dataset.
+It achieves the following results on the evaluation set:
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- optimizer: None
+- training_precision: float32
+### Training results
+### Framework versions
+- Transformers 4.18.0
+- TensorFlow 2.6.0
+- Datasets 2.1.0
+- Tokenizers 0.12.1

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ab874f96f3fd9d022b30636174df952c354c9f14eecf60a3b9af36d252673905
 size 1262429728

 version https://git-lfs.github.com/spec/v1
+oid sha256:a59f060b717345a69d7add89b01c3b6afc450fab5f6d19db679b0c9df6739172
 size 1262429728