mesolitica
/

wav2vec2-xls-r-300m-mixed

Automatic Speech Recognition

generated_from_keras_callback

Inference Endpoints

Model card Files Files and versions Community

huseinzol05 commited on Jun 1, 2022

Commit

0b9b0fb

•

1 Parent(s): 7799685

add model

Files changed (2) hide show

README.md +19 -43
tf_model.h5 +1 -1

README.md CHANGED Viewed

@@ -11,61 +11,37 @@ probably proofread and complete it, then remove this comment. -->
 # wav2vec2-xls-r-300m-mixed
-Finetuned https://huggingface.co/facebook/wav2vec2-xls-r-300m on https://github.com/huseinzol05/malaya-speech/tree/master/data/mixed-stt
-This model was finetuned on 3 languages,
-1. Malay
-2. Singlish
-3. Mandarin
-**This model trained on a single RTX 3090 Ti 24GB VRAM, provided by https://mesolitica.com/**.
-## Evaluation set
-Evaluation set from https://github.com/huseinzol05/malaya-speech/tree/master/pretrained-model/prepare-stt with sizes,
-```
-len(malay), len(singlish), len(mandarin)
--> (765, 3579, 614)
-```
-It achieves the following results on the evaluation set based on [evaluate-wav2vec2-xls-r-300m-mixed.ipynb](evaluate-wav2vec2-xls-r-300m-mixed.ipynb):
-Mixed evaluation,
-```
-CER: 0.05082346216269688
-WER: 0.14251665517797765
-CER with LM: 0.042868860764264445
-WER with LM: 0.10380217528405207
-```
-Malay evaluation,
-```
-CER: 0.05027226867066105
-WER: 0.21723938552369926
-CER with LM: 0.03601546154013878
-WER with LM: 0.13593624603428525
-```
-Singlish evaluation,
-```
-CER: 0.05161275767676772
-WER: 0.1331819722523124
-CER with LM: 0.04419848182804781
-WER with LM: 0.09859626021111582
-```
-Mandarin evaluation,
-```
-CER: 0.04690941391603209
-WER: 0.10382926344585862
-CER with LM: 0.0436573568867001
-WER with LM: 0.09411065398455744
-```
-Language model from https://huggingface.co/huseinzol05/language-model-bahasa-manglish-combined

 # wav2vec2-xls-r-300m-mixed
+This model was trained from scratch on an unknown dataset.
+It achieves the following results on the evaluation set:
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- optimizer: None
+- training_precision: float32
+### Training results
+### Framework versions
+- Transformers 4.18.0
+- TensorFlow 2.6.0
+- Datasets 2.1.0
+- Tokenizers 0.12.1

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e0b6b35b0d6ba53e37190d4d4562c5b2de506cdc25e6e6907531a7a973550fe2
 size 1262429728

 version https://git-lfs.github.com/spec/v1
+oid sha256:ab874f96f3fd9d022b30636174df952c354c9f14eecf60a3b9af36d252673905
 size 1262429728