chuuhtetnaing
/

whisper-medium-myanmar

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

chuuhtetnaing commited on Aug 30

Commit

9656442

•

1 Parent(s): ad8fe52

Update README.md

Files changed (1) hide show

README.md +21 -11

README.md CHANGED Viewed

@@ -8,6 +8,12 @@ metrics:
 model-index:
 - name: whisper-medium-myanmar
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -15,24 +21,28 @@ should probably proofread and complete it, then remove this comment. -->
 # whisper-medium-myanmar
-This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.2282
 - Wer: 49.4657
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -108,4 +118,4 @@ The following hyperparameters were used during training:
 - Transformers 4.35.2
 - Pytorch 2.1.1+cu121
 - Datasets 2.14.5
-- Tokenizers 0.15.1

 model-index:
 - name: whisper-medium-myanmar
   results: []
+datasets:
+- chuuhtetnaing/myanmar-speech-dataset-openslr-80
+language:
+- my
+pipeline_tag: automatic-speech-recognition
+library_name: transformers
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # whisper-medium-myanmar
+This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on the [chuuhtetnaing/myanmar-speech-dataset-openslr-80](https://huggingface.co/datasets/chuuhtetnaing/myanmar-speech-dataset-openslr-80) dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.2282
 - Wer: 49.4657
+## Usage
+```python
+from datasets import Audio, load_dataset
+from transformers import pipeline
+# Load a sample audio
+dataset = load_dataset("chuuhtetnaing/myanmar-speech-dataset-openslr-80")
+dataset = dataset.cast_column("audio", Audio(sampling_rate=16000))
+test_dataset = dataset['test']
+input_speech = test_dataset[42]['audio']
+pipe = pipeline(model='chuuhtetnaing/whisper-large-v3-myanmar')
+output = pipe(input_speech, generate_kwargs={"language": "myanmar", "task": "transcribe"})
+print(output['text']) # ကျမ ပြည်ပ မှာ ပညာသင် တော့ စာမေးပွဲ ကို တပတ်တခါ စစ်တယ်
+```
 ### Training hyperparameters
 - Transformers 4.35.2
 - Pytorch 2.1.1+cu121
 - Datasets 2.14.5
+- Tokenizers 0.15.1