juierror committed on
Commit
6856152
1 Parent(s): 81e899c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -0
README.md CHANGED
@@ -1,3 +1,49 @@
1
  ---
2
  license: apache-2.0
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ language:
4
+ - th
5
+ pipeline_tag: automatic-speech-recognition
6
  ---
7
+
8
+ # Whisper-base Thai finetuned
9
+
10
+ ## 1) Environment Setup
11
+ ```bash
12
+ # visit https://pytorch.org/get-started/locally/ to install pytorch
13
+ pip3 install transformers librosa
14
+ ```
15
+
16
+ ## 2) Usage
17
+ ```python
18
+ from transformers import WhisperForConditionalGeneration, WhisperProcessor
19
+ import librosa
20
+
21
+ device = "cuda" # cpu, cuda
22
+
23
+ model = WhisperForConditionalGeneration.from_pretrained("juierror/whisper-base-thai").to(device)
24
+ processor = WhisperProcessor.from_pretrained("juierror/whisper-base-thai", language="Thai", task="transcribe")
25
+
26
+ path = "/path/to/audio/file"
27
+
28
+ def inference(path: str) -> str:
29
+ """
30
+ Get the transcription from audio path
31
+
32
+ Args:
33
+ path(str): path to audio file (can be loaded with librosa)
34
+
35
+ Returns:
36
+ str: transcription
37
+ """
38
+ audio, sr = librosa.load(path, sr=16000)
39
+ input_features = processor(audio, sampling_rate=16000, return_tensors="pt").input_features
40
+ generated_tokens = model.generate(
41
+ input_features=input_features.to(device),
42
+ max_new_tokens=255,
43
+ language="Thai"
44
+ ).cpu()
45
+ transcriptions = processor.tokenizer.batch_decode(generated_tokens, skip_special_tokens=True)
46
+ return transcriptions[0]
47
+
48
+ print(inference(path=path))
49
+ ```