Khmer Automatic Speech Recognition
Installation
Install from PyPI
pip install sdab
Install from source
git clone https://github.com/MetythornPenn/sdab.git
pip install -e .
Usage
Download sample audio
wget -O audio.wav https://github.com/MetythornPenn/sdab/blob/main/sample/audio.wav
Python API
from sdab import Sdab
file_path = "audio.wav"
model_name = "metythorn/khmer-asr-openslr"
sdab = Sdab( file_path = file_path, model_name = model_name)
print(sdab.result)
file_path
: path of audio file
model_name
: pretrain model path from huggingface
or local
device
: should be cpu
or cuda
but I use cpu
by default
tokenized
: show [PAD]
in output, False
by default
return
: Khmer text from ASR
Reference
license: apache-2.0
datasets:
- openslr/openslr
language:
- km
tags:
- asr
- khmer asr
- khmer speech to text
- speech to text