khmer-asr-openslr / README.md
metythorn's picture
Update README.md
219a0db verified

Khmer Automatic Speech Recognition

Installation

Install from PyPI

pip install sdab

Install from source


# clone repo 
git clone https://github.com/MetythornPenn/sdab.git

# install lib from source
pip install -e .

Usage

Download sample audio

wget -O audio.wav https://github.com/MetythornPenn/sdab/blob/main/sample/audio.wav

Python API

from sdab import Sdab

file_path = "audio.wav"
model_name = "metythorn/khmer-asr-openslr"  # or local directory path

sdab = Sdab( file_path = file_path, model_name = model_name)
print(sdab.result)

# result : αžŸαŸ’αž–αžΆαž“αž€αŸ†αž–αž„αž…αŸ†αž›αž„αž’αŸ’αž“αž€αž›αžΎαž„αž“αŸ…αž–αŸ’αžšαžΈαžœαŸ‚αž‰αž‡αžΆαžŸαŸ’αž–αžΆαž“αžœαŸαž‰αž‡αžΆαž„αž‚αŸαžŸαž€αŸ’αž“αž»αž„αž–αŸ’αžšαžŸαžšαžΆαž‡αžΆαž’αžΆαž…αž€αž˜αŸ’αž–αž»αž‡αžΆ
  • file_path: path of audio file
  • model_name : pretrain model path from huggingface or local
  • device : should be cpu or cuda but I use cpu by default
  • tokenized: show [PAD] in output, False by default
  • return: Khmer text from ASR

Reference


license: apache-2.0 datasets: - openslr/openslr language: - km tags: - asr - khmer asr - khmer speech to text - speech to text