Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Khmer Automatic Speech Recognition

Installation

Install from PyPI

pip install sdab

Install from source


# clone repo 
git clone https://github.com/MetythornPenn/sdab.git

# install lib from source
pip install -e .

Usage

Download sample audio

wget -O audio.wav https://github.com/MetythornPenn/sdab/blob/main/sample/audio.wav

Python API

from sdab import Sdab

file_path = "audio.wav"
model_name = "metythorn/khmer-asr-openslr"  # or local directory path

sdab = Sdab( file_path = file_path, model_name = model_name)
print(sdab.result)

# result : αžŸαŸ’αž–αžΆαž“αž€αŸ†αž–αž„αž…αŸ†αž›αž„αž’αŸ’αž“αž€αž›αžΎαž„αž“αŸ…αž–αŸ’αžšαžΈαžœαŸ‚αž‰αž‡αžΆαžŸαŸ’αž–αžΆαž“αžœαŸαž‰αž‡αžΆαž„αž‚αŸαžŸαž€αŸ’αž“αž»αž„αž–αŸ’αžšαžŸαžšαžΆαž‡αžΆαž’αžΆαž…αž€αž˜αŸ’αž–αž»αž‡αžΆ
  • file_path: path of audio file
  • model_name : pretrain model path from huggingface or local
  • device : should be cpu or cuda but I use cpu by default
  • tokenized: show [PAD] in output, False by default
  • return: Khmer text from ASR

Reference


license: apache-2.0 datasets: - openslr/openslr language: - km tags: - asr - khmer asr - khmer speech to text - speech to text

Downloads last month
35
Inference API
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.