license: cc-by-nc-sa-4.0
language:
  - lb
  - de
  - fr
  - en
  - pt
tags:
  - STT
  - ASR
  - audio
  - speech recognition
  - coqui.ai
datasets:
  - mbarnig/lb-STT-CORPUS

The Luxembourgish part of my multilingual automatic speech recognition (ASR) model is the second machine learning (ML) model for Luxembourgish. The very first model was published in May 2022 by Prof. Peter Gilles of the University of Luxembourg.

My model was trained from scratch with my customized dataset mbarnig/lb-STT_CORPUS and the deep learning toolkit 🐸 Coqui-STT (version 1.3.0). The model was trained without punctuation, using the following alphabet:

characters="abcdefghijklmnopqrstuvwxyz ßàáâãäçèéêëíîïóôõöùúûü",
punctuations="!'(),-.:;? ",
phonemes=None,
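
As an illustration of what "trained without punctuation" means for input text, here is a minimal sketch that reduces a sentence to the character set listed above. The two strings are copied from the alphabet; the function names `normalize_transcript` and `out_of_alphabet` are hypothetical, and lowercasing plus replacing every other character with a space is an assumption, not necessarily the preprocessing that was actually applied to the training corpus.

```python
import unicodedata

# Copied verbatim from the alphabet above.
CHARACTERS = "abcdefghijklmnopqrstuvwxyz ßàáâãäçèéêëíîïóôõöùúûü"
PUNCTUATIONS = "!'(),-.:;? "

ALLOWED = set(CHARACTERS)


def normalize_transcript(text: str) -> str:
    """Lowercase the text and keep only characters from CHARACTERS.

    Assumption: anything outside the alphabet (punctuation, digits, ...)
    is replaced by a space, and runs of spaces are collapsed.
    """
    # NFC makes sure accented letters are single precomposed code points.
    lowered = unicodedata.normalize("NFC", text).lower()
    kept = "".join(ch if ch in ALLOWED else " " for ch in lowered)
    return " ".join(kept.split())


def out_of_alphabet(text: str) -> set:
    """Return characters that are neither in CHARACTERS nor in PUNCTUATIONS."""
    lowered = unicodedata.normalize("NFC", text).lower()
    return {ch for ch in lowered if ch not in ALLOWED and ch not in PUNCTUATIONS}


if __name__ == "__main__":
    print(normalize_transcript("Moien, wéi geet et?"))
    print(sorted(out_of_alphabet("Ëmmer 100%!")))
```

A check like `out_of_alphabet` can help spot digits or symbols in a transcript that would need to be spelled out before comparing it with the model output.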

A live inference demo of the ASR system is available in my Hugging Face Space ⌨️ 🇱🇺 🔈 mbarnig/lb-de-fr-en-pt-COQUI-STT.

Click the "Training metrics" tab above to view the live TensorBoard of the model training.
