nvidia/stt_uk_citrinet_1024_gamma_0_25

Automatic Speech Recognition

hf-asr-leaderboard

dpykhtar committed on Jul 29, 2022

Commit 9bc9708 • 1 Parent(s): 2927ca9

Update README.md

Files changed (1)

README.md +0 -10

@@ -90,16 +90,6 @@ The tokenizer for this models was built using the text transcripts of the train
 Model is trained on Mozilla Common Voice Corpus 10.0 dataset comprising of 69 hours of Ukrainian speech.
-## Performance
-The list of the available models in this collection is shown in the following table. Performances of the ASR models are reported in terms of Word Error Rate (WER%) with greedy decoding.
-| Version | Tokenizer | Vocabulary Size | MCV-8 test | MCV-8 dev | MCV-9 test | MCV-9 dev | MCV-10 test | MCV-10 dev | Train Dataset |
-|---------|-----------------------|-----------------|---------------|---------------|------------|-----------|-----|---------|
-| 1.0.0 | SentencePiece Unigram | 1024 | null | null | null | null | null | MCV-10 validated |
-While deploying with [NVIDIA Riva](https://developer.nvidia.com/riva), you can combine this model with external language models to further improve WER. The WER(%) of the latest model with different language modeling techniques are reported in the following table.
 ## Limitations
 Since this model was trained on publicly available speech datasets, the performance of this model might degrade for speech that includes technical terms, or vernacular that the model has not been trained on. The model might also perform worse for accented speech.
