Alvenir
/

wav2vec2-base-da-ft-nst

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

arpelarpe commited on Mar 15, 2022

Commit

5e84820

·

1 Parent(s): 9eca7e3

Update README.md

Files changed (1) hide show

README.md +8 -10

README.md CHANGED Viewed

@@ -3,18 +3,18 @@ license: apache-2.0
 ---
 # wav2vec2-base-da-ft-nst
-This is a wav2vec2 model for Danish ASR finetuned by Alvenir on the public NST dataset. The model is trained on 16kHz, so make sure your data is the same sample rate.
 The model was trained using fairseq and then converted to huggingface/transformers format.
-Alvenir is always happy to help with your own open-source ASR projects or with customized domain specializations and high performance premium models. ;-)
 ## Usage
 ```Python
 import soundfile as sf
 import torch
-from transformers import Wav2Vec2CTCTokenizer, Wav2Vec2Tokenizer, Wav2Vec2FeatureExtractor, Wav2Vec2Processor, \
     Wav2Vec2ForCTC
@@ -22,10 +22,6 @@ def get_tokenizer(model_path: str) -> Wav2Vec2CTCTokenizer:
     return Wav2Vec2Tokenizer.from_pretrained(model_path)
-def get_feature_extractor(model_path: str) -> Wav2Vec2FeatureExtractor:
-    return Wav2Vec2FeatureExtractor.from_pretrained(model_path)
 def get_processor(model_path: str) -> Wav2Vec2Processor:
     return Wav2Vec2Processor.from_pretrained(model_path)
@@ -55,8 +51,10 @@ print(transcription)
 ```
 ## Benchmark results
-| Dataset             | WER Greddy | WER with Language Model |
 |---------------------|------------|--------------------|
 | NST test            | 15,8%      | 11.9%              |
-| alvenir-asr-da-eval | 18.2%      | 12.1%              |
-| Common-voice-da     | ??         | ??                 |

 ---
 # wav2vec2-base-da-ft-nst
+This the [alvenir wav2vec2 model](https://huggingface.co/Alvenir/wav2vec2-base-da) for Danish ASR finetuned by Alvenir on the public NST dataset. The model is trained on 16kHz, so make sure your data is the same sample rate.
 The model was trained using fairseq and then converted to huggingface/transformers format.
+Alvenir is always happy to help with your own open-source ASR projects, customized domain specializations or premium models. ;-)
 ## Usage
 ```Python
 import soundfile as sf
 import torch
+from transformers import Wav2Vec2CTCTokenizer, Wav2Vec2Tokenizer, Wav2Vec2Processor, \
     Wav2Vec2ForCTC
     return Wav2Vec2Tokenizer.from_pretrained(model_path)
 def get_processor(model_path: str) -> Wav2Vec2Processor:
     return Wav2Vec2Processor.from_pretrained(model_path)
 ```
 ## Benchmark results
+This is some benchmark results on the public available datasets in Danish.
+| Dataset             | WER Greedy | WER with Language Model |
 |---------------------|------------|--------------------|
 | NST test            | 15,8%      | 11.9%              |
+| alvenir-asr-da-eval | 19.0%      | 12.1%              |
+| common_voice_80 da test | 26,3% | ??                 |