patrickvonplaten's picture
Update README.md
82e9367
|
raw
history blame
501 Bytes
metadata
language: sv
datasets:
  - common_voice
tags:
  - audio
  - automatic-speech-recognition
  - speech
  - xlsr-fine-tuning-week
license: apache-2.0

wav2vec2-swedish-common-voice

This model is fine-tuned from Facebook's wav2vec2-large-xlsr-53 using Mozilla's common voice dataset. Wav2vec2-swedish trained on the common voice dataset.

Model metric

WER 0.511916 on a dataset of 402mb.

Model Fine-Tuning-Colab

https://colab.research.google.com/drive/1KkD4PeZwnIwxxxOP1bUE7XTZMK7-SzRj?usp=sharing