--- language: - la license: agpl-3.0 tags: - robust-speech-event - hf-asr-leaderboard datasets: - lsb/poetaexmachina-mp3-recitations metrics: - wer model-index: - name: wav2vec2-base-it-latin results: - task: type: automatic-speech-recognition name: Speech Recognition dataset: type: lsb/poetaexmachina-mp3-recitations name: Poeta Ex Machina mp3 recitations metrics: - type: wer value: 0.398 name: Test WER --- --- # wav2vec2-base-it-latin This model is a fine-tuned version of [wav2vec2-base-it-voxpopuli](https://huggingface.co/facebook/wav2vec2-base-it-voxpopuli) The dataset used is the [poetaexmachina-mp3-recitations](https://github.com/lsb/poetaexmachina-mp3-recitations), all of the 2-series texts (vergil) and every tenth 1-series text (words from Poeta Ex Machina's [database](https://github.com/lsb/poetaexmachina/blob/master/merged-scansions.db) of words with scansions). It achieves the following [results](https://github.com/lsb/tironiculum/blame/trunk/wav2vec2%20base%20it%20latin.ipynb#L1234) on the evaluation set: - Loss: 0.1943 - WER: 0.398