---
language:
- la
license: agpl-3.0
tags:
- robust-speech-event
- hf-asr-leaderboard
datasets:
- lsb/poetaexmachina-mp3-recitations
metrics:
- wer
base_model: wav2vec2-base-it-voxpopuli
model-index:
- name: wav2vec2-base-it-latin
  results:
  - task:
      type: automatic-speech-recognition
      name: Speech Recognition
    dataset:
      name: Poeta Ex Machina mp3 recitations
      type: lsb/poetaexmachina-mp3-recitations
    metrics:
    - type: wer
      value: 0.398
      name: Test WER
---
---
# wav2vec2-base-it-latin
This model is a fine-tuned version of [wav2vec2-base-it-voxpopuli](https://huggingface.co/facebook/wav2vec2-base-it-voxpopuli).
It was fine-tuned on the [poetaexmachina-mp3-recitations](https://github.com/lsb/poetaexmachina-mp3-recitations) dataset,
using all of the 2-series texts (Vergil) and every tenth 1-series text (words from Poeta Ex Machina's [database](https://github.com/lsb/poetaexmachina/blob/master/merged-scansions.db) of words with scansions).
It achieves the following [results](https://github.com/lsb/tironiculum/blame/trunk/wav2vec2%20base%20it%20latin.ipynb#L1234) on the evaluation set:
- Loss: 0.1943
- WER: 0.398
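
For readers unfamiliar with the metric, the sketch below shows how word error rate (WER) is conventionally computed: the word-level edit distance between a reference transcript and the model's output, divided by the number of reference words. This is an illustrative implementation only. The function name `word_error_rate` and the sample strings are assumptions made for this example, and the linked notebook may compute WER with a library rather than this code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration; not taken from the training notebook.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,                      # deletion
                    curr[j - 1] + 1,                  # insertion
                    prev[j - 1] + substitution_cost,  # match or substitution
                )
            )
        prev = curr
    return prev[-1] / len(ref)


# Made-up sample: one substitution and one deletion against five reference words.
sample_wer = word_error_rate("arma virumque cano troiae qui",
                             "arma virumque canto troiae")
print(sample_wer)  # 0.4
```

Under this definition, a WER of 0.398 means roughly four word-level errors (substitutions, insertions, or deletions) for every ten reference words.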