psst-fairseq-rir / README.md
birgermoell's picture
Upload README.md
1d3618c
|
raw
history blame
489 Bytes
metadata
language:
  - en
license: apache-2.0
tags:
  - automatic-speech-recognition

This model is trained on the PSST Challenge data, with a subset of TIMIT that was augmented using Room Impulse Response (RIR). A file containing the list of TIMIT IDs is in the repository (timit-ids.txt)

The model was finetuned on Wav2vec 2.0 Base, No finetuning, and the results on the validation set were PER: 21.8%, FER: 9.6%.