File size: 490 Bytes
d8a9745 |
1 2 3 4 5 6 7 8 9 10 11 12 |
---
language:
- en
license: apache-2.0
tags:
- automatic-speech-recognition
---
This model is trained on the PSST Challenge data, with a subset of TIMIT that was augmented using Room Impulse Response (RIR). A file containing the list of TIMIT IDs is in the repository (`timit-ids.txt`)
The model was finetuned on [Wav2vec 2.0 Large, No finetuning](https://github.com/pytorch/fairseq/tree/main/examples/wav2vec), and the results on the validation set were **PER:** 21\.0%, **FER:** 9\.2%.
|