Speech Emotion Recognition model created by fine-tuning the Wav2Vec2 [model](https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english) pre-trained on xlsr for English. The dataset used to fine-tune this model is the RAVDESS dataset that can be found [here](https://zenodo.org/record/1188976#.YO6jYOgzaUk).