Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,40 @@
|
|
1 |
-
---
|
2 |
-
license: cc-by-nc-sa-4.0
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: cc-by-nc-sa-4.0
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
pipeline_tag: audio-classification
|
6 |
+
tags:
|
7 |
+
- wavlm
|
8 |
+
- wav2vec2
|
9 |
+
- msp-podcast
|
10 |
+
- emotion-recognition
|
11 |
+
- audio
|
12 |
+
- speech
|
13 |
+
- valence
|
14 |
+
- arousal
|
15 |
+
- dominance
|
16 |
+
- speech-emotion-recognition
|
17 |
+
- dkounadis
|
18 |
+
---
|
19 |
+
Tecaher model based on [wavlm](https://huggingface.co/3loi/SER-Odyssey-Baseline-WavLM-Multi-Attributes) and [wav2vec2](https://hf.rst.im/audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim) for arousal, dominance, valence prediction in range [0,1], used in dimensional Speech Emotion Recognition.
|
20 |
+
Acieves xx CCC on [MSP-Podcast](https://ecs.utdallas.edu/research/researchlabs/msp-lab/MSP-Podcast.html) test split.
|
21 |
+
|
22 |
+
|
23 |
+
|
24 |
+
# Benchmarks
|
25 |
+
CCC based on Test1 and Development sets of the Odyssey Competition
|
26 |
+
<table style="width:500px">
|
27 |
+
<tr><th colspan=6 align="center" >Multi-Task Setup</th></tr>
|
28 |
+
<tr><th colspan=3 align="center">Test 3</th><th colspan=3 align="center">Development</th></tr>
|
29 |
+
<tr> <td>Val</td> <td>Dom</td> <td>Aro</td> <td>Val</td> <td>Dom</td> <td>Aro</td> </tr>
|
30 |
+
<tr> <td> 0</td> <td>0</td> <td>0</td> <td>0</td> <td>0</td> <td>0</td> </tr>
|
31 |
+
</table>
|
32 |
+
|
33 |
+
|
34 |
+
|
35 |
+
# Usage
|
36 |
+
```python
|
37 |
+
from transformers import AutoModelForAudioClassification
|
38 |
+
import librosa, torch
|
39 |
+
|
40 |
+
```
|