emotion2vec/emotion2vec_plus_large


BoJack committed on Jun 24, 2024

Commit 6b7a78b · verified · 1 Parent(s): efd2932

Update README.md

Files changed (1)

README.md +10 -8


@@ -56,11 +56,15 @@ emotion2vec+ is a series of foundational models for speech emotion recognition (
 ![](emotion2vec+radar.png)
 This version (emotion2vec_plus_large) uses a large-scale pseudo-labeled data for finetuning to obtain a large size model (~300M),  and currently supports the following categories:
-    0: angry
-    1: happy
-    2: neutral
-    3: sad
-    4: unknown
+0: angry
+1: disgusted
+2: fearful
+3: happy
+4: neutral
+5: other
+6: sad
+7: surprised
+8: unknown
 # Model Card
 GitHub Repo: [emotion2vec](https://github.com/ddlBoJack/emotion2vec)
@@ -76,14 +80,12 @@ emotion2vec+ large|[Link](https://modelscope.cn/models/iic/emotion2vec_plus_larg
 We offer 3 versions of emotion2vec+, each derived from the data of its predecessor. If you need a model focusing on spech emotion representation, refer to [emotion2vec: universal speech emotion representation model](https://huggingface.co/emotion2vec/emotion2vec).
-- emotion2vec+ seed: Fine-tuned with academic speech emotion data
+- emotion2vec+ seed: Fine-tuned with academic speech emotion data from [EmoBox](https://github.com/emo-box/EmoBox)
 - emotion2vec+ base: Fine-tuned with filtered large-scale pseudo-labeled data to obtain the base size model (~90M)
 - emotion2vec+ large: Fine-tuned with filtered large-scale pseudo-labeled data to obtain the large size model (~300M)
 The iteration process is illustrated below, culminating in the training of the emotion2vec+ large model with 40k out of 160k hours of speech emotion data. Details of data engineering will be announced later.
-![](emotion2vec+data.png)
 # Installation
 `pip install -U funasr modelscope`
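
Reviewer note: this commit widens the category list from five to nine entries and renumbers the existing labels (for example, `happy` moves from index 1 to index 3), so downstream code that hard-codes the old indices would silently mislabel results. The sketch below shows one way to keep the mapping in a single table. It is illustrative only: the helper `top_emotion` and the `example_scores` values are hypothetical and not part of the repository, and it assumes the model yields one score per category in the README's index order.

```python
# Index-to-label table copied from the updated README's category list.
EMOTION_LABELS = {
    0: "angry",
    1: "disgusted",
    2: "fearful",
    3: "happy",
    4: "neutral",
    5: "other",
    6: "sad",
    7: "surprised",
    8: "unknown",
}


def top_emotion(scores):
    """Return (label, score) for the highest-scoring category.

    `scores` is a hypothetical sequence of nine per-category scores,
    assumed to be ordered by the indices in EMOTION_LABELS.
    """
    if len(scores) != len(EMOTION_LABELS):
        raise ValueError(
            f"expected {len(EMOTION_LABELS)} scores, got {len(scores)}"
        )
    # Index of the largest score; ties resolve to the lowest index.
    best = max(range(len(scores)), key=scores.__getitem__)
    return EMOTION_LABELS[best], scores[best]


# Made-up scores for illustration; index 3 ("happy") is the largest.
example_scores = [0.02, 0.01, 0.03, 0.70, 0.15, 0.02, 0.04, 0.02, 0.01]
label, score = top_emotion(example_scores)
print(label, score)
```

The length check is there so that a score vector produced against the old five-category list fails loudly instead of being mapped to the wrong labels.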