Update README.md
Browse files
README.md
CHANGED
@@ -3,10 +3,13 @@ library_name: transformers
|
|
3 |
tags: []
|
4 |
---
|
5 |
|
6 |
-
# 5CD-AI/
|
7 |
## Overview
|
8 |
<!-- Provide a quick summary of what the model is/does. -->
|
9 |
-
We
|
|
|
|
|
|
|
10 |
|
11 |
Here are the results on 4 downstream tasks on Vietnamese social media texts, including Emotion Recognition(UIT-VSMEC), Hate Speech Detection(UIT-HSD), Spam Reviews Detection(ViSpamReviews), Hate Speech Spans Detection(ViHOS):
|
12 |
<table>
|
|
|
3 |
tags: []
|
4 |
---
|
5 |
|
6 |
+
# 5CD-AI/visobert-14gb-corpus-pretrained
|
7 |
## Overview
|
8 |
<!-- Provide a quick summary of what the model is/does. -->
|
9 |
+
We continually pretrain `uitnlp/visobert` on a merged 14GB dataset for 5 epochs, the training dataset includes:
|
10 |
+
- Internal data (100M comments and 15M posts on Facebook)
|
11 |
+
- UIT data, which is used to pretrain `uitnlp/visobert`
|
12 |
+
- MC4 ecommerce
|
13 |
|
14 |
Here are the results on 4 downstream tasks on Vietnamese social media texts, including Emotion Recognition(UIT-VSMEC), Hate Speech Detection(UIT-HSD), Spam Reviews Detection(ViSpamReviews), Hate Speech Spans Detection(ViHOS):
|
15 |
<table>
|