Update README.md
README.md
@@ -30,18 +30,18 @@ You can download the 5 Chinese RoBERTa miniatures either from the [UER-py Modelz
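For loading one of these checkpoints via Hugging Face, here is a minimal sketch. The model ID below follows the apparent naming pattern for the word-based miniatures and is an assumption, not taken from this README; substitute the ID of the checkpoint you downloaded:

```python
# Minimal sketch: query a word-based Chinese RoBERTa miniature with fill-mask.
# NOTE: the model ID is an assumption based on the naming pattern; replace it
# with the ID of the checkpoint you actually downloaded.
from transformers import pipeline

unmasker = pipeline("fill-mask", model="uer/roberta-tiny-word-chinese-cluecorpussmall")
print(unmasker("[MASK]的首都是北京。"))  # "The capital of [MASK] is Beijing."
```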
Compared with [char-based models](https://huggingface.co/uer/chinese_roberta_L-2_H-128), word-based models achieve better results in most cases. Here are the scores on the development set of six Chinese tasks:

| Model | Score | book_review | chnsenticorp | lcqmc | tnews(CLUE) | iflytek(CLUE) | ocnli(CLUE) |
| -------------- | :---: | :----: | :----------: | :---: | :---------: | :-----------: | :---------: |
| RoBERTa-Tiny(char) | 72.3 | 83.4 | 91.4 | 81.8 | 62.0 | 55.0 | 60.3 |
| **RoBERTa-Tiny(word)** | **74.4(+2.1)** | **86.7** | **93.2** | **82.0** | **66.4** | **58.2** | **59.6** |
| RoBERTa-Mini(char) | 75.9 | 85.7 | 93.7 | 86.1 | 63.9 | 58.3 | 67.4 |
| **RoBERTa-Mini(word)** | **76.9(+1.0)** | **88.5** | **94.1** | **85.4** | **66.9** | **59.2** | **67.3** |
| RoBERTa-Small(char) | 76.9 | 87.5 | 93.4 | 86.5 | 65.1 | 59.4 | 69.7 |
| **RoBERTa-Small(word)** | **78.4(+1.5)** | **89.7** | **94.7** | **87.4** | **67.6** | **60.9** | **69.8** |
| RoBERTa-Medium(char) | 78.0 | 88.7 | 94.8 | 88.1 | 65.6 | 59.5 | 71.2 |
| **RoBERTa-Medium(word)** | **79.1(+1.1)** | **90.0** | **95.1** | **88.0** | **67.8** | **60.6** | **73.0** |
| RoBERTa-Base(char) | 79.7 | 90.1 | 95.2 | 89.2 | 67.0 | 60.9 | 75.5 |
| **RoBERTa-Base(word)** | **80.4(+0.7)** | **91.1** | **95.7** | **89.4** | **68.0** | **61.5** | **76.8** |

For each task, we selected the best fine-tuning hyperparameters from the lists below and trained with a sequence length of 128:
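As an illustration of what one such fine-tuning run looks like, here is a minimal sketch using Hugging Face's `Trainer` rather than UER-py (which produced the scores above). The model ID, the TSV file names and columns, and the hyperparameter values are all assumptions for illustration, not the exact settings behind the table:

```python
# Illustrative sketch only: fine-tune a miniature on a binary sentence
# classification task (e.g. book_review). The reported scores were obtained
# with UER-py; this Trainer setup, the model ID, the TSV paths, and the
# hyperparameters are assumptions for illustration.
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

model_id = "uer/roberta-tiny-word-chinese-cluecorpussmall"  # assumed ID pattern
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id, num_labels=2)

# Hypothetical local TSVs with columns "text_a" and "label".
data = load_dataset("csv", data_files={"train": "train.tsv", "dev": "dev.tsv"},
                    delimiter="\t")

def tokenize(batch):
    # Sequence length of 128, matching the setting stated above.
    return tokenizer(batch["text_a"], truncation=True, padding="max_length",
                     max_length=128)

data = data.map(tokenize, batched=True)

args = TrainingArguments(output_dir="finetuned", learning_rate=3e-5,
                         per_device_train_batch_size=32, num_train_epochs=3)
trainer = Trainer(model=model, args=args, train_dataset=data["train"],
                  eval_dataset=data["dev"])
trainer.train()
```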