Update README.md
Browse files
README.md
CHANGED
@@ -43,7 +43,7 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
|
|
43 |
|
44 |
\* Few-shot learning cannot effectively guide the model to generate the proper answer.
|
45 |
|
46 |
-
| Category ACC of TMMLU+
|
47 |
|-----------------------------------------------------|--------------|----------------|------------|------------|
|
48 |
| 01-ai/Yi-6B | 41.14 | 57.77 | 50.22 | 49.39 |
|
49 |
| MediaTek-Research/Breeze-7B-Base-v0.1 | 35.74 | 46.08 | 40.29 | 39.27 |
|
@@ -85,7 +85,7 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
|
|
85 |
|
86 |
\* Taiwan-LLM models responds to multi-turn questions (English) in Traditional Chinese.
|
87 |
|
88 |
-
| Category ACC of TMMLU+
|
89 |
|-----------------------------------------------------|--------------|----------------|------------|------------|
|
90 |
| 01-ai/Yi-6B-Chat | 26.28 | 33.48 | 29.48 | 27.62 |
|
91 |
| MediaTek-Research/Breeze-7B-Instruct-v0.1 | 37.45 | 48.35 | 40.26 | 40.44 |
|
|
|
43 |
|
44 |
\* Few-shot learning cannot effectively guide the model to generate the proper answer.
|
45 |
|
46 |
+
| Category ACC of TMMLU+ | STEM | Social Science | Humanities | Other |
|
47 |
|-----------------------------------------------------|--------------|----------------|------------|------------|
|
48 |
| 01-ai/Yi-6B | 41.14 | 57.77 | 50.22 | 49.39 |
|
49 |
| MediaTek-Research/Breeze-7B-Base-v0.1 | 35.74 | 46.08 | 40.29 | 39.27 |
|
|
|
85 |
|
86 |
\* Taiwan-LLM models responds to multi-turn questions (English) in Traditional Chinese.
|
87 |
|
88 |
+
| Category ACC of TMMLU+ | STEM | Social Science | Humanities | Other |
|
89 |
|-----------------------------------------------------|--------------|----------------|------------|------------|
|
90 |
| 01-ai/Yi-6B-Chat | 26.28 | 33.48 | 29.48 | 27.62 |
|
91 |
| MediaTek-Research/Breeze-7B-Instruct-v0.1 | 37.45 | 48.35 | 40.26 | 40.44 |
|