YC-Chen commited on
Commit
5a8b07b
1 Parent(s): 6709dd2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -43,7 +43,7 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
43
 
44
  \* Few-shot learning cannot effectively guide the model to generate the proper answer.
45
 
46
- | Category ACC of TMMLU+ Benchmark | STEM | Social Science | Humanities | Other |
47
  |-----------------------------------------------------|--------------|----------------|------------|------------|
48
  | 01-ai/Yi-6B | 41.14 | 57.77 | 50.22 | 49.39 |
49
  | MediaTek-Research/Breeze-7B-Base-v0.1 | 35.74 | 46.08 | 40.29 | 39.27 |
@@ -85,7 +85,7 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
85
 
86
  \* Taiwan-LLM models responds to multi-turn questions (English) in Traditional Chinese.
87
 
88
- | Category ACC of TMMLU+ Benchmark | STEM | Social Science | Humanities | Other |
89
  |-----------------------------------------------------|--------------|----------------|------------|------------|
90
  | 01-ai/Yi-6B-Chat | 26.28 | 33.48 | 29.48 | 27.62 |
91
  | MediaTek-Research/Breeze-7B-Instruct-v0.1 | 37.45 | 48.35 | 40.26 | 40.44 |
 
43
 
44
  \* Few-shot learning cannot effectively guide the model to generate the proper answer.
45
 
46
+ | Category ACC of TMMLU+ | STEM | Social Science | Humanities | Other |
47
  |-----------------------------------------------------|--------------|----------------|------------|------------|
48
  | 01-ai/Yi-6B | 41.14 | 57.77 | 50.22 | 49.39 |
49
  | MediaTek-Research/Breeze-7B-Base-v0.1 | 35.74 | 46.08 | 40.29 | 39.27 |
 
85
 
86
  \* Taiwan-LLM models responds to multi-turn questions (English) in Traditional Chinese.
87
 
88
+ | Category ACC of TMMLU+ | STEM | Social Science | Humanities | Other |
89
  |-----------------------------------------------------|--------------|----------------|------------|------------|
90
  | 01-ai/Yi-6B-Chat | 26.28 | 33.48 | 29.48 | 27.62 |
91
  | MediaTek-Research/Breeze-7B-Instruct-v0.1 | 37.45 | 48.35 | 40.26 | 40.44 |