MediaTek-Research/Breeze-7B-Base-v0_1

@@ -25,51 +25,23 @@ and is comparable with Mistral-7B-v0.1 on MMLU and MT-Bench in English.
 - **Model type:** Causal decoder-only transformer language model
 - **Language:** English and Traditional Chinese (zh-tw)
-## Performance
-| **[Traditional Chinese Benchmarks]**                | TMMLU+ 5-shot (ACC) | DRCD 5-shot (EM) | MT-Bench-tw (Score) |
-|-----------------------------------------------------|---------------------|------------------|---------------------|
-| MediaTek-Research/Breeze-7B-Base-v0.1               |                     |                  |     -               |
-| MediaTek-Research/Breeze-7B-Instruct-v0.1           |                     |                  |                     |
-| mistralai/Mistral-7B-v0.1                           |                     |                  |     -               |
-| mistralai/Mistral-7B-Instruct-v0.1                  |                     |                  |                     |
-| yentinglin/Taiwan-LLM-7B-v2.1-base                  |                     |                  |     -               |
-| yentinglin/Taiwan-LLM-7B-v2.1-chat                  |                     |                  |                     |
-| yentinglin/Taiwan-LLM-13B-v2.0-base                 |                     |                  |     -               |
-| yentinglin/Taiwan-LLM-13B-v2.0-chat                 |                     |                  |                     |
-| 01-ai/Yi-6B-Base                                    |                     |                  |     -               |
-| 01-ai/Yi-6B-Chat                                    |                     |                  |                     |
-| 01-ai/Yi-34B-Base                                   |                     |                  |     -               |
-| 01-ai/Yi-34B-Chat                                   |                     |                  |                     |
-| Qwen/Qwen-7B                                        |                     |                  |     -               |
-| Qwen/Qwen-7B-Chat                                   |                     |                  |                     |
-| Qwen/Qwen-14B                                       |                     |                  |     -               |
-| Qwen/Qwen-14B-Chat                                  |                     |                  |                     |
-| gpt-3.5-turbo-0613                                  |                     |                  |                     |
-| **[English Benchmarks]**                            | MMLU 5-shot (ACC)   | MT-Bench (Score) |
-|-----------------------------------------------------|---------------------|------------------|
-| MediaTek-Research/Breeze-7B-Base-v0.1               |                     |     -            |
-| MediaTek-Research/Breeze-7B-Instruct-v0.1           |                     |                  |
-| mistralai/Mistral-7B-v0.1                           |                     |     -            |
-| mistralai/Mistral-7B-Instruct-v0.1                  |                     |                  |
-| yentinglin/Taiwan-LLM-7B-v2.1-base                  |                     |     -            |
-| yentinglin/Taiwan-LLM-7B-v2.1-chat                  |                     |                  |
-| yentinglin/Taiwan-LLM-13B-v2.0-base                 |                     |     -            |
-| yentinglin/Taiwan-LLM-13B-v2.0-chat                 |                     |     -            |
-| 01-ai/Yi-6B-Base                                    |                     |     -            |
-| 01-ai/Yi-6B-Chat                                    |                     |                  |
-| 01-ai/Yi-34B-Base                                   |                     |     -            |
-| 01-ai/Yi-34B-Chat                                   |                     |                  |
-| Qwen/Qwen-7B                                        |                     |     -            |
-| Qwen/Qwen-7B-Chat                                   |                     |                  |
-| Qwen/Qwen-14B                                       |                     |     -            |
-| Qwen/Qwen-14B-Chat                                  |                     |                  |
-| gpt-3.5-turbo-0613                                  |                     |                  |
-| **[Inference Metrics on Traditional Chinese]**                     | Speed (char/sec)  | Compression Ratio | Max Character Size |
 |--------------------------------------------------------------------|-------------------|-------------------|--------------------|
 | MediaTek-Research/Breeze-7B-Base-v0.1                              |                   |                   |                    |
 | mistralai/Mistral-7B-v0.1                                          |                   |                   |                    |
@@ -80,6 +52,22 @@ and is comparable with Mistral-7B-v0.1 on MMLU and MT-Bench in English.
 | Qwen/Qwen-7B                                                       |                   |                   |                    |
 | Qwen/Qwen-14B                                                      |                   |                   |                    |
 ## Use in Transformers

 - **Model type:** Causal decoder-only transformer language model
 - **Language:** English and Traditional Chinese (zh-tw)
+## Base Model Performance
+| Models                                              | TMMLU+ (ACC) | DRCD (EM) | MMLU (ACC) |
+|-----------------------------------------------------|--------------|-----------|------------|
+|                                                     | 5-shot       | 3-shot    | 5-shot     |
+| MediaTek-Research/Breeze-7B-Base-v0.1               |              |           |            |
+| mistralai/Mistral-7B-v0.1                           |              |           |            |
+| yentinglin/Taiwan-LLM-7B-v2.1-base                  |              |           |            |
+| yentinglin/Taiwan-LLM-13B-v2.0-base                 |              |           |            |
+| 01-ai/Yi-6B                                         |              |           |            |
+| 01-ai/Yi-34B                                        |              |           |            |
+| Qwen/Qwen-7B                                        |              |           |            |
+| Qwen/Qwen-14B                                       |              |           |            |
+## Inference Performance
+| Models                                                             | Speed (char/sec)  | Compression Ratio | Max Character Size |
 |--------------------------------------------------------------------|-------------------|-------------------|--------------------|
 | MediaTek-Research/Breeze-7B-Base-v0.1                              |                   |                   |                    |
 | mistralai/Mistral-7B-v0.1                                          |                   |                   |                    |
 | Qwen/Qwen-7B                                                       |                   |                   |                    |
 | Qwen/Qwen-14B                                                      |                   |                   |                    |
+## Chat Model Performance
+| Models                                              | TMMLU+ (ACC) | DRCD (EM) | MT-Bench-tw (Score) | MMLU (ACC) | MT-Bench (Score) |
+|-----------------------------------------------------|--------------|-----------|---------------------|------------|------------------|
+|                                                     | 5-shot       | 3-shot    | 0-shot              | 5-shot     | 0-shot           |
+| MediaTek-Research/Breeze-7B-Instruct-v0.1           |              |           |                     |            |                  |
+| mistralai/Mistral-7B-Instruct-v0.1                  |              |           |                     |            |                  |
+| yentinglin/Taiwan-LLM-7B-v2.1-chat                  |              |           |                     |            |                  |
+| yentinglin/Taiwan-LLM-13B-v2.0-chat                 |              |           |                     |            |                  |
+| 01-ai/Yi-6B-Chat                                    |              |           |                     |            |                  |
+| 01-ai/Yi-34B-Chat                                   |              |           |                     |            |                  |
+| Qwen/Qwen-7B-Chat                                   |              |           |                     |            |                  |
+| Qwen/Qwen-14B-Chat                                  |              |           |                     |            |                  |
+| gpt-3.5-turbo-0613                                  |              | 76.30     |                     |            |                  |
 ## Use in Transformers