YC-Chen committed on
Commit 2359491 · 1 Parent(s): 6979fc8

Update README.md

Files changed (1)
  1. README.md +43 -33
README.md CHANGED
@@ -26,28 +26,50 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
  - **Model type:** Causal decoder-only transformer language model
  - **Language:** English and Traditional Chinese (zh-tw)

- ## Performance
-
- | **[Traditional Chinese Benchmarks]** | TMMLU+ 5-shot (ACC) | DRCD 3-shot (EM) | MT-Bench-tw (Score) |
- |-----------------------------------------------------|---------------------|------------------|---------------------|
- | MediaTek-Research/Breeze-7B-Base-v0.1 | | | - |
- | MediaTek-Research/Breeze-7B-Instruct-v0.1 | | | |
- | mistralai/Mistral-7B-v0.1 | | | - |
- | mistralai/Mistral-7B-Instruct-v0.1 | | | |
- | yentinglin/Taiwan-LLM-7B-v2.1-base | | | - |
- | yentinglin/Taiwan-LLM-7B-v2.1-chat | | | |
- | yentinglin/Taiwan-LLM-13B-v2.0-base | | | - |
- | yentinglin/Taiwan-LLM-13B-v2.0-chat | | | |
- | 01-ai/Yi-6B | | | - |
- | 01-ai/Yi-6B-Chat | | | |
- | 01-ai/Yi-34B | | | - |
- | 01-ai/Yi-34B-Chat | | | |
- | Qwen/Qwen-7B | | | - |
- | Qwen/Qwen-7B-Chat | | | |
- | Qwen/Qwen-14B | | | - |
- | Qwen/Qwen-14B-Chat | | | |
- | gpt-3.5-turbo-0613 | | 76.30 | |
+ ## Base Model Performance
+
+ | Models | TMMLU+ (ACC) | DRCD (EM) | MMLU (ACC) |
+ |-----------------------------------------------------|--------------|-----------|------------|
+ | | 5 shot | 3 shot | 5 shot |
+ | MediaTek-Research/Breeze-7B-Base-v0.1 | | | |
+ | mistralai/Mistral-7B-v0.1 | | | |
+ | yentinglin/Taiwan-LLM-7B-v2.1-base | | | |
+ | yentinglin/Taiwan-LLM-13B-v2.0-base | | | |
+ | 01-ai/Yi-6B | | | |
+ | 01-ai/Yi-34B | | | |
+ | Qwen/Qwen-7B | | | |
+ | Qwen/Qwen-14B | | | |
+
+ ## Inference Performance
+
+ | Models | Speed (char/sec) | Compression Ratio | Max Character Size |
+ |--------------------------------------------------------------------|-------------------|-------------------|--------------------|
+ | MediaTek-Research/Breeze-7B-Base-v0.1 | | | |
+ | mistralai/Mistral-7B-v0.1 | | | |
+ | yentinglin/Taiwan-LLM-7B-v2.1-base | | | |
+ | yentinglin/Taiwan-LLM-13B-v2.0-base | | | |
+ | 01-ai/Yi-6B | | | |
+ | 01-ai/Yi-34B | | | |
+ | Qwen/Qwen-7B | | | |
+ | Qwen/Qwen-14B | | | |
+
+ ## Chat Model Performance
+
+ | Models | TMMLU+ (ACC) | DRCD (EM) | MT-Bench-tw (Score) | MMLU (ACC) | MT-Bench (Score) |
+ |-----------------------------------------------------|--------------|-----------|---------------------|------------|------------------|
+ | | 5 shot | 3 shot | 0 shot | 5 shot | 0 shot |
+ | MediaTek-Research/Breeze-7B-Instruct-v0.1 | | | | | |
+ | mistralai/Mistral-7B-Instruct-v0.1 | | | | | |
+ | yentinglin/Taiwan-LLM-7B-v2.1-chat | | | | | |
+ | yentinglin/Taiwan-LLM-13B-v2.0-chat | | | | | |
+ | 01-ai/Yi-6B-Chat | | | | | |
+ | 01-ai/Yi-34B-Chat | | | | | |
+ | Qwen/Qwen-7B-Chat | | | | | |
+ | Qwen/Qwen-14B-Chat | | | | | |
+ | gpt-3.5-turbo-0613 | | 76.30 | | | |
+

+ ## dupeca.

  | **[English Benchmarks]** | MMLU (ACC) <br/> 5-shot, log-likelihood, w/o chat template | MT-Bench (Score) <br/>|
  |-----------------------------------------------------|---------------------|------------------|
@@ -70,18 +92,6 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
  | gpt-3.5-turbo-0613 | | |


- | **[Inference Metrics on Traditional Chinese]** | Speed (char/sec) | Compression Ratio | Max Character Size |
- |--------------------------------------------------------------------|-------------------|-------------------|--------------------|
- | MediaTek-Research/Breeze-7B-Base-v0.1 | | | |
- | mistralai/Mistral-7B-v0.1 | | | |
- | yentinglin/Taiwan-LLM-7B-v2.1-base | | | |
- | yentinglin/Taiwan-LLM-13B-v2.0-base | | | |
- | 01-ai/Yi-6B | | | |
- | 01-ai/Yi-34B | | | |
- | Qwen/Qwen-7B | | | |
- | Qwen/Qwen-14B | | | |
-
-
  ## Use in Transformers

  First install direct dependencies:
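
The dependency list and usage snippet that follow this line are outside the hunks shown in this commit. For orientation only, a minimal loading sketch along the lines the "Use in Transformers" section implies is given below; the `transformers`/`torch`/`accelerate` dependencies, the bfloat16 and `device_map` settings, and the example prompt are assumptions of this sketch, not the commit's own code.

```python
# Hedged sketch: loading Breeze-7B via the standard Hugging Face transformers API.
# The model ID comes from the tables above; dtype, device_map, and the prompt are assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "MediaTek-Research/Breeze-7B-Instruct-v0.1"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumed half-precision load; use float32 on CPU-only setups
    device_map="auto",           # requires the accelerate package
)

# Short Traditional Chinese prompt, matching the model's zh-tw focus.
inputs = tokenizer("請簡短介紹台灣的夜市文化。", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Installing the direct dependencies would typically amount to `pip install transformers torch accelerate`, though the exact pinned list lives in the README body rather than in this diff.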