Update README.md
README.md CHANGED
@@ -1,11 +1,11 @@
----
-license: apache-2.0
-language:
-- en
-- zh
-tags:
-- moe
----
+---
+license: apache-2.0
+language:
+- en
+- zh
+tags:
+- moe
+---
 # AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies
 <p align="center">
 <br>
@@ -142,6 +142,10 @@ The performance of the AquilaMoE model series improves significantly across mult
 | mmlu-ppl | 59.93 |
 | winograd-ppl | 57.5 |
 
+| Model | GPT 3.5 Turbo (11/06) | GPT 3.5 Turbo (03/01) | AquilaMoE-SFT |
+|------------------|-----------------------|-----------------------|---------------|
+| AlpacaEval 2.0 | 19.3 | 18.1 | 21.1 |
+
 *Table: Performance of AquilaMoE-SFT (16\*8B) on various benchmarks.*
 
 ## License Agreement
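For reference, the eight lines touched by the first hunk form the YAML metadata header that the Hugging Face Hub parses to populate the model card (license display, language filters, and tags). The block below is simply those added lines reassembled as they sit at the top of README.md after this commit, with descriptive comments added; the comments are editorial, not part of the committed file:

```yaml
---
license: apache-2.0   # license shown on the model page
language:             # languages the model supports, used for Hub filtering
- en
- zh
tags:                 # free-form tags; "moe" marks this as a mixture-of-experts model
- moe
---
```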