update readme
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ For most benchmarks, we use the same evaluation methodology as in the Open LLM l
|
|
20 |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
21 |
|shot|||25|10|5|0|5|5||3|0|
|
22 |
||||acc_norm|acc_norm|acc|mc2|acc|acc||Pass@1|Pass@1|
|
23 |
-
|LLaMA2-7B|7B|2T|53.1|78.6|46.9|38.8
|
24 |
|LLaMA-13B|13B|1T|**56.2**|**80.9**|47.7|39.5|**76.2**|7.6|51.4|22.0|15.8|
|
25 |
|DeepseekMoE-16B|2.8B|2T|53.2|79.8|46.3|36.1|73.7|17.3|51.1|34.0|**25.0**|
|
26 |
|Gemma-2B|2B|2T|48.4|71.8|41.8|33.1|66.3|16.9|46.4|28.0|24.4|
|
|
|
20 |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
21 |
|shot|||25|10|5|0|5|5||3|0|
|
22 |
||||acc_norm|acc_norm|acc|mc2|acc|acc||Pass@1|Pass@1|
|
23 |
+
|LLaMA2-7B|7B|2T|53.1|78.6|46.9|38.8|74|14.5|51.0|20.8|12.8|
|
24 |
|LLaMA-13B|13B|1T|**56.2**|**80.9**|47.7|39.5|**76.2**|7.6|51.4|22.0|15.8|
|
25 |
|DeepseekMoE-16B|2.8B|2T|53.2|79.8|46.3|36.1|73.7|17.3|51.1|34.0|**25.0**|
|
26 |
|Gemma-2B|2B|2T|48.4|71.8|41.8|33.1|66.3|16.9|46.4|28.0|24.4|
|