Adding Evaluation Results

#6
Files changed (1) hide show
  1. README.md +14 -1
README.md CHANGED
@@ -84,4 +84,17 @@ Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams,
84
  Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom},
85
  year={2023}
86
  }
87
- ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
84
  Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom},
85
  year={2023}
86
  }
87
+ ```
88
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
89
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_uni-tianyan__Uni-TianYan)
90
+
91
+ | Metric | Value |
92
+ |-----------------------|---------------------------|
93
+ | Avg. | 62.78 |
94
+ | ARC (25-shot) | 72.1 |
95
+ | HellaSwag (10-shot) | 87.4 |
96
+ | MMLU (5-shot) | 69.91 |
97
+ | TruthfulQA (0-shot) | 65.81 |
98
+ | Winogrande (5-shot) | 82.32 |
99
+ | GSM8K (5-shot) | 22.14 |
100
+ | DROP (3-shot) | 39.79 |