Norquinal leaderboard-pr-bot committed on
Commit
8be3652
1 Parent(s): e65d34e

Adding Evaluation Results (#1)

- Adding Evaluation Results (e80497fd95a7fd60db3412aa93c9bc53d2e0ec6c)


Co-authored-by: Open LLM Leaderboard PR Bot <leaderboard-pr-bot@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +14 -1
README.md CHANGED
@@ -26,4 +26,17 @@ ASSISTANT:
  >
  >A common cooking method is stir-frying where ingredients like chicken, beef or vegetables are quickly sautéed in oil at high heat until just cooked through. Stir fried green beans would be an example.
  >
- >Tea has been consumed for thousands of years as part of daily life in China. It is served before meals to cleanse palates and afterward to aid digestion. Teas range from mildly fragrant white teas to robust oolong varieties.
+ >Tea has been consumed for thousands of years as part of daily life in China. It is served before meals to cleanse palates and afterward to aid digestion. Teas range from mildly fragrant white teas to robust oolong varieties.
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Norquinal__llama-2-7b-claude-chat)
+
+ | Metric | Value |
+ |-----------------------|---------------------------|
+ | Avg. | 44.54 |
+ | ARC (25-shot) | 54.44 |
+ | HellaSwag (10-shot) | 80.66 |
+ | MMLU (5-shot) | 46.74 |
+ | TruthfulQA (0-shot) | 41.39 |
+ | Winogrande (5-shot) | 74.9 |
+ | GSM8K (5-shot) | 7.73 |
+ | DROP (3-shot) | 5.89 |
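
The `Avg.` row is consistent with an unweighted mean of the seven benchmark scores in the table. The sketch below checks that arithmetic. It assumes equal weighting of all seven tasks, which this commit does not state; the `scores` dictionary and the `unweighted_mean` helper are illustrative names, not part of any leaderboard tooling.

```python
# Scores copied from the table added to README.md in this commit.
scores = {
    "ARC (25-shot)": 54.44,
    "HellaSwag (10-shot)": 80.66,
    "MMLU (5-shot)": 46.74,
    "TruthfulQA (0-shot)": 41.39,
    "Winogrande (5-shot)": 74.9,
    "GSM8K (5-shot)": 7.73,
    "DROP (3-shot)": 5.89,
}


def unweighted_mean(values):
    """Return the plain arithmetic mean (assumption: every task weighs the same)."""
    values = list(values)
    return sum(values) / len(values)


avg = unweighted_mean(scores.values())
print(f"{avg:.2f}")  # prints 44.54, matching the Avg. row
```

If the reported average ever differs from this value, the most likely causes are a different task set or a weighted scheme, so it is worth checking which metrics the table includes before comparing models.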