NotAiLOL committed
Commit 571015d
1 parent: dd4b87a

Update README.md

Files changed (1)
  1. README.md +126 -1
README.md CHANGED
@@ -10,6 +10,118 @@ tags:
  - trl
  - sft
  base_model: HuggingFaceH4/zephyr-7b-beta
  ---

  # Uploaded model
@@ -20,4 +132,17 @@ base_model: HuggingFaceH4/zephyr-7b-beta

  This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.

- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
  - trl
  - sft
  base_model: HuggingFaceH4/zephyr-7b-beta
+ model-index:
+   - name: Athena-zephyr-7B
+     results:
+       - task:
+           type: text-generation
+           name: Text Generation
+         dataset:
+           name: AI2 Reasoning Challenge (25-Shot)
+           type: ai2_arc
+           config: ARC-Challenge
+           split: test
+           args:
+             num_few_shot: 25
+         metrics:
+           - type: acc_norm
+             value: 56.14
+             name: normalized accuracy
+         source:
+           url: >-
+             https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=NotAiLOL/Athena-zephyr-7B
+           name: Open LLM Leaderboard
+       - task:
+           type: text-generation
+           name: Text Generation
+         dataset:
+           name: HellaSwag (10-Shot)
+           type: hellaswag
+           split: validation
+           args:
+             num_few_shot: 10
+         metrics:
+           - type: acc_norm
+             value: 81.63
+             name: normalized accuracy
+         source:
+           url: >-
+             https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=NotAiLOL/Athena-zephyr-7B
+           name: Open LLM Leaderboard
+       - task:
+           type: text-generation
+           name: Text Generation
+         dataset:
+           name: MMLU (5-Shot)
+           type: cais/mmlu
+           config: all
+           split: test
+           args:
+             num_few_shot: 5
+         metrics:
+           - type: acc
+             value: 59.8
+             name: accuracy
+         source:
+           url: >-
+             https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=NotAiLOL/Athena-zephyr-7B
+           name: Open LLM Leaderboard
+       - task:
+           type: text-generation
+           name: Text Generation
+         dataset:
+           name: TruthfulQA (0-shot)
+           type: truthful_qa
+           config: multiple_choice
+           split: validation
+           args:
+             num_few_shot: 0
+         metrics:
+           - type: mc2
+             value: 54.2
+         source:
+           url: >-
+             https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=NotAiLOL/Athena-zephyr-7B
+           name: Open LLM Leaderboard
+       - task:
+           type: text-generation
+           name: Text Generation
+         dataset:
+           name: Winogrande (5-shot)
+           type: winogrande
+           config: winogrande_xl
+           split: validation
+           args:
+             num_few_shot: 5
+         metrics:
+           - type: acc
+             value: 76.16
+             name: accuracy
+         source:
+           url: >-
+             https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=NotAiLOL/Athena-zephyr-7B
+           name: Open LLM Leaderboard
+       - task:
+           type: text-generation
+           name: Text Generation
+         dataset:
+           name: GSM8k (5-shot)
+           type: gsm8k
+           config: main
+           split: test
+           args:
+             num_few_shot: 5
+         metrics:
+           - type: acc
+             value: 28.13
+             name: accuracy
+         source:
+           url: >-
+             https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=NotAiLOL/Athena-zephyr-7B
+           name: Open LLM Leaderboard
+ datasets:
+   - nickrosh/Evol-Instruct-Code-80k-v1
+ ---
  ---

  # Uploaded model
 
  This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.

+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
+
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_NotAiLOL__Athena-zephyr-7B)
+
+ |             Metric              |Value|
+ |---------------------------------|----:|
+ |Avg.                             |59.34|
+ |AI2 Reasoning Challenge (25-Shot)|56.14|
+ |HellaSwag (10-Shot)              |81.63|
+ |MMLU (5-Shot)                    |59.80|
+ |TruthfulQA (0-shot)              |54.20|
+ |Winogrande (5-shot)              |76.16|
+ |GSM8k (5-shot)                   |28.13|
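
The `Avg.` row in the table above appears to be the unweighted mean of the six benchmark scores. The README does not say how it was computed, so this is an assumption. A minimal Python sketch that checks the arithmetic, using only the values shown in the table (the `scores` dictionary and the `leaderboard_average` helper are illustrative names, not part of the commit):

```python
# Scores copied from the results table in the README diff (percent).
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 56.14,
    "HellaSwag (10-Shot)": 81.63,
    "MMLU (5-Shot)": 59.80,
    "TruthfulQA (0-shot)": 54.20,
    "Winogrande (5-shot)": 76.16,
    "GSM8k (5-shot)": 28.13,
}


def leaderboard_average(values):
    """Unweighted arithmetic mean, rounded to two decimals.

    Assumption: every benchmark carries equal weight in the average.
    """
    values = list(values)
    return round(sum(values) / len(values), 2)


average = leaderboard_average(scores.values())
print(f"Avg. {average:.2f}")
```

The six scores sum to 356.06, and 356.06 / 6 rounds to 59.34, which matches the `Avg.` row.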