Adding Evaluation Results
#10
by
leaderboard-pr-bot
- opened
README.md
CHANGED
@@ -2,187 +2,271 @@
|
|
2 |
language:
|
3 |
- de
|
4 |
- en
|
|
|
5 |
tags:
|
6 |
- two stage dpo
|
7 |
- dpo
|
8 |
-
|
9 |
-
license: other
|
10 |
license_name: llama3
|
11 |
license_link: LICENSE
|
12 |
-
extra_gated_prompt:
|
13 |
-
|
14 |
-
|
15 |
-
|
16 |
-
|
17 |
-
"
|
18 |
-
|
19 |
-
|
20 |
-
|
21 |
-
|
22 |
-
|
23 |
-
|
24 |
-
|
25 |
-
|
26 |
-
|
27 |
-
|
28 |
-
"Meta
|
29 |
-
|
30 |
-
|
31 |
-
|
32 |
-
|
33 |
-
|
34 |
-
|
35 |
-
|
36 |
-
|
37 |
-
|
38 |
-
|
39 |
-
|
40 |
-
|
41 |
-
|
42 |
-
|
43 |
-
|
44 |
-
|
45 |
-
|
46 |
-
|
47 |
-
|
48 |
-
|
49 |
-
|
50 |
-
|
51 |
-
|
52 |
-
|
53 |
-
|
54 |
-
|
55 |
-
|
56 |
-
|
57 |
-
|
58 |
-
|
59 |
-
|
60 |
-
|
61 |
-
|
62 |
-
|
63 |
-
|
64 |
-
|
65 |
-
|
66 |
-
|
67 |
-
|
68 |
-
|
69 |
-
|
70 |
-
|
71 |
-
|
72 |
-
|
73 |
-
|
74 |
-
|
75 |
-
|
76 |
-
|
77 |
-
|
78 |
-
|
79 |
-
|
80 |
-
|
81 |
-
|
82 |
-
|
83 |
-
|
84 |
-
|
85 |
-
|
86 |
-
|
87 |
-
|
88 |
-
|
89 |
-
|
90 |
-
|
91 |
-
|
92 |
-
|
93 |
-
|
94 |
-
|
95 |
-
|
96 |
-
|
97 |
-
|
98 |
-
|
99 |
-
|
100 |
-
|
101 |
-
|
102 |
-
|
103 |
-
|
104 |
-
|
105 |
-
|
106 |
-
|
107 |
-
|
108 |
-
|
109 |
-
|
110 |
-
|
111 |
-
|
112 |
-
|
113 |
-
|
114 |
-
|
115 |
-
|
116 |
-
|
117 |
-
|
118 |
-
|
119 |
-
|
120 |
-
|
121 |
-
|
122 |
-
|
123 |
-
|
124 |
-
|
125 |
-
|
126 |
-
|
127 |
-
|
128 |
-
|
129 |
-
|
130 |
-
|
131 |
-
|
132 |
-
|
133 |
-
|
134 |
-
|
135 |
-
|
136 |
-
|
137 |
-
|
138 |
-
|
139 |
-
|
140 |
-
|
141 |
-
|
142 |
-
|
143 |
-
|
144 |
-
|
145 |
-
|
146 |
-
|
147 |
-
|
148 |
-
|
149 |
-
|
150 |
-
|
151 |
-
|
152 |
-
|
153 |
-
|
154 |
-
|
155 |
-
|
156 |
-
|
157 |
-
|
158 |
-
|
159 |
-
|
160 |
-
|
161 |
-
|
162 |
-
2. Generating, promoting, or furthering defamatory content, including the creation of defamatory statements, images, or other content
|
163 |
-
3. Generating, promoting, or further distributing spam
|
164 |
-
4. Impersonating another individual without consent, authorization, or legal right
|
165 |
-
5. Representing that the use of Meta Llama 3 or outputs are human-generated
|
166 |
-
6. Generating or facilitating false online engagement, including fake reviews and other means of fake online engagement
|
167 |
-
4. Fail to appropriately disclose to end users any known dangers of your AI system
|
168 |
-
|
169 |
-
Please report any violation of this Policy, software “bug,” or other problems that could lead to a violation
|
170 |
-
of this Policy through one of the following means:
|
171 |
-
* Reporting issues with the model: [https://github.com/meta-llama/llama3](https://github.com/meta-llama/llama3)
|
172 |
-
* Reporting risky content generated by the model:
|
173 |
-
developers.facebook.com/llama_output_feedback
|
174 |
-
* Reporting bugs and security concerns: facebook.com/whitehat/info
|
175 |
-
* Reporting violations of the Acceptable Use Policy or unlicensed uses of Meta Llama 3: LlamaUseReport@meta.com
|
176 |
extra_gated_fields:
|
177 |
First Name: text
|
178 |
Last Name: text
|
179 |
Date of birth: date_picker
|
180 |
Country: country
|
181 |
Affiliation: text
|
182 |
-
geo: ip_location
|
183 |
-
By clicking Submit below I accept the terms of the license and acknowledge that
|
184 |
-
|
|
|
|
|
|
|
|
|
185 |
extra_gated_button_content: Submit
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
186 |
---
|
187 |
|
188 |
|
@@ -360,3 +444,17 @@ We are also keenly seeking support and investment for our startups, VAGO solutio
|
|
360 |
## Acknowledgement
|
361 |
Many thanks to [Meta](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) for providing such valuable model to the Open-Source community.
|
362 |
Also many thanks to [bartowski](https://huggingface.co/bartowski) for super fast quantification of our Model in GGUF and EXL format.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2 |
language:
|
3 |
- de
|
4 |
- en
|
5 |
+
license: other
|
6 |
tags:
|
7 |
- two stage dpo
|
8 |
- dpo
|
|
|
|
|
9 |
license_name: llama3
|
10 |
license_link: LICENSE
|
11 |
+
extra_gated_prompt: "### META LLAMA 3 COMMUNITY LICENSE AGREEMENT\nMeta Llama 3 Version\
|
12 |
+
\ Release Date: April 18, 2024\n\"Agreement\" means the terms and conditions for\
|
13 |
+
\ use, reproduction, distribution and modification of the Llama Materials set forth\
|
14 |
+
\ herein.\n\"Documentation\" means the specifications, manuals and documentation\
|
15 |
+
\ accompanying Meta Llama 3 distributed by Meta at https://llama.meta.com/get-started/.\n\
|
16 |
+
\"Licensee\" or \"you\" means you, or your employer or any other person or entity\
|
17 |
+
\ (if you are entering into this Agreement on such person or entity’s behalf), of\
|
18 |
+
\ the age required under applicable laws, rules or regulations to provide legal\
|
19 |
+
\ consent and that has legal authority to bind your employer or such other person\
|
20 |
+
\ or entity if you are entering in this Agreement on their behalf.\n\"Meta Llama\
|
21 |
+
\ 3\" means the foundational large language models and software and algorithms,\
|
22 |
+
\ including machine-learning model code, trained model weights, inference-enabling\
|
23 |
+
\ code, training-enabling code, fine-tuning enabling code and other elements of\
|
24 |
+
\ the foregoing distributed by Meta at https://llama.meta.com/llama-downloads.\n\
|
25 |
+
\"Llama Materials\" means, collectively, Meta’s proprietary Meta Llama 3 and Documentation\
|
26 |
+
\ (and any portion thereof) made available under this Agreement.\n\"Meta\" or \"\
|
27 |
+
we\" means Meta Platforms Ireland Limited (if you are located in or, if you are\
|
28 |
+
\ an entity, your principal place of business is in the EEA or Switzerland) and\
|
29 |
+
\ Meta Platforms, Inc. (if you are located outside of the EEA or Switzerland).\n\
|
30 |
+
\ \n1. License Rights and Redistribution.\na. Grant of Rights. You are granted\
|
31 |
+
\ a non-exclusive, worldwide, non-transferable and royalty-free limited license\
|
32 |
+
\ under Meta’s intellectual property or other rights owned by Meta embodied in the\
|
33 |
+
\ Llama Materials to use, reproduce, distribute, copy, create derivative works of,\
|
34 |
+
\ and make modifications to the Llama Materials.\nb. Redistribution and Use.\ni.\
|
35 |
+
\ If you distribute or make available the Llama Materials (or any derivative works\
|
36 |
+
\ thereof), or a product or service that uses any of them, including another AI\
|
37 |
+
\ model, you shall (A) provide a copy of this Agreement with any such Llama Materials;\
|
38 |
+
\ and (B) prominently display “Built with Meta Llama 3” on a related website, user\
|
39 |
+
\ interface, blogpost, about page, or product documentation. If you use the Llama\
|
40 |
+
\ Materials to create, train, fine tune, or otherwise improve an AI model, which\
|
41 |
+
\ is distributed or made available, you shall also include “Llama 3” at the beginning\
|
42 |
+
\ of any such AI model name.\nii. If you receive Llama Materials, or any derivative\
|
43 |
+
\ works thereof, from a Licensee as part of an integrated end user product, then\
|
44 |
+
\ Section 2 of this Agreement will not apply to you.\niii. You must retain in all\
|
45 |
+
\ copies of the Llama Materials that you distribute the following attribution notice\
|
46 |
+
\ within a “Notice” text file distributed as a part of such copies: “Meta Llama\
|
47 |
+
\ 3 is licensed under the Meta Llama 3 Community License, Copyright © Meta Platforms,\
|
48 |
+
\ Inc. All Rights Reserved.”\niv. Your use of the Llama Materials must comply with\
|
49 |
+
\ applicable laws and regulations (including trade compliance laws and regulations)\
|
50 |
+
\ and adhere to the Acceptable Use Policy for the Llama Materials (available at\
|
51 |
+
\ https://llama.meta.com/llama3/use-policy), which is hereby incorporated by reference\
|
52 |
+
\ into this Agreement.\nv. You will not use the Llama Materials or any output or\
|
53 |
+
\ results of the Llama Materials to improve any other large language model (excluding\
|
54 |
+
\ Meta Llama 3 or derivative works thereof).\n2. Additional Commercial Terms. If,\
|
55 |
+
\ on the Meta Llama 3 version release date, the monthly active users of the products\
|
56 |
+
\ or services made available by or for Licensee, or Licensee’s affiliates, is greater\
|
57 |
+
\ than 700 million monthly active users in the preceding calendar month, you must\
|
58 |
+
\ request a license from Meta, which Meta may grant to you in its sole discretion,\
|
59 |
+
\ and you are not authorized to exercise any of the rights under this Agreement\
|
60 |
+
\ unless or until Meta otherwise expressly grants you such rights.\n3. Disclaimer\
|
61 |
+
\ of Warranty. UNLESS REQUIRED BY APPLICABLE LAW, THE LLAMA MATERIALS AND ANY OUTPUT\
|
62 |
+
\ AND RESULTS THEREFROM ARE PROVIDED ON AN “AS IS” BASIS, WITHOUT WARRANTIES OF\
|
63 |
+
\ ANY KIND, AND META DISCLAIMS ALL WARRANTIES OF ANY KIND, BOTH EXPRESS AND IMPLIED,\
|
64 |
+
\ INCLUDING, WITHOUT LIMITATION, ANY WARRANTIES OF TITLE, NON-INFRINGEMENT, MERCHANTABILITY,\
|
65 |
+
\ OR FITNESS FOR A PARTICULAR PURPOSE. YOU ARE SOLELY RESPONSIBLE FOR DETERMINING\
|
66 |
+
\ THE APPROPRIATENESS OF USING OR REDISTRIBUTING THE LLAMA MATERIALS AND ASSUME\
|
67 |
+
\ ANY RISKS ASSOCIATED WITH YOUR USE OF THE LLAMA MATERIALS AND ANY OUTPUT AND RESULTS.\n\
|
68 |
+
4. Limitation of Liability. IN NO EVENT WILL META OR ITS AFFILIATES BE LIABLE UNDER\
|
69 |
+
\ ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, TORT, NEGLIGENCE, PRODUCTS LIABILITY,\
|
70 |
+
\ OR OTHERWISE, ARISING OUT OF THIS AGREEMENT, FOR ANY LOST PROFITS OR ANY INDIRECT,\
|
71 |
+
\ SPECIAL, CONSEQUENTIAL, INCIDENTAL, EXEMPLARY OR PUNITIVE DAMAGES, EVEN IF META\
|
72 |
+
\ OR ITS AFFILIATES HAVE BEEN ADVISED OF THE POSSIBILITY OF ANY OF THE FOREGOING.\n\
|
73 |
+
5. Intellectual Property.\na. No trademark licenses are granted under this Agreement,\
|
74 |
+
\ and in connection with the Llama Materials, neither Meta nor Licensee may use\
|
75 |
+
\ any name or mark owned by or associated with the other or any of its affiliates,\
|
76 |
+
\ except as required for reasonable and customary use in describing and redistributing\
|
77 |
+
\ the Llama Materials or as set forth in this Section 5(a). Meta hereby grants you\
|
78 |
+
\ a license to use “Llama 3” (the “Mark”) solely as required to comply with the\
|
79 |
+
\ last sentence of Section 1.b.i. You will comply with Meta’s brand guidelines (currently\
|
80 |
+
\ accessible at https://about.meta.com/brand/resources/meta/company-brand/ ). All\
|
81 |
+
\ goodwill arising out of your use of the Mark will inure to the benefit of Meta.\n\
|
82 |
+
b. Subject to Meta’s ownership of Llama Materials and derivatives made by or for\
|
83 |
+
\ Meta, with respect to any derivative works and modifications of the Llama Materials\
|
84 |
+
\ that are made by you, as between you and Meta, you are and will be the owner of\
|
85 |
+
\ such derivative works and modifications.\nc. If you institute litigation or other\
|
86 |
+
\ proceedings against Meta or any entity (including a cross-claim or counterclaim\
|
87 |
+
\ in a lawsuit) alleging that the Llama Materials or Meta Llama 3 outputs or results,\
|
88 |
+
\ or any portion of any of the foregoing, constitutes infringement of intellectual\
|
89 |
+
\ property or other rights owned or licensable by you, then any licenses granted\
|
90 |
+
\ to you under this Agreement shall terminate as of the date such litigation or\
|
91 |
+
\ claim is filed or instituted. You will indemnify and hold harmless Meta from and\
|
92 |
+
\ against any claim by any third party arising out of or related to your use or\
|
93 |
+
\ distribution of the Llama Materials.\n6. Term and Termination. The term of this\
|
94 |
+
\ Agreement will commence upon your acceptance of this Agreement or access to the\
|
95 |
+
\ Llama Materials and will continue in full force and effect until terminated in\
|
96 |
+
\ accordance with the terms and conditions herein. Meta may terminate this Agreement\
|
97 |
+
\ if you are in breach of any term or condition of this Agreement. Upon termination\
|
98 |
+
\ of this Agreement, you shall delete and cease use of the Llama Materials. Sections\
|
99 |
+
\ 3, 4 and 7 shall survive the termination of this Agreement.\n7. Governing Law\
|
100 |
+
\ and Jurisdiction. This Agreement will be governed and construed under the laws\
|
101 |
+
\ of the State of California without regard to choice of law principles, and the\
|
102 |
+
\ UN Convention on Contracts for the International Sale of Goods does not apply\
|
103 |
+
\ to this Agreement. The courts of California shall have exclusive jurisdiction\
|
104 |
+
\ of any dispute arising out of this Agreement.\n### Meta Llama 3 Acceptable Use\
|
105 |
+
\ Policy\nMeta is committed to promoting safe and fair use of its tools and features,\
|
106 |
+
\ including Meta Llama 3. If you access or use Meta Llama 3, you agree to this Acceptable\
|
107 |
+
\ Use Policy (“Policy”). The most recent copy of this policy can be found at [https://llama.meta.com/llama3/use-policy](https://llama.meta.com/llama3/use-policy)\n\
|
108 |
+
#### Prohibited Uses\nWe want everyone to use Meta Llama 3 safely and responsibly.\
|
109 |
+
\ You agree you will not use, or allow others to use, Meta Llama 3 to: 1. Violate\
|
110 |
+
\ the law or others’ rights, including to:\n 1. Engage in, promote, generate,\
|
111 |
+
\ contribute to, encourage, plan, incite, or further illegal or unlawful activity\
|
112 |
+
\ or content, such as:\n 1. Violence or terrorism\n 2. Exploitation\
|
113 |
+
\ or harm to children, including the solicitation, creation, acquisition, or dissemination\
|
114 |
+
\ of child exploitative content or failure to report Child Sexual Abuse Material\n\
|
115 |
+
\ 3. Human trafficking, exploitation, and sexual violence\n 4. The\
|
116 |
+
\ illegal distribution of information or materials to minors, including obscene\
|
117 |
+
\ materials, or failure to employ legally required age-gating in connection with\
|
118 |
+
\ such information or materials.\n 5. Sexual solicitation\n 6. Any\
|
119 |
+
\ other criminal activity\n 2. Engage in, promote, incite, or facilitate the\
|
120 |
+
\ harassment, abuse, threatening, or bullying of individuals or groups of individuals\n\
|
121 |
+
\ 3. Engage in, promote, incite, or facilitate discrimination or other unlawful\
|
122 |
+
\ or harmful conduct in the provision of employment, employment benefits, credit,\
|
123 |
+
\ housing, other economic benefits, or other essential goods and services\n 4.\
|
124 |
+
\ Engage in the unauthorized or unlicensed practice of any profession including,\
|
125 |
+
\ but not limited to, financial, legal, medical/health, or related professional\
|
126 |
+
\ practices\n 5. Collect, process, disclose, generate, or infer health, demographic,\
|
127 |
+
\ or other sensitive personal or private information about individuals without rights\
|
128 |
+
\ and consents required by applicable laws\n 6. Engage in or facilitate any action\
|
129 |
+
\ or generate any content that infringes, misappropriates, or otherwise violates\
|
130 |
+
\ any third-party rights, including the outputs or results of any products or services\
|
131 |
+
\ using the Llama Materials\n 7. Create, generate, or facilitate the creation\
|
132 |
+
\ of malicious code, malware, computer viruses or do anything else that could disable,\
|
133 |
+
\ overburden, interfere with or impair the proper working, integrity, operation\
|
134 |
+
\ or appearance of a website or computer system\n2. Engage in, promote, incite,\
|
135 |
+
\ facilitate, or assist in the planning or development of activities that present\
|
136 |
+
\ a risk of death or bodily harm to individuals, including use of Meta Llama 3 related\
|
137 |
+
\ to the following:\n 1. Military, warfare, nuclear industries or applications,\
|
138 |
+
\ espionage, use for materials or activities that are subject to the International\
|
139 |
+
\ Traffic Arms Regulations (ITAR) maintained by the United States Department of\
|
140 |
+
\ State\n 2. Guns and illegal weapons (including weapon development)\n 3.\
|
141 |
+
\ Illegal drugs and regulated/controlled substances\n 4. Operation of critical\
|
142 |
+
\ infrastructure, transportation technologies, or heavy machinery\n 5. Self-harm\
|
143 |
+
\ or harm to others, including suicide, cutting, and eating disorders\n 6. Any\
|
144 |
+
\ content intended to incite or promote violence, abuse, or any infliction of bodily\
|
145 |
+
\ harm to an individual\n3. Intentionally deceive or mislead others, including use\
|
146 |
+
\ of Meta Llama 3 related to the following:\n 1. Generating, promoting, or furthering\
|
147 |
+
\ fraud or the creation or promotion of disinformation\n 2. Generating, promoting,\
|
148 |
+
\ or furthering defamatory content, including the creation of defamatory statements,\
|
149 |
+
\ images, or other content\n 3. Generating, promoting, or further distributing\
|
150 |
+
\ spam\n 4. Impersonating another individual without consent, authorization,\
|
151 |
+
\ or legal right\n 5. Representing that the use of Meta Llama 3 or outputs are\
|
152 |
+
\ human-generated\n 6. Generating or facilitating false online engagement, including\
|
153 |
+
\ fake reviews and other means of fake online engagement\n4. Fail to appropriately\
|
154 |
+
\ disclose to end users any known dangers of your AI system\nPlease report any violation\
|
155 |
+
\ of this Policy, software “bug,” or other problems that could lead to a violation\
|
156 |
+
\ of this Policy through one of the following means:\n * Reporting issues with\
|
157 |
+
\ the model: [https://github.com/meta-llama/llama3](https://github.com/meta-llama/llama3)\n\
|
158 |
+
\ * Reporting risky content generated by the model:\n developers.facebook.com/llama_output_feedback\n\
|
159 |
+
\ * Reporting bugs and security concerns: facebook.com/whitehat/info\n * Reporting\
|
160 |
+
\ violations of the Acceptable Use Policy or unlicensed uses of Meta Llama 3: LlamaUseReport@meta.com"
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
161 |
extra_gated_fields:
|
162 |
First Name: text
|
163 |
Last Name: text
|
164 |
Date of birth: date_picker
|
165 |
Country: country
|
166 |
Affiliation: text
|
167 |
+
geo: ip_location
|
168 |
+
? By clicking Submit below I accept the terms of the license and acknowledge that
|
169 |
+
the information I provide will be collected stored processed and shared in accordance
|
170 |
+
with the Meta Privacy Policy
|
171 |
+
: checkbox
|
172 |
+
extra_gated_description: The information you provide will be collected, stored, processed
|
173 |
+
and shared in accordance with the [Meta Privacy Policy](https://www.facebook.com/privacy/policy/).
|
174 |
extra_gated_button_content: Submit
|
175 |
+
model-index:
|
176 |
+
- name: Llama-3-SauerkrautLM-8b-Instruct
|
177 |
+
results:
|
178 |
+
- task:
|
179 |
+
type: text-generation
|
180 |
+
name: Text Generation
|
181 |
+
dataset:
|
182 |
+
name: IFEval (0-Shot)
|
183 |
+
type: HuggingFaceH4/ifeval
|
184 |
+
args:
|
185 |
+
num_few_shot: 0
|
186 |
+
metrics:
|
187 |
+
- type: inst_level_strict_acc and prompt_level_strict_acc
|
188 |
+
value: 74.45
|
189 |
+
name: strict accuracy
|
190 |
+
source:
|
191 |
+
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct
|
192 |
+
name: Open LLM Leaderboard
|
193 |
+
- task:
|
194 |
+
type: text-generation
|
195 |
+
name: Text Generation
|
196 |
+
dataset:
|
197 |
+
name: BBH (3-Shot)
|
198 |
+
type: BBH
|
199 |
+
args:
|
200 |
+
num_few_shot: 3
|
201 |
+
metrics:
|
202 |
+
- type: acc_norm
|
203 |
+
value: 28.05
|
204 |
+
name: normalized accuracy
|
205 |
+
source:
|
206 |
+
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct
|
207 |
+
name: Open LLM Leaderboard
|
208 |
+
- task:
|
209 |
+
type: text-generation
|
210 |
+
name: Text Generation
|
211 |
+
dataset:
|
212 |
+
name: MATH Lvl 5 (4-Shot)
|
213 |
+
type: hendrycks/competition_math
|
214 |
+
args:
|
215 |
+
num_few_shot: 4
|
216 |
+
metrics:
|
217 |
+
- type: exact_match
|
218 |
+
value: 5.74
|
219 |
+
name: exact match
|
220 |
+
source:
|
221 |
+
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct
|
222 |
+
name: Open LLM Leaderboard
|
223 |
+
- task:
|
224 |
+
type: text-generation
|
225 |
+
name: Text Generation
|
226 |
+
dataset:
|
227 |
+
name: GPQA (0-shot)
|
228 |
+
type: Idavidrein/gpqa
|
229 |
+
args:
|
230 |
+
num_few_shot: 0
|
231 |
+
metrics:
|
232 |
+
- type: acc_norm
|
233 |
+
value: 7.83
|
234 |
+
name: acc_norm
|
235 |
+
source:
|
236 |
+
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct
|
237 |
+
name: Open LLM Leaderboard
|
238 |
+
- task:
|
239 |
+
type: text-generation
|
240 |
+
name: Text Generation
|
241 |
+
dataset:
|
242 |
+
name: MuSR (0-shot)
|
243 |
+
type: TAUR-Lab/MuSR
|
244 |
+
args:
|
245 |
+
num_few_shot: 0
|
246 |
+
metrics:
|
247 |
+
- type: acc_norm
|
248 |
+
value: 11.28
|
249 |
+
name: acc_norm
|
250 |
+
source:
|
251 |
+
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct
|
252 |
+
name: Open LLM Leaderboard
|
253 |
+
- task:
|
254 |
+
type: text-generation
|
255 |
+
name: Text Generation
|
256 |
+
dataset:
|
257 |
+
name: MMLU-PRO (5-shot)
|
258 |
+
type: TIGER-Lab/MMLU-Pro
|
259 |
+
config: main
|
260 |
+
split: test
|
261 |
+
args:
|
262 |
+
num_few_shot: 5
|
263 |
+
metrics:
|
264 |
+
- type: acc
|
265 |
+
value: 31.75
|
266 |
+
name: accuracy
|
267 |
+
source:
|
268 |
+
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct
|
269 |
+
name: Open LLM Leaderboard
|
270 |
---
|
271 |
|
272 |
|
|
|
444 |
## Acknowledgement
|
445 |
Many thanks to [Meta](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) for providing such valuable model to the Open-Source community.
|
446 |
Also many thanks to [bartowski](https://huggingface.co/bartowski) for super fast quantification of our Model in GGUF and EXL format.
|
447 |
+
|
448 |
+
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
449 |
+
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_VAGOsolutions__Llama-3-SauerkrautLM-8b-Instruct)
|
450 |
+
|
451 |
+
| Metric |Value|
|
452 |
+
|-------------------|----:|
|
453 |
+
|Avg. |26.52|
|
454 |
+
|IFEval (0-Shot) |74.45|
|
455 |
+
|BBH (3-Shot) |28.05|
|
456 |
+
|MATH Lvl 5 (4-Shot)| 5.74|
|
457 |
+
|GPQA (0-shot) | 7.83|
|
458 |
+
|MuSR (0-shot) |11.28|
|
459 |
+
|MMLU-PRO (5-shot) |31.75|
|
460 |
+
|