Update README.md
Browse files
README.md
CHANGED
@@ -105,22 +105,22 @@ This DPO-enhanced version aims to:
|
|
105 |
## Evaluation
|
106 |
|
107 |
**AGIEVAL**
|
108 |
-
![SauerkrautLM-v2-14b-DPO-AGIEVAL](https://vago-solutions.ai/wp-content/uploads/2024/11/
|
109 |
|
110 |
**GPT4ALL**
|
111 |
-
![SauerkrautLM-v2-14b-DPO-GPT4ALL](https://vago-solutions.ai/wp-content/uploads/2024/11/
|
112 |
|
113 |
**TRUTHFULQA**
|
114 |
-
![SauerkrautLM-v2-14b-DPO-TRUTHFULQA](https://vago-solutions.ai/wp-content/uploads/2024/11/
|
115 |
|
116 |
**OPENLEADERBOARD 2**
|
117 |
-
![SauerkrautLM-14b-v2-DPO-OPENLEADERBOARD](https://vago-solutions.ai/wp-content/uploads/2024/11/
|
118 |
|
119 |
**MMLU 5-shot**
|
120 |
-
![SauerkrautLM-14b-v2-DPO-MMLU-5shot](https://vago-solutions.ai/wp-content/uploads/2024/11/
|
121 |
|
122 |
**Berkeley Function Calling Leaderboard**
|
123 |
-
![SauerkrautLM-v2-14b-DPO-BERKELEY](https://vago-solutions.ai/wp-content/uploads/2024/11/
|
124 |
|
125 |
Please note that our benchmark results in absolute numbers may differ from the Hugging Face Leaderboard due to variations in benchmark evaluation pipelines. However, the relative differences remain consistent.
|
126 |
|
|
|
105 |
## Evaluation
|
106 |
|
107 |
**AGIEVAL**
|
108 |
+
![SauerkrautLM-v2-14b-DPO-AGIEVAL](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-AGIEVAL.png "SauerkrautLM-v2-14b-DPO-AGIEVAL")
|
109 |
|
110 |
**GPT4ALL**
|
111 |
+
![SauerkrautLM-v2-14b-DPO-GPT4ALL](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-GPT4ALL.png "SauerkrautLM-v2-14b-DPO-GPT4ALL")
|
112 |
|
113 |
**TRUTHFULQA**
|
114 |
+
![SauerkrautLM-v2-14b-DPO-TRUTHFULQA](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-TRUTHFULQA.png "SauerkrautLM-v2-14b-DPO-TRUTHFULQA")
|
115 |
|
116 |
**OPENLEADERBOARD 2**
|
117 |
+
![SauerkrautLM-14b-v2-DPO-OPENLEADERBOARD](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-OPENLEADERBOARD.png "SauerkrautLM-v2-14b-DPO-OPENLEADERBOARD")
|
118 |
|
119 |
**MMLU 5-shot**
|
120 |
+
![SauerkrautLM-14b-v2-DPO-MMLU-5shot](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-MMLU-5shot.png "SauerkrautLM-v2-14b-DPO-MMLU-5shot")
|
121 |
|
122 |
**Berkeley Function Calling Leaderboard**
|
123 |
+
![SauerkrautLM-v2-14b-DPO-BERKELEY](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-BERKELEY.png "SauerkrautLM-v2-14b-DPO-BERKELEY")
|
124 |
|
125 |
Please note that our benchmark results in absolute numbers may differ from the Hugging Face Leaderboard due to variations in benchmark evaluation pipelines. However, the relative differences remain consistent.
|
126 |
|