Commit 02936d1 by DavidGF (1 parent: f8e167c)

Update README.md

Files changed (1)
  1. README.md +6 -6
README.md CHANGED
@@ -105,22 +105,22 @@ This DPO-enhanced version aims to:
 ## Evaluation
 
 **AGIEVAL**
-![SauerkrautLM-v2-14b-DPO-AGIEVAL](https://vago-solutions.ai/wp-content/uploads/2024/11/AGIeval-14b-dpo.png "SauerkrautLM-v2-14b-DPO-AGIEVAL")
+![SauerkrautLM-v2-14b-DPO-AGIEVAL](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-AGIEVAL.png "SauerkrautLM-v2-14b-DPO-AGIEVAL")
 
 **GPT4ALL**
-![SauerkrautLM-v2-14b-DPO-GPT4ALL](https://vago-solutions.ai/wp-content/uploads/2024/11/GPT4ALL-14b-dpo.png "SauerkrautLM-v2-14b-DPO-GPT4ALL")
+![SauerkrautLM-v2-14b-DPO-GPT4ALL](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-GPT4ALL.png "SauerkrautLM-v2-14b-DPO-GPT4ALL")
 
 **TRUTHFULQA**
-![SauerkrautLM-v2-14b-DPO-TRUTHFULQA](https://vago-solutions.ai/wp-content/uploads/2024/11/TQA-14b-dpo.png "SauerkrautLM-v2-14b-DPO-TRUTHFULQA")
+![SauerkrautLM-v2-14b-DPO-TRUTHFULQA](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-TRUTHFULQA.png "SauerkrautLM-v2-14b-DPO-TRUTHFULQA")
 
 **OPENLEADERBOARD 2**
-![SauerkrautLM-14b-v2-DPO-OPENLEADERBOARD](https://vago-solutions.ai/wp-content/uploads/2024/11/HF2-14b-dpo.png "SauerkrautLM-v2-14b-DPO-OPENLEADERBOARD")
+![SauerkrautLM-14b-v2-DPO-OPENLEADERBOARD](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-OPENLEADERBOARD.png "SauerkrautLM-v2-14b-DPO-OPENLEADERBOARD")
 
 **MMLU 5-shot**
-![SauerkrautLM-14b-v2-DPO-MMLU-5shot](https://vago-solutions.ai/wp-content/uploads/2024/11/MMLU-14b-dpo.png "SauerkrautLM-v2-14b-DPO-MMLU-5shot")
+![SauerkrautLM-14b-v2-DPO-MMLU-5shot](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-MMLU-5shot.png "SauerkrautLM-v2-14b-DPO-MMLU-5shot")
 
 **Berkeley Function Calling Leaderboard**
-![SauerkrautLM-v2-14b-DPO-BERKELEY](https://vago-solutions.ai/wp-content/uploads/2024/11/Berkeley-14b-dpo.png "SauerkrautLM-v2-14b-DPO-BERKELEY")
+![SauerkrautLM-v2-14b-DPO-BERKELEY](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-BERKELEY.png "SauerkrautLM-v2-14b-DPO-BERKELEY")
 
 Please note that our benchmark results in absolute numbers may differ from the Hugging Face Leaderboard due to variations in benchmark evaluation pipelines. However, the relative differences remain consistent.
 