MaziyarPanahi/Qwen2-72B-Instruct-v0.1


Update README.md

#3

by MaziyarPanahi - opened Jun 8

base: refs/heads/main ← from: refs/pr/3


README.md +48 -2


@@ -19,9 +19,55 @@ base_model: Qwen/Qwen2-72B-Instruct
 model_name: MaziyarPanahi/Qwen2-72B-Instruct-v0.1
 ---
-# Qwen2-72B-Instruct-v0.1
-## Model Details
+<img src="./llama-3-merges.webp" alt="Llama-3 DPO Logo" width="500" style="margin-left: auto; margin-right: auto; display: block"/>
+# MaziyarPanahi/Qwen2-72B-Instruct-v0.1
 This is a fine-tuned version of the `Qwen/Qwen2-72B-Instruct` model. It aims to improve the base model across all benchmarks.
+# ⚡ Quantized GGUF
+All GGUF models are available here: [MaziyarPanahi/Qwen2-72B-Instruct-v0.1-GGUF](https://huggingface.co/MaziyarPanahi/Qwen2-72B-Instruct-v0.1-GGUF)
+# 🏆 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+coming soon!
+```
+|    Tasks     |Version|Filter|n-shot|Metric|Value |   |Stderr|
+|--------------|------:|------|-----:|------|-----:|---|-----:|
+|truthfulqa_mc2|      2|none  |     0|acc   |0.6761|±  |0.0148|
+|  Tasks   |Version|Filter|n-shot|Metric|Value |   |Stderr|
+|----------|------:|------|-----:|------|-----:|---|-----:|
+|winogrande|      1|none  |     5|acc   |0.8248|±  |0.0107|
+|    Tasks    |Version|Filter|n-shot| Metric |Value |   |Stderr|
+|-------------|------:|------|-----:|--------|-----:|---|-----:|
+|arc_challenge|      1|none  |    25|acc     |0.6852|±  |0.0136|
+|             |       |none  |    25|acc_norm|0.7184|±  |0.0131|
+|Tasks|Version|     Filter     |n-shot|  Metric   |Value |   |Stderr|
+|-----|------:|----------------|-----:|-----------|-----:|---|-----:|
+|gsm8k|      3|strict-match    |     5|exact_match|0.8582|±  |0.0096|
+|     |       |flexible-extract|     5|exact_match|0.8893|±  |0.0086|
+```
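
The result tables above share one column layout, and a row with an empty `Tasks` cell continues the task named in the row before it. A minimal sketch of reading them into records, assuming that layout holds for every table; `parse_results` and the `RESULTS` excerpt are illustrative names and are not part of the README:

```python
# Excerpt of the tables from the README diff, copied without the leading "+".
RESULTS = """
|    Tasks    |Version|Filter|n-shot| Metric |Value |   |Stderr|
|-------------|------:|------|-----:|--------|-----:|---|-----:|
|arc_challenge|      1|none  |    25|acc     |0.6852|±  |0.0136|
|             |       |none  |    25|acc_norm|0.7184|±  |0.0131|
|Tasks|Version|     Filter     |n-shot|  Metric   |Value |   |Stderr|
|-----|------:|----------------|-----:|-----------|-----:|---|-----:|
|gsm8k|      3|strict-match    |     5|exact_match|0.8582|±  |0.0096|
|     |       |flexible-extract|     5|exact_match|0.8893|±  |0.0086|
"""


def parse_results(table: str) -> list[dict]:
    """Turn pipe-delimited result rows into dicts (hypothetical helper)."""
    rows = []
    last_task = None
    for line in table.strip().splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Skip header rows and the dashed separator rows.
        if cells[0] == "Tasks" or cells[0].startswith("-"):
            continue
        # An empty task cell means "same task as the previous row".
        task = cells[0] or last_task
        last_task = task
        rows.append({
            "task": task,
            "filter": cells[2],
            "n_shot": int(cells[3]),
            "metric": cells[4],
            "value": float(cells[5]),
            "stderr": float(cells[7]),
        })
    return rows


rows = parse_results(RESULTS)
for row in rows:
    print(row["task"], row["metric"], row["filter"], row["value"])
```

Each record keeps the filter and the n-shot count next to the score, so the two `gsm8k` rows (strict-match and flexible-extract) stay distinguishable.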
+# Prompt Template
+This model uses the `ChatML` prompt template:
+```
+<|im_start|>system
+{System}
+<|im_end|>
+<|im_start|>user
+{User}
+<|im_end|>
+<|im_start|>assistant
+{Assistant}
+```
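
To make the template concrete, here is a minimal sketch that fills it for a single turn, ending at the assistant header so the model writes the reply. It follows the line layout shown in the README literally. `build_chatml_prompt` is a hypothetical helper name, and the system and user strings are placeholder inputs:

```python
def build_chatml_prompt(system: str, user: str) -> str:
    """Fill the ChatML template from the README (hypothetical helper).

    The prompt stops after the assistant header, so generation starts
    where {Assistant} sits in the template.
    """
    return (
        "<|im_start|>system\n"
        f"{system}\n"
        "<|im_end|>\n"
        "<|im_start|>user\n"
        f"{user}\n"
        "<|im_end|>\n"
        "<|im_start|>assistant\n"
    )


# Placeholder messages, chosen only for illustration.
prompt = build_chatml_prompt("You are a helpful assistant.", "What is 2 + 2?")
print(prompt)
```

In practice the tokenizer's own chat template (for example via `apply_chat_template` in `transformers`) is usually the safer way to build the prompt, since it encodes the exact whitespace the model was trained with; the string version here only shows the structure.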
+# How to use