MaziyarPanahi
/

calme-3.1-instruct-78b

Text Generation

text-generation-inference

Model card Files Files and versions Community

MaziyarPanahi commited on Nov 28, 2024

Commit

6d91f5f

·

verified ·

1 Parent(s): 95b6eed

Update README.md

Files changed (1) hide show

README.md +12 -15

README.md CHANGED Viewed

@@ -127,9 +127,19 @@ This model is an advanced iteration of the powerful `Qwen/Qwen2.5-72B`, specific
 Thanks to `mradermacher`: [calme-3.1-instruct-78b-GGUF](https://huggingface.co/mradermacher/calme-3.1-instruct-78b-GGUF)
-# 🏆 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
-Leaderboard 2 coming soon!
 # Prompt Template
@@ -173,16 +183,3 @@ model = AutoModelForCausalLM.from_pretrained("MaziyarPanahi/calme-3.1-instruct-7
 As with any large language model, users should be aware of potential biases and limitations. We recommend implementing appropriate safeguards and human oversight when deploying this model in production environments.
-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
-Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__calme-3.1-instruct-78b)
-|      Metric       |Value|
-|-------------------|----:|
-|Avg.               |51.20|
-|IFEval (0-Shot)    |81.36|
-|BBH (3-Shot)       |62.41|
-|MATH Lvl 5 (4-Shot)|38.75|
-|GPQA (0-shot)      |19.46|
-|MuSR (0-shot)      |36.50|
-|MMLU-PRO (5-shot)  |68.72|

 Thanks to `mradermacher`: [calme-3.1-instruct-78b-GGUF](https://huggingface.co/mradermacher/calme-3.1-instruct-78b-GGUF)
+# 🏆 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__calme-3.1-instruct-78b)
+|      Metric       |Value|
+|-------------------|----:|
+|Avg.               |51.20|
+|IFEval (0-Shot)    |81.36|
+|BBH (3-Shot)       |62.41|
+|MATH Lvl 5 (4-Shot)|38.75|
+|GPQA (0-shot)      |19.46|
+|MuSR (0-shot)      |36.50|
+|MMLU-PRO (5-shot)  |68.72|
 # Prompt Template
 As with any large language model, users should be aware of potential biases and limitations. We recommend implementing appropriate safeguards and human oversight when deploying this model in production environments.