Files changed (1)
  1. README.md +113 -5
README.md CHANGED
@@ -1,15 +1,110 @@
 ---
-license: apache-2.0
 language:
 - en
-base_model:
-- mistralai/Mistral-7B-v0.3
-pipeline_tag: text-classification
+license: apache-2.0
 library_name: transformers
 tags:
 - moe
 - mergekit
 - MoErges
+base_model:
+- mistralai/Mistral-7B-v0.3
+pipeline_tag: text-classification
+model-index:
+- name: MistralBase-4x7B-MoE-ECE-PRYMMAL-Martial
+  results:
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: IFEval (0-Shot)
+      type: HuggingFaceH4/ifeval
+      args:
+        num_few_shot: 0
+    metrics:
+    - type: inst_level_strict_acc and prompt_level_strict_acc
+      value: 16.97
+      name: strict accuracy
+    source:
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Marsouuu/MistralBase-4x7B-MoE-ECE-PRYMMAL-Martial
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: BBH (3-Shot)
+      type: BBH
+      args:
+        num_few_shot: 3
+    metrics:
+    - type: acc_norm
+      value: 8.87
+      name: normalized accuracy
+    source:
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Marsouuu/MistralBase-4x7B-MoE-ECE-PRYMMAL-Martial
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: MATH Lvl 5 (4-Shot)
+      type: hendrycks/competition_math
+      args:
+        num_few_shot: 4
+    metrics:
+    - type: exact_match
+      value: 0.3
+      name: exact match
+    source:
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Marsouuu/MistralBase-4x7B-MoE-ECE-PRYMMAL-Martial
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: GPQA (0-shot)
+      type: Idavidrein/gpqa
+      args:
+        num_few_shot: 0
+    metrics:
+    - type: acc_norm
+      value: 1.23
+      name: acc_norm
+    source:
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Marsouuu/MistralBase-4x7B-MoE-ECE-PRYMMAL-Martial
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: MuSR (0-shot)
+      type: TAUR-Lab/MuSR
+      args:
+        num_few_shot: 0
+    metrics:
+    - type: acc_norm
+      value: 7.85
+      name: acc_norm
+    source:
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Marsouuu/MistralBase-4x7B-MoE-ECE-PRYMMAL-Martial
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: MMLU-PRO (5-shot)
+      type: TIGER-Lab/MMLU-Pro
+      config: main
+      split: test
+      args:
+        num_few_shot: 5
+    metrics:
+    - type: acc
+      value: 4.21
+      name: accuracy
+    source:
+      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Marsouuu/MistralBase-4x7B-MoE-ECE-PRYMMAL-Martial
+      name: Open LLM Leaderboard
 ---
 Model Name: Marsouuu/MistralBase-4x7B-MoE-ECE-PRYMMAL-Martial - Mixture of Experts (MoE)
 
@@ -47,4 +142,17 @@ This model can be used for a wide range of applications:
 Limitations
 
 • The model may occasionally generate responses that are not entirely contextually appropriate, especially in cases requiring highly specialized domain knowledge.
-• Despite its 24-bit precision, it may not perform well with extremely large datasets or tasks that require higher precision levels.
+• Despite its 24-bit precision, it may not perform well with extremely large datasets or tasks that require higher precision levels.
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Marsouuu__MistralBase-4x7B-MoE-ECE-PRYMMAL-Martial)
+
+| Metric |Value|
+|-------------------|----:|
+|Avg. | 6.57|
+|IFEval (0-Shot) |16.97|
+|BBH (3-Shot) | 8.87|
+|MATH Lvl 5 (4-Shot)| 0.30|
+|GPQA (0-shot) | 1.23|
+|MuSR (0-shot) | 7.85|
+|MMLU-PRO (5-shot) | 4.21|
+
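As a sanity check on the numbers added in this diff: the leaderboard's "Avg." row is simply the unweighted mean of the six benchmark scores in the `model-index` metadata. A minimal sketch (stdlib only; the dictionary below just restates the scores from the diff, it is not part of any API):

```python
from statistics import mean

# Benchmark scores as added to the model-index metadata in this diff.
scores = {
    "IFEval (0-Shot)": 16.97,
    "BBH (3-Shot)": 8.87,
    "MATH Lvl 5 (4-Shot)": 0.3,
    "GPQA (0-shot)": 1.23,
    "MuSR (0-shot)": 7.85,
    "MMLU-PRO (5-shot)": 4.21,
}

# The leaderboard "Avg." is the unweighted mean of the six scores.
avg = round(mean(scores.values()), 2)
print(avg)  # 6.57, matching the "Avg." row in the table above
```

This confirms the 6.57 average in the added table is internally consistent with the six per-benchmark values.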