---
license: apache-2.0
datasets:
- Open-Orca/SlimOrca
base_model: mistralai/Mixtral-8x7B-v0.1
pipeline_tag: text-generation
model-index:
- name: Mixtral-SlimOrca-8x7B
results:
- task:
type: text-generation
name: Text Generation
dataset:
name: AI2 Reasoning Challenge (25-Shot)
type: ai2_arc
config: ARC-Challenge
split: test
args:
num_few_shot: 25
metrics:
- type: acc_norm
value: 67.66
name: normalized accuracy
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Open-Orca/Mixtral-SlimOrca-8x7B
name: Open LLM Leaderboard
- task:
type: text-generation
name: Text Generation
dataset:
name: HellaSwag (10-Shot)
type: hellaswag
split: validation
args:
num_few_shot: 10
metrics:
- type: acc_norm
value: 85.11
name: normalized accuracy
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Open-Orca/Mixtral-SlimOrca-8x7B
name: Open LLM Leaderboard
- task:
type: text-generation
name: Text Generation
dataset:
name: MMLU (5-Shot)
type: cais/mmlu
config: all
split: test
args:
num_few_shot: 5
metrics:
- type: acc
value: 67.98
name: accuracy
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Open-Orca/Mixtral-SlimOrca-8x7B
name: Open LLM Leaderboard
- task:
type: text-generation
name: Text Generation
dataset:
name: TruthfulQA (0-shot)
type: truthful_qa
config: multiple_choice
split: validation
args:
num_few_shot: 0
metrics:
- type: mc2
value: 54.98
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Open-Orca/Mixtral-SlimOrca-8x7B
name: Open LLM Leaderboard
- task:
type: text-generation
name: Text Generation
dataset:
name: Winogrande (5-shot)
type: winogrande
config: winogrande_xl
split: validation
args:
num_few_shot: 5
metrics:
- type: acc
value: 80.51
name: accuracy
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Open-Orca/Mixtral-SlimOrca-8x7B
name: Open LLM Leaderboard
- task:
type: text-generation
name: Text Generation
dataset:
name: GSM8k (5-shot)
type: gsm8k
config: main
split: test
args:
num_few_shot: 5
metrics:
- type: acc
value: 45.56
name: accuracy
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Open-Orca/Mixtral-SlimOrca-8x7B
name: Open LLM Leaderboard
---
# SlimOrca Mixtral 8x7B
[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
![OpenOrca Logo](https://huggingface.co/openaccess-ai-collective/slimorca-mixstral-8x7b/resolve/main/assets/mixtral-slimorca.png "MixtralSlimOrca Logo")
Official release of the SlimOrca Mixtral finetune. More details to come.
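## Example Usage
Pending fuller documentation, the checkpoint can be loaded like any other causal-LM repo on the Hub. The snippet below is a minimal sketch, assuming the repository ID `Open-Orca/Mixtral-SlimOrca-8x7B` referenced by the leaderboard links in this card; adjust the ID, prompt, and generation settings as needed.

```python
# Minimal inference sketch with Hugging Face transformers.
# Assumption: the checkpoint is hosted at Open-Orca/Mixtral-SlimOrca-8x7B
# (the repo referenced by the leaderboard links above).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Open-Orca/Mixtral-SlimOrca-8x7B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",   # requires `accelerate`; shards the MoE across available GPUs
    torch_dtype="auto",  # load in the dtype stored in the checkpoint
)

prompt = "Explain the mixture-of-experts architecture in one paragraph."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```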
## Model Details
### Model Description
- **Developed by:** OpenAccess AI Collective and OpenOrca
- **Finetuned from model:** mistralai/Mixtral-8x7B-v0.1
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Open-Orca__Mixtral-SlimOrca-8x7B).
| Metric |Value|
|---------------------------------|----:|
|Avg. |66.97|
|AI2 Reasoning Challenge (25-Shot)|67.66|
|HellaSwag (10-Shot) |85.11|
|MMLU (5-Shot) |67.98|
|TruthfulQA (0-shot) |54.98|
|Winogrande (5-shot) |80.51|
|GSM8k (5-shot) |45.56|
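The Avg. row appears to be the unweighted arithmetic mean of the six benchmark scores; a quick check:

```python
# Verify the leaderboard average (assumption: simple unweighted mean of the six scores).
scores = [67.66, 85.11, 67.98, 54.98, 80.51, 45.56]
print(round(sum(scores) / len(scores), 2))  # 66.97
```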