This Mistral model was trained 2x faster with Unsloth and Hugging Face's TRL library.
The model uses the Alpaca prompt format:
Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
### Instruction:
{}
### Input:
{}
### Response:
{}
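The three `{}` placeholders are filled, in order, with the instruction, the input, and the response (left empty at inference time so the model generates it). Below is a minimal sketch of building such a prompt in Python. The names `ALPACA_TEMPLATE` and `build_prompt` are hypothetical, and the blank lines between sections are an assumption based on the common Alpaca layout; match whatever whitespace was used during training.

```python
# Hypothetical helper for filling the Alpaca-style template shown above.
# Assumption: sections are separated by blank lines, as in the common Alpaca layout.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{}\n\n"
    "### Input:\n{}\n\n"
    "### Response:\n{}"
)


def build_prompt(instruction: str, input_text: str = "", response: str = "") -> str:
    """Fill the template; leave `response` empty when prompting for generation."""
    return ALPACA_TEMPLATE.format(instruction, input_text, response)


prompt = build_prompt(
    "Summarize the text in one sentence.",
    "Unsloth is a library for faster fine-tuning.",
)
print(prompt)
```

The resulting string ends right after `### Response:`, so whatever the model generates next is its answer.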
Detailed results can be found here
| Metric | Value |
|---|---|
| Avg. | 71.81 |
| AI2 Reasoning Challenge (25-shot) | 64.85 |
| HellaSwag (10-shot) | 85.50 |
| MMLU (5-shot) | 63.93 |
| TruthfulQA (0-shot) | 63.52 |
| Winogrande (5-shot) | 84.06 |
| GSM8k (5-shot) | 68.99 |
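
The "Avg." row appears to be the unweighted mean of the six benchmark scores. As a quick sanity check under that assumption, this sketch recomputes it from the values in the table (the `scores` dictionary is illustrative, not part of any published tooling):

```python
# Benchmark scores copied from the table above.
scores = {
    "AI2 Reasoning Challenge": 64.85,
    "HellaSwag": 85.50,
    "MMLU": 63.93,
    "TruthfulQA": 63.52,
    "Winogrande": 84.06,
    "GSM8k": 68.99,
}

# Assumption: the reported average is a plain arithmetic mean.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 71.81
```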