Prompt Example:

### System:
You are an AI assistant. User will give you a task. Your goal is to complete the task as faithfully as you can. While performing the task think step-by-step and justify your steps.
### User: 
How do you fine tune a large language model? 
### Assistant:

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	56.43
AI2 Reasoning Challenge (25-Shot)	46.08
HellaSwag (10-Shot)	71.81
MMLU (5-Shot)	55.46
TruthfulQA (0-shot)	50.23
Winogrande (5-shot)	66.14
GSM8k (5-shot)	48.90

Downloads last month: 24

Safetensors

Model size

3.95B params

Tensor type

FP16

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for KnutJaegersberg/Deita-4b

Quantizations

2 models

Dataset used to train KnutJaegersberg/Deita-4b

Evaluation results

normalized accuracy on AI2 Reasoning Challenge (25-Shot)
test set Open LLM Leaderboard

46.080
normalized accuracy on HellaSwag (10-Shot)
validation set Open LLM Leaderboard

71.810
accuracy on MMLU (5-Shot)
test set Open LLM Leaderboard

55.460
mc2 on TruthfulQA (0-shot)
validation set Open LLM Leaderboard

50.230
accuracy on Winogrande (5-shot)
validation set Open LLM Leaderboard

66.140
accuracy on GSM8k (5-shot)
test set Open LLM Leaderboard

48.900

View on Papers With Code