README.md · DamiFass/llama3.2-1B-finetuned-on-mlabonne at main

metadata

base_model:
  - meta-llama/Llama-3.2-1B

Model Description

This is the meta-llama/Llama-3.2-1B base model fine-tuned on the mlabonne/orpo-dpo-mix-40k dataset.

We used EleutherAI's lm-evaluation-harness to evaluate this fine-tuned version of meta-llama/Llama-3.2-1B on the HellaSwag benchmark (zero-shot).
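A minimal sketch of how such an evaluation might be reproduced through the harness's Python API. The repository ID is taken from this page's title and the zero-shot setting from the n-shot column of the results. The `hf` backend, the batch size, and the helper names `build_model_args` and `run_hellaswag` are assumptions, since the exact invocation used is not recorded here.

```python
# Assumed repository ID, taken from the page title of this model card.
REPO_ID = "DamiFass/llama3.2-1B-finetuned-on-mlabonne"


def build_model_args(repo_id: str) -> str:
    """Build the model_args string expected by the harness's `hf` backend."""
    return f"pretrained={repo_id}"


def run_hellaswag(repo_id: str = REPO_ID, batch_size: int = 8) -> dict:
    """Run zero-shot HellaSwag and return the task's metric dictionary.

    Hypothetical helper: downloads the model and dataset, so it needs
    network access and is best run on a GPU.
    """
    # Imported lazily so the module loads without the harness installed
    # (install with: pip install lm-eval).
    import lm_eval

    results = lm_eval.simple_evaluate(
        model="hf",                      # Hugging Face transformers backend (assumption)
        model_args=build_model_args(repo_id),
        tasks=["hellaswag"],
        num_fewshot=0,                   # matches the n-shot column of the results
        batch_size=batch_size,           # assumption; tune to available memory
    )
    return results["results"]["hellaswag"]
```

Calling `run_hellaswag()` should return a dictionary containing the accuracy and normalized-accuracy metrics with their standard errors; the exact key names depend on the installed harness version.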

|  Tasks  |Version|Filter|n-shot| Metric |   |Value |   |Stderr|
|---------|------:|------|-----:|--------|---|-----:|---|-----:|
|hellaswag|      1|none  |     0|acc     |↑  |0.4773|±  |0.0050|
|         |       |none  |     0|acc_norm|↑  |0.6358|±  |0.0048|
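To help read the Stderr column, the sketch below turns each reported value into an approximate 95% confidence interval and checks that the reported standard errors are consistent with a binomial estimate. It assumes the evaluation used the full HellaSwag validation split of 10,042 examples and a normal approximation; neither is stated in this card, and the function names are illustrative.

```python
import math

# Assumption: evaluation ran on the full HellaSwag validation split.
HELLASWAG_VAL_SIZE = 10042


def binomial_stderr(p: float, n: int) -> float:
    """Standard error of an accuracy p measured over n independent examples."""
    return math.sqrt(p * (1.0 - p) / n)


def confidence_interval(value: float, stderr: float, z: float = 1.96) -> tuple:
    """Approximate 95% interval under a normal approximation (z = 1.96)."""
    return value - z * stderr, value + z * stderr


# (metric name, reported value, reported stderr) copied from the results table.
reported = [("acc", 0.4773, 0.0050), ("acc_norm", 0.6358, 0.0048)]

for name, value, stderr in reported:
    low, high = confidence_interval(value, stderr)
    estimate = binomial_stderr(value, HELLASWAG_VAL_SIZE)
    print(f"{name}: {value:.4f}, 95% CI [{low:.4f}, {high:.4f}], "
          f"binomial stderr estimate {estimate:.4f}")
```

Under these assumptions the binomial estimates round to the reported standard errors, which suggests the uncertainty in the table comes from the finite size of the benchmark rather than from repeated runs.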