DamiFass's picture
Update README.md
a7ce2f3 verified
metadata
base_model:
  - meta-llama/Llama-3.2-1B

Model Description

This is the meta-llama/Llama-3.2-1B base model fine tuned on the mlabonne/orpo-dpo-mix-40k dataset.

Evaluation Results

We used lm-evalutation-harness from EleutherAI to evaluate this fine-tuned version of meta-llama/Llama-3.2-1B on the 'Hellaswag' benchmark.

Hellaswag

Tasks Version Filter n-shot Metric Value Stderr
hellaswag 1 none 0 acc 0.4773 ± 0.0050
none 0 acc_norm 0.6358 ± 0.0048