Edit model card

A finetuning experiment on llama3 8b it with selected 5k examples from argilla dpo 7k

Downloads last month
1

Adapter for

Dataset used to train eren23/DPOMixLLama-3-8B-lora