Edit model card

A finetuning experiment on llama3 8b it with selected 5k examples from argilla dpo 7k

Downloads last month
1
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Adapter for

Dataset used to train eren23/DPOMixLLama-3-8B-lora