Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
eren23
/
DPOMixLLama-3-8B-lora
like
0
Text Generation
PEFT
Safetensors
argilla/dpo-mix-7k
English
llama
orpo
llama3
text-generation-inference
conversational
License:
other
Model card
Files
Files and versions
Community
Deploy
Use this model
Edit model card
A finetuning experiment on llama3 8b it with selected 5k examples from argilla dpo 7k
Downloads last month
1
Adapter for
meta-llama/Meta-Llama-3-8B-Instruct
Dataset used to train
eren23/DPOMixLLama-3-8B-lora
argilla/dpo-mix-7k
Viewer
•
Updated
Mar 4
•
12.7k
•
120