Pierre-obi/llama-8B-Instruct-DPO
Tags: Text Generation, Transformers, Safetensors, argilla/distilabel-intel-orca-dpo-pairs, llama, conversational, Inference Endpoints, text-generation-inference
Model Description:
Llama Pro 8B Instruct, fine-tuned on the argilla/distilabel-intel-orca-dpo-pairs dataset.
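The model card gives no usage snippet, so the following is only a minimal sketch of how a checkpoint with these tags (Transformers, Safetensors, text generation) is typically loaded. It assumes the repository ships a standard tokenizer and causal-LM weights, and that `torch`, `transformers`, and `accelerate` (needed for `device_map="auto"`) are installed. The helper names `load_model` and `generate` are hypothetical, and the plain-string prompt is an assumption because the card does not document a prompt format.

```python
# Repository ID as shown on the model page.
MODEL_ID = "Pierre-obi/llama-8B-Instruct-DPO"


def load_model(model_id: str = MODEL_ID):
    """Load tokenizer and model in FP16 (hypothetical helper, not from the card)."""
    # Imported lazily so the module can be imported without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.float16,  # matches the FP16 tensor type listed on the page
        device_map="auto",          # assumption: accelerate is installed
    )
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Generate a completion for a plain-text prompt (prompt format is an assumption)."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the full checkpoint on first use):
# print(generate("Summarize what preference fine-tuning does in one sentence."))
```

Calling `generate` reloads the model each time. For repeated use, call `load_model` once and reuse the returned objects.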
Safetensors
Model size: 8.36B params
Tensor type: FP16
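As a rough guide to hardware requirements, the listed parameter count and tensor type imply a lower bound on the memory needed for the weights alone. The sketch below assumes 2 bytes per FP16 parameter and ignores activations, the KV cache, and framework overhead, so real usage will be higher. The function name `weight_memory_bytes` is hypothetical.

```python
# Figures taken from the model page: 8.36B parameters stored as FP16.
N_PARAMS = 8.36e9
BYTES_PER_PARAM_FP16 = 2  # FP16 uses 16 bits = 2 bytes per parameter


def weight_memory_bytes(n_params: float = N_PARAMS,
                        bytes_per_param: int = BYTES_PER_PARAM_FP16) -> float:
    """Return the memory needed for the weights alone (no activations or KV cache)."""
    return n_params * bytes_per_param


weights_gb = weight_memory_bytes() / 1e9      # decimal gigabytes
weights_gib = weight_memory_bytes() / 2**30   # binary gibibytes
print(f"Weights: about {weights_gb:.2f} GB ({weights_gib:.2f} GiB)")
```

By this estimate the weights occupy roughly 16.7 GB, so the model is unlikely to fit on a 16 GB GPU in FP16 without offloading or quantization.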
Dataset used to train Pierre-obi/llama-8B-Instruct-DPO: argilla/distilabel-intel-orca-dpo-pairs