Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
thorirhrafn
/
llama_DPO_model_e2
like
0
PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
License:
llama2
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
10094a9
llama_DPO_model_e2
/
README.md
Commit History
End of training
10094a9
verified
thorirhrafn
commited on
May 13
End of training
5789437
verified
thorirhrafn
commited on
May 13
End of training
210bcff
verified
thorirhrafn
commited on
May 12