LBK95/Llama-2-7b-hf-DPO-Filtered-0.2-version-4
Tags: PEFT, Safetensors, trl, dpo, Generated from Trainer
License: llama2
Commit History
e9307df  End of training (verified; LBK95 committed on May 12)
3c6a40e  Training in progress, step 2163 (verified; LBK95 committed on May 12)
c428e49  Training in progress, step 1442 (verified; LBK95 committed on May 12)
e1a4236  Training in progress, step 721 (verified; LBK95 committed on May 12)
16bd7c8  initial commit (verified; LBK95 committed on May 12)