Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
LBK95
/
Llama-2-7b-hf-eval_threapist-DPO-filtered-version-1
like
0
PEFT
Safetensors
trl
dpo
Generated from Trainer
License:
llama2
Model card
Files
Files and versions
Community
Use this model
dd03769
Llama-2-7b-hf-eval_threapist-DPO-filtered-version-1
/
adapter_model.safetensors
Commit History
Training in progress, step 165
dd03769
verified
LBK95
commited on
May 5, 2024
Training in progress, step 110
9be2ec9
verified
LBK95
commited on
May 5, 2024
Training in progress, step 55
b6242bf
verified
LBK95
commited on
May 5, 2024