Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
statking
/
Meta-Llama-3-8B-Instruct-DPO-QLoRA
like
0
PEFT
Safetensors
HuggingFaceH4/ultrafeedback_binarized
llama
alignment-handbook
trl
dpo
Generated from Trainer
Model card
Files
Files and versions
Community
Train
Use this model
0083b65
Meta-Llama-3-8B-Instruct-DPO-QLoRA
Commit History
Model save
0083b65
verified
statking
commited on
May 21
Training in progress, step 1900
d6b856c
verified
statking
commited on
May 21
Training in progress, step 1800
2b29856
verified
statking
commited on
May 21
Training in progress, step 1700
f6dc61e
verified
statking
commited on
May 21
Training in progress, step 1600
6f02370
verified
statking
commited on
May 21
Training in progress, step 1500
59d6762
verified
statking
commited on
May 21
Training in progress, step 1400
6013750
verified
statking
commited on
May 21
Training in progress, step 1300
60615f7
verified
statking
commited on
May 21
Training in progress, step 1200
b275e27
verified
statking
commited on
May 21
Training in progress, step 1100
2662f21
verified
statking
commited on
May 21
Training in progress, step 1000
4c44f70
verified
statking
commited on
May 21
Training in progress, step 900
6a20e28
verified
statking
commited on
May 21
Training in progress, step 800
e526da2
verified
statking
commited on
May 21
Training in progress, step 700
5e99b1d
verified
statking
commited on
May 21
Training in progress, step 600
06fcb6c
verified
statking
commited on
May 21
Training in progress, step 500
a29d98f
verified
statking
commited on
May 21
Training in progress, step 400
ebd523e
verified
statking
commited on
May 21
Training in progress, step 300
932179f
verified
statking
commited on
May 21
Training in progress, step 200
4cb86d8
verified
statking
commited on
May 21
Training in progress, step 100
fdbeed8
verified
statking
commited on
May 21
initial commit
67a67d6
verified
statking
commited on
May 21