Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
dball
/
zephyr-7b-dpo-qlora
like
0
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
2
Train
Use this model
f56b060
zephyr-7b-dpo-qlora
Commit History
Training in progress, step 2400
50a57eb
verified
dball
commited on
Jan 23
Training in progress, step 2300
70201e5
verified
dball
commited on
Jan 23
Training in progress, step 2200
74debbf
verified
dball
commited on
Jan 23
Training in progress, step 2100
8f371f5
verified
dball
commited on
Jan 23
Training in progress, step 2000
1b33f90
verified
dball
commited on
Jan 23
Training in progress, step 1800
019840b
verified
dball
commited on
Jan 23
Training in progress, step 1600
4fcfab7
verified
dball
commited on
Jan 23
Training in progress, step 1400
f2966cf
verified
dball
commited on
Jan 23
Training in progress, step 1300
42251be
verified
dball
commited on
Jan 23
Training in progress, step 1200
d4a1fc9
verified
dball
commited on
Jan 23
Training in progress, step 900
8799f0a
verified
dball
commited on
Jan 23
Training in progress, step 700
5858fc3
verified
dball
commited on
Jan 23
Training in progress, step 600
e2fa348
verified
dball
commited on
Jan 23
Training in progress, step 500
9b0f584
verified
dball
commited on
Jan 23
Training in progress, step 400
0c99ab6
verified
dball
commited on
Jan 23
Training in progress, step 300
dc31bf4
verified
dball
commited on
Jan 23
Training in progress, step 200
cf81ac7
verified
dball
commited on
Jan 23
initial commit
509eca3
verified
dball
commited on
Jan 23
Previous
1
2
Next