DUAL-GPO/zephyr-7b-gpo-v4-i2
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
Commit History
End of training
1bac891
verified
lole25
committed on
May 12
Model save
bc4c4d5
verified
lole25
committed on
May 12
Training in progress, step 5100
332bd6c
verified
lole25
committed on
May 12
Training in progress, step 5000
d2eed83
verified
lole25
committed on
May 12
Training in progress, step 4900
a0d02d1
verified
lole25
committed on
May 12
Training in progress, step 4800
2f70445
verified
lole25
committed on
May 12
Training in progress, step 4700
440cfd6
verified
lole25
committed on
May 12
Training in progress, step 4600
4cc7d3b
verified
lole25
committed on
May 12
Training in progress, step 4500
b0d58af
verified
lole25
committed on
May 12
Training in progress, step 4400
ea1239f
verified
lole25
committed on
May 12
Training in progress, step 4300
36f3e44
verified
lole25
committed on
May 12
Training in progress, step 4100
2a23286
verified
lole25
committed on
May 12
Training in progress, step 4000
073d4e1
verified
lole25
committed on
May 12
Training in progress, step 3900
0ba11b0
verified
lole25
committed on
May 12
Training in progress, step 3700
2521b1f
verified
lole25
committed on
May 12
Training in progress, step 3600
4d752a7
verified
lole25
committed on
May 12
Training in progress, step 3500
2bacbad
verified
lole25
committed on
May 12
Training in progress, step 3300
30bf1cd
verified
lole25
committed on
May 12
Training in progress, step 3100
d8df357
verified
lole25
committed on
May 12
Training in progress, step 3000
c36d591
verified
lole25
committed on
May 12
Training in progress, step 2900
c8ff9ca
verified
lole25
committed on
May 12
Training in progress, step 2800
b2928b5
verified
lole25
committed on
May 12
Training in progress, step 2500
f9ee80a
verified
lole25
committed on
May 12
Training in progress, step 2400
4aedc37
verified
lole25
committed on
May 12
Training in progress, step 2200
2c78bd3
verified
lole25
committed on
May 12
Training in progress, step 2000
df1fb58
verified
lole25
committed on
May 12
Training in progress, step 1900
75c82fe
verified
lole25
committed on
May 12
Training in progress, step 1800
e5757fc
verified
lole25
committed on
May 12
Training in progress, step 1700
2990593
verified
lole25
committed on
May 12
Training in progress, step 1600
0197f28
verified
lole25
committed on
May 12
Training in progress, step 1500
d206cb0
verified
lole25
committed on
May 12
Training in progress, step 1300
a628b4e
verified
lole25
committed on
May 12
Training in progress, step 1200
384de42
verified
lole25
committed on
May 12
Training in progress, step 1100
6a2e8ea
verified
lole25
committed on
May 12
Training in progress, step 1000
d9a7ad7
verified
lole25
committed on
May 12
Training in progress, step 900
c2cc821
verified
lole25
committed on
May 12
Training in progress, step 800
b73bbcd
verified
lole25
committed on
May 12
Training in progress, step 700
cd94a81
verified
lole25
committed on
May 12
Training in progress, step 600
6232edb
verified
lole25
committed on
May 12
Training in progress, step 500
c46572f
verified
lole25
committed on
May 12
Training in progress, step 400
ff49111
verified
lole25
committed on
May 12
Training in progress, step 300
0b30fc8
verified
lole25
committed on
May 12
Training in progress, step 100
30da921
verified
lole25
committed on
May 12
initial commit
170758d
verified
lole25
committed on
May 12