Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO
/
zephyr-7b-gpo-v11-i1
like
0
Follow
DUAL Group
2
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
mit
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-7b-gpo-v11-i1
/
runs
Commit History
Model save
6b41881
verified
lole25
commited on
May 9
Training in progress, step 3700
d8ed3df
verified
lole25
commited on
May 9
Training in progress, step 3600
c58e0e2
verified
lole25
commited on
May 9
Training in progress, step 3500
9ff23ec
verified
lole25
commited on
May 9
Training in progress, step 3300
f3cc0d9
verified
lole25
commited on
May 9
Training in progress, step 3200
9604e12
verified
lole25
commited on
May 9
Training in progress, step 3100
4c5ab49
verified
lole25
commited on
May 9
Training in progress, step 2900
20f0c39
verified
lole25
commited on
May 9
Training in progress, step 2800
bbddcd0
verified
lole25
commited on
May 9
Training in progress, step 2700
6c1e85a
verified
lole25
commited on
May 9
Training in progress, step 2600
20c7641
verified
lole25
commited on
May 9
Training in progress, step 2500
e573417
verified
lole25
commited on
May 9
Training in progress, step 2300
aec2a41
verified
lole25
commited on
May 9
Training in progress, step 2100
5340c72
verified
lole25
commited on
May 9
Training in progress, step 1900
20067da
verified
lole25
commited on
May 9
Training in progress, step 1800
de01f51
verified
lole25
commited on
May 9
Training in progress, step 1300
7c08f3e
verified
lole25
commited on
May 9
Training in progress, step 1100
45019a5
verified
lole25
commited on
May 9
Training in progress, step 1000
5bef176
verified
lole25
commited on
May 9
Training in progress, step 900
b0a93e7
verified
lole25
commited on
May 9
Training in progress, step 800
9dc6789
verified
lole25
commited on
May 9
Training in progress, step 700
d1aa430
verified
lole25
commited on
May 9
Training in progress, step 500
f3011c4
verified
lole25
commited on
May 9
Training in progress, step 400
dcf1672
verified
lole25
commited on
May 9
Training in progress, step 200
594a867
verified
lole25
commited on
May 9
Training in progress, step 100
7fe3b41
verified
lole25
commited on
May 9