DUAL-GPO
/
zephyr-7b-gpo-update4-i0
DUAL Group
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
apache-2.0
9153d7d
zephyr-7b-gpo-update4-i0
/
adapter_model.safetensors
Commit History
Model save (c899517, verified), lole25 committed on Apr 6
Training in progress, step 15200 (c2127a1, verified), lole25 committed on Apr 6
Training in progress, step 15100 (e13fecf, verified), lole25 committed on Apr 6
Training in progress, step 15000 (052c869, verified), lole25 committed on Apr 6
Training in progress, step 14900 (ca42763, verified), lole25 committed on Apr 6
Training in progress, step 14800 (1af075f, verified), lole25 committed on Apr 6
Training in progress, step 14700 (15f0b14, verified), lole25 committed on Apr 6
Training in progress, step 14600 (28ced70, verified), lole25 committed on Apr 6
Training in progress, step 14500 (3c16455, verified), lole25 committed on Apr 6
Training in progress, step 14300 (be44558, verified), lole25 committed on Apr 6
Training in progress, step 14200 (1f25ecf, verified), lole25 committed on Apr 6
Training in progress, step 14100 (63435b1, verified), lole25 committed on Apr 6
Training in progress, step 14000 (221bc51, verified), lole25 committed on Apr 6
Training in progress, step 13900 (edd298a, verified), lole25 committed on Apr 6
Training in progress, step 13800 (319cde0, verified), lole25 committed on Apr 6
Training in progress, step 13700 (7b3b5ee, verified), lole25 committed on Apr 6
Training in progress, step 13600 (85c3807, verified), lole25 committed on Apr 6
Training in progress, step 13500 (65e5f9b, verified), lole25 committed on Apr 6
Training in progress, step 13300 (fc9a6dc, verified), lole25 committed on Apr 6
Training in progress, step 13200 (56f6797, verified), lole25 committed on Apr 6
Training in progress, step 13100 (2993ad5, verified), lole25 committed on Apr 6
Training in progress, step 13000 (5a5526f, verified), lole25 committed on Apr 6
Training in progress, step 12900 (bb7bcde, verified), lole25 committed on Apr 6
Training in progress, step 12800 (36330b3, verified), lole25 committed on Apr 6
Training in progress, step 12700 (7e1873d, verified), lole25 committed on Apr 6
Training in progress, step 12600 (d46a956, verified), lole25 committed on Apr 6
Training in progress, step 12400 (0ee537f, verified), lole25 committed on Apr 6
Training in progress, step 12300 (3c93069, verified), lole25 committed on Apr 6
Training in progress, step 12200 (3dc87d1, verified), lole25 committed on Apr 6
Training in progress, step 12100 (8e4aafb, verified), lole25 committed on Apr 6
Training in progress, step 12000 (f4b5579, verified), lole25 committed on Apr 6
Training in progress, step 11900 (e3071f5, verified), lole25 committed on Apr 6
Training in progress, step 11800 (2257d32, verified), lole25 committed on Apr 6
Training in progress, step 11700 (0e46000, verified), lole25 committed on Apr 6
Training in progress, step 11600 (f063598, verified), lole25 committed on Apr 6
Training in progress, step 11500 (846d24a, verified), lole25 committed on Apr 6
Training in progress, step 11400 (5b788da, verified), lole25 committed on Apr 6
Training in progress, step 11300 (f9af038, verified), lole25 committed on Apr 6
Training in progress, step 11200 (7b5df10, verified), lole25 committed on Apr 6
Training in progress, step 11100 (8fbee00, verified), lole25 committed on Apr 6
Training in progress, step 11000 (4e102ec, verified), lole25 committed on Apr 6
Training in progress, step 10900 (eb3e4da, verified), lole25 committed on Apr 6
Training in progress, step 10800 (2c64728, verified), lole25 committed on Apr 6
Training in progress, step 10700 (7fb73af, verified), lole25 committed on Apr 6
Training in progress, step 10600 (f597456, verified), lole25 committed on Apr 6
Training in progress, step 10500 (57e0e86, verified), lole25 committed on Apr 6
Training in progress, step 10400 (1aefaaf, verified), lole25 committed on Apr 6
Training in progress, step 10300 (23b1736, verified), lole25 committed on Apr 5
Training in progress, step 10200 (f6a8518, verified), lole25 committed on Apr 5
Training in progress, step 10100 (f1b0914, verified), lole25 committed on Apr 5