DUAL-GPO/zephyr-7b-gpo-v2-i3
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License: apache-2.0
Commit History
Model save (4f3d3bd, verified): lole25 committed on May 13
Training in progress, step 3600 (36aec53, verified): lole25 committed on May 13
Training in progress, step 3500 (0d4d272, verified): lole25 committed on May 13
Training in progress, step 3300 (4edbb66, verified): lole25 committed on May 13
Training in progress, step 3100 (9e5c3df, verified): lole25 committed on May 13
Training in progress, step 3000 (72f7c32, verified): lole25 committed on May 13
Training in progress, step 2900 (7f8c970, verified): lole25 committed on May 13
Training in progress, step 2800 (8e05290, verified): lole25 committed on May 13
Training in progress, step 2700 (07c1a49, verified): lole25 committed on May 13
Training in progress, step 2500 (f7c3450, verified): lole25 committed on May 13
Training in progress, step 2200 (ed03006, verified): lole25 committed on May 13
Training in progress, step 1600 (61cc7ca, verified): lole25 committed on May 13
Training in progress, step 1500 (200e3f0, verified): lole25 committed on May 13
Training in progress, step 1400 (d20a95a, verified): lole25 committed on May 13
Training in progress, step 1300 (7b941ab, verified): lole25 committed on May 13
Training in progress, step 1200 (b15a4f4, verified): lole25 committed on May 13
Training in progress, step 1100 (9dfd602, verified): lole25 committed on May 13
Training in progress, step 900 (95a8e14, verified): lole25 committed on May 13
Training in progress, step 800 (6140743, verified): lole25 committed on May 13
Training in progress, step 700 (b5d8d91, verified): lole25 committed on May 13
Training in progress, step 500 (9c76518, verified): lole25 committed on May 13
Training in progress, step 400 (5277c01, verified): lole25 committed on May 13
Training in progress, step 200 (00f26eb, verified): lole25 committed on May 13
Training in progress, step 100 (e9c5a4f, verified): lole25 committed on May 13
initial commit (dd85215, verified): lole25 committed on May 13