Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO
/
zephyr-7b-gpo-new-v1-i0
like
0
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
449b1a8
zephyr-7b-gpo-new-v1-i0
/
runs
/
May11_08-07-40_gpu4-119-5
Commit History
Training in progress, step 6300
55b3a63
verified
lole25
commited on
May 11
Training in progress, step 6000
1004318
verified
lole25
commited on
May 11
Training in progress, step 5700
4b182a4
verified
lole25
commited on
May 11
Training in progress, step 5400
798a1bf
verified
lole25
commited on
May 11
Training in progress, step 5200
07e4000
verified
lole25
commited on
May 11
Training in progress, step 4900
d9c3793
verified
lole25
commited on
May 11
Training in progress, step 4800
254da6c
verified
lole25
commited on
May 11
Training in progress, step 4700
40de169
verified
lole25
commited on
May 11
Training in progress, step 4500
13e4314
verified
lole25
commited on
May 11
Training in progress, step 4200
6995921
verified
lole25
commited on
May 11
Training in progress, step 3800
e371913
verified
lole25
commited on
May 11
Training in progress, step 3600
634b11b
verified
lole25
commited on
May 11
Training in progress, step 3500
d867ab9
verified
lole25
commited on
May 11
Training in progress, step 3200
6209518
verified
lole25
commited on
May 11
Training in progress, step 2800
8622f1a
verified
lole25
commited on
May 11
Training in progress, step 2700
7538c96
verified
lole25
commited on
May 11
Training in progress, step 2400
eab2a27
verified
lole25
commited on
May 11
Training in progress, step 2300
e1eaf53
verified
lole25
commited on
May 11
Training in progress, step 2200
22aa3e1
verified
lole25
commited on
May 11
Training in progress, step 2100
09c3228
verified
lole25
commited on
May 11
Training in progress, step 2000
3a36fd8
verified
lole25
commited on
May 11
Training in progress, step 1600
9931048
verified
lole25
commited on
May 11
Training in progress, step 1400
048fc7a
verified
lole25
commited on
May 10
Training in progress, step 1300
bd19633
verified
lole25
commited on
May 10
Training in progress, step 1100
b757876
verified
lole25
commited on
May 10