Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO
/
phi-2-gpo-renew2-b0.001-0.5ultrafeedback-rank256-i1
like
0
Follow
DUAL Group
2
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
phi
alignment-handbook
Generated from Trainer
trl
dpo
License:
mit
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
main
phi-2-gpo-renew2-b0.001-0.5ultrafeedback-rank256-i1
/
adapter_model.safetensors
Commit History
Model save
e460f00
verified
BraylonDash
commited on
May 6
Training in progress, step 1900
418c086
verified
BraylonDash
commited on
May 6
Training in progress, step 1800
e325b3e
verified
BraylonDash
commited on
May 6
Training in progress, step 1700
7ac2ce3
verified
BraylonDash
commited on
May 6
Training in progress, step 1600
12fc425
verified
BraylonDash
commited on
May 6
Training in progress, step 1500
9c2b136
verified
BraylonDash
commited on
May 6
Training in progress, step 1400
9b56225
verified
BraylonDash
commited on
May 6
Training in progress, step 1300
63d2f82
verified
BraylonDash
commited on
May 6
Training in progress, step 1100
4e6d752
verified
BraylonDash
commited on
May 6
Training in progress, step 1000
38f5b8a
verified
BraylonDash
commited on
May 6
Training in progress, step 900
963d6c1
verified
BraylonDash
commited on
May 6
Training in progress, step 800
7e39a60
verified
BraylonDash
commited on
May 6
Training in progress, step 700
fc695d0
verified
BraylonDash
commited on
May 6
Training in progress, step 600
4976573
verified
BraylonDash
commited on
May 6
Training in progress, step 500
9c71e1e
verified
BraylonDash
commited on
May 6
Training in progress, step 400
2979944
verified
BraylonDash
commited on
May 6
Training in progress, step 300
405c6a0
verified
BraylonDash
commited on
May 6
Training in progress, step 200
494853c
verified
BraylonDash
commited on
May 6
Training in progress, step 100
b5fc6f0
verified
BraylonDash
commited on
May 6