EllieS/zephyr-7b-dpo-lora-pubmedqa-selfgen-ultrafeedback-old
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License: apache-2.0
Commit History
Model save
8c89d20
verified
EllieS committed on Feb 23
Training in progress, step 7000
3d5b529
verified
EllieS committed on Feb 23
Training in progress, step 6000
c197fc9
verified
EllieS committed on Feb 23
Training in progress, step 5000
7495c59
verified
EllieS committed on Feb 23
Training in progress, step 4000
47bd01f
verified
EllieS committed on Feb 23
Training in progress, step 3000
ec920f6
verified
EllieS committed on Feb 23
Training in progress, step 2000
08be08f
verified
EllieS committed on Feb 23
Training in progress, step 1000
2b84994
verified
EllieS committed on Feb 23
initial commit
aa04d83
verified
EllieS committed on Feb 23