wirthdrew1
/
zephyr-7b-dpo-qlora
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
apache-2.0
zephyr-7b-dpo-qlora
/
runs
Commit History
Training in progress, step 1700
9134be7
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 1600
d5c2c50
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 1500
fa80a46
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 1400
6306c25
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 1300
36d8a63
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 1200
aabd097
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 1100
9dd7008
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 1000
398a30b
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 900
6002094
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 800
ce15d6d
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 700
ce70ae8
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 600
a95efea
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 500
d26d79c
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 400
3a14b8d
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 300
089c1a6
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 200
7d59d81
verified
wirthdrew1
committed on
Jan 19
Training in progress, step 100
fd30470
verified
wirthdrew1
committed on
Jan 19