Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
jikaixuan
/
zephyr-7b
like
0
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
b910b39
zephyr-7b
/
runs
/
Mar20_15-13-30_uclaml04.cs.ucla.edu
Commit History
End of training
cd97752
verified
jikaixuan
commited on
Mar 21
Model save
6b1b603
verified
jikaixuan
commited on
Mar 21
Training in progress, step 450
3c9b215
verified
jikaixuan
commited on
Mar 21
Training in progress, step 150
b79da88
verified
jikaixuan
commited on
Mar 20
Training in progress, step 50
7c78d17
verified
jikaixuan
commited on
Mar 20