RikkiXu/zephyr-7b-dpo-full
Tags: Text Generation, Transformers, TensorBoard, Safetensors, mistral, trl, dpo, Generated from Trainer, conversational, text-generation-inference, Inference Endpoints
0219782
zephyr-7b-dpo-full/runs/May09_23-57-28_n136-098-158/events.out.tfevents.1715270961.n136-098-158.1658072.0
Commit History
Training in progress, step 600 (984c010, verified), RikkiXu committed on May 10
Training in progress, step 500 (3651bd2, verified), RikkiXu committed on May 9
Training in progress, step 400 (a85b47d, verified), RikkiXu committed on May 9
Training in progress, step 300 (3f1a7e2, verified), RikkiXu committed on May 9
Training in progress, step 200 (f09c771, verified), RikkiXu committed on May 9
Training in progress, step 100 (a054fd9, verified), RikkiXu committed on May 9