Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sanchit-gandhi
/
distil-zephyr-1.5b-dpo-ultrafeedback-200k
like
0
Text Generation
Transformers
TensorBoard
mistral
conversational
Inference Endpoints
text-generation-inference
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
distil-zephyr-1.5b-dpo-ultrafeedback-200k
/
runs
1 contributor
History:
2 commits
sanchit-gandhi
HF staff
Training in progress, step 200
b555414
verified
about 2 months ago
Apr26_16-38-17_ip-26-0-161-178
Training in progress, step 200
about 2 months ago