Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sanchit-gandhi
/
distil-zephyr-1.5b-dpo-ultrafeedback-200k
like
0
Text Generation
Transformers
TensorBoard
mistral
conversational
Inference Endpoints
text-generation-inference
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
distil-zephyr-1.5b-dpo-ultrafeedback-200k
/
wandb
1 contributor
History:
2 commits
sanchit-gandhi
HF staff
Training in progress, step 200
b555414
verified
about 2 months ago
run-20240426_164617-71zld9et
Training in progress, step 200
about 2 months ago
debug-internal.log
67.7 kB
Training in progress, step 200
about 2 months ago
debug.log
8.94 kB
Training in progress, step 100
about 2 months ago