Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
wxzhang
/
selective-pairrm-33045197-mt0
like
0
Text Generation
Transformers
Safetensors
snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset
mistral
alignment-handbook
Generated from Trainer
trl
dpo
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
selective-pairrm-33045197-mt0
Commit History
End of training
a5665b2
verified
wxzhang
commited on
Mar 21
Model save
1ec4438
verified
wxzhang
commited on
Mar 21
Training in progress, step 300
3e4c977
verified
wxzhang
commited on
Mar 21
Training in progress, step 200
4479968
verified
wxzhang
commited on
Mar 21
Training in progress, step 100
5ca595f
verified
wxzhang
commited on
Mar 21
initial commit
e6a88ee
verified
wxzhang
commited on
Mar 21