eduagarcia/mistral-orpo-mix-7k
Text Generation
Transformers
TensorBoard
Safetensors
argilla/dpo-mix-7k
English
mistral
alignment-handbook
trl
orpo
Generated from Trainer
conversational
Inference Endpoints
text-generation-inference
License: apache-2.0
Commit History (branch: main)
cb74054 (verified): Update README.md, eduagarcia committed on Apr 28
2390265 (verified): Update README.md, eduagarcia committed on Apr 28
5f55c3c (verified): End of training, eduagarcia committed on Apr 28
0f85d72 (verified): Model save, eduagarcia committed on Apr 28
3bb27e9 (verified): initial commit, eduagarcia committed on Apr 28