Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
wenqiglantz
/
MistralTrinity-7B-slerp-dpo
like
0
Text Generation
Transformers
Safetensors
mlabonne/chatml_dpo_pairs
English
mistral
instruct
finetune
chatml
synthetic data
distillation
dpo
rlhf
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
abd495e
MistralTrinity-7B-slerp-dpo
Commit History
Create README.md
abd495e
verified
wenqiglantz
commited on
Jan 19, 2024
Upload tokenizer
2147e5a
verified
wenqiglantz
commited on
Jan 19, 2024
Upload MistralForCausalLM
5b30377
verified
wenqiglantz
commited on
Jan 19, 2024
initial commit
b615c15
verified
wenqiglantz
commited on
Jan 19, 2024