Uploaded model
- Developed by: ludekcizinsky
- License: apache-2.0
- Finetuned from model : unsloth/Phi-3-mini-4k-instruct-bnb-4bit
This mistral model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 0
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
馃檵
Ask for provider support
Model tree for ludekcizinsky/phi3-dpo-align
Base model
unsloth/Phi-3-mini-4k-instruct-bnb-4bit