Edit model card

Update @ 2024.03.13

T3Q-Mistral-Orca-Math-DPO

This model is a DPO fine-tuned version of liminerity/M7-7b

Model Developers Chihoon Lee(chlee10), T3Q

T3Q-Mistral-Orca-Math-DPO

This model is a DPO fine-tuned version of liminerity/M7-7b

Model Developers Chihoon Lee(chlee10), T3Q

T3Q-Mistral-Orca-Math-DPO

This model is a DPO fine-tuned version of liminerity/M7-7b

Model Developers Chihoon Lee(chlee10), T3Q

Downloads last month
2,452
Safetensors
Model size
7.24B params
Tensor type
FP16
·

Finetuned from

Dataset used to train chihoonlee10/T3Q-Mistral-Orca-Math-DPO