--- library_name: transformers tags: - DPO - reasoning - mistral license: apache-2.0 datasets: - argilla/distilabel-intel-orca-dpo-pairs pipeline_tag: text-generation --- # Model Card for felladrin-tinymistral-248m-v4-dpo SFT model trained with orca DPO ## Model Details ### Model Description Experimental. ChatML format.