jtatman's picture
updated model card
25a3b2e verified
|
raw
history blame
No virus
328 Bytes
metadata
library_name: transformers
tags:
  - DPO
  - reasoning
  - mistral
license: apache-2.0
datasets:
  - argilla/distilabel-intel-orca-dpo-pairs
pipeline_tag: text-generation

Model Card for felladrin-tinymistral-248m-v4-dpo

SFT model trained with orca DPO

Model Details

Model Description

Experimental.

ChatML format.