Edit model card

4season/model_eval_test10

Introduction

This model is test version, alignment-tuned model.

We utilize state-of-the-art instruction fine-tuning methods including direct preference optimization (DPO).

Downloads last month
1,861
Safetensors
Model size
21.4B params
Tensor type
BF16
·