license: apache-2.0
language:
  - en

kimdeokgi/merge_model_test2

Introduction

This model is a test version of an alignment-tuned model.

We use state-of-the-art instruction fine-tuning methods, including direct preference optimization (DPO). After DPO training, we linearly merged the models to boost performance.
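As a rough illustration of linear merging, the sketch below averages the parameters of several checkpoints that share one architecture, using a normalized weight per model. It is not the actual merge script for this model: the function name `linear_merge`, the toy parameter names, and the equal 0.5/0.5 weights are all assumptions made for the example, and parameters are represented as plain Python lists rather than real tensors.

```python
from typing import Dict, List, Sequence

# Hypothetical stand-in for a checkpoint: parameter name -> flat list of values.
StateDict = Dict[str, List[float]]


def linear_merge(state_dicts: Sequence[StateDict], weights: Sequence[float]) -> StateDict:
    """Return the weighted average of parameters with matching names and sizes."""
    if not state_dicts:
        raise ValueError("need at least one model")
    if len(state_dicts) != len(weights):
        raise ValueError("one weight per model is required")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")

    # Normalize so the merged parameters stay on the same scale as the inputs.
    norm = [w / total for w in weights]

    names = set(state_dicts[0].keys())
    if any(set(sd.keys()) != names for sd in state_dicts):
        raise ValueError("all models must have the same parameter names")

    merged: StateDict = {}
    for name in state_dicts[0]:
        tensors = [sd[name] for sd in state_dicts]
        size = len(tensors[0])
        if any(len(t) != size for t in tensors):
            raise ValueError(f"size mismatch for parameter {name!r}")
        # Element-wise weighted sum across models.
        merged[name] = [
            sum(w * t[i] for w, t in zip(norm, tensors)) for i in range(size)
        ]
    return merged


# Toy example with two made-up checkpoints and equal (assumed) weights.
model_a = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
model_b = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}
merged = linear_merge([model_a, model_b], [0.5, 0.5])
print(merged)
```

With real checkpoints the same element-wise average would be applied to each tensor in the models' state dicts. The merge weights are a tunable choice and are not specified in this card.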