---
license: apache-2.0
language:
- en
---
# kimdeokgi/merge_model_test1
# **Introduction**
This model is a test version of an alignment-tuned model.
We use state-of-the-art instruction fine-tuning methods, including direct preference optimization (DPO).
After DPO training, we linearly merged models to boost performance.
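
For readers unfamiliar with linear merging, the sketch below shows the general idea: each parameter of the merged model is a weighted average of the same parameter across the source models. This README does not say which models were merged, what weights were used, or which tool performed the merge, so the function name `linear_merge`, the toy parameter name `layer.weight`, and the equal weights are all illustrative assumptions. Parameters are plain Python lists here to keep the example dependency-free; a real merge would operate on framework tensors.

```python
from typing import Dict, List, Sequence

# Hypothetical stand-in for a model state dict: parameter name -> flat list of values.
StateDict = Dict[str, List[float]]


def linear_merge(state_dicts: Sequence[StateDict], weights: Sequence[float]) -> StateDict:
    """Return the weighted average of several state dicts (illustrative sketch)."""
    if not state_dicts:
        raise ValueError("need at least one state dict")
    if len(state_dicts) != len(weights):
        raise ValueError("need exactly one weight per state dict")

    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    # Normalize so the merge coefficients sum to 1.
    coeffs = [w / total for w in weights]

    # All models must share the same architecture, i.e. the same parameter names.
    names = state_dicts[0].keys()
    for sd in state_dicts[1:]:
        if sd.keys() != names:
            raise ValueError("state dicts have different parameter names")

    merged: StateDict = {}
    for name in names:
        size = len(state_dicts[0][name])
        if any(len(sd[name]) != size for sd in state_dicts):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Element-wise weighted sum across models.
        merged[name] = [
            sum(c * sd[name][i] for c, sd in zip(coeffs, state_dicts))
            for i in range(size)
        ]
    return merged


# Toy usage with two made-up "models" and equal weights (assumed, not from this README).
model_a = {"layer.weight": [1.0, 2.0]}
model_b = {"layer.weight": [3.0, 4.0]}
merged = linear_merge([model_a, model_b], weights=[0.5, 0.5])
```

Normalizing the weights keeps the merged parameters on the same scale as the inputs, which is a common choice but not something this README confirms.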