File size: 339 Bytes
17d1c63
0f8850b
 
 
17d1c63
0f8850b
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
---

license: apache-2.0
language:
  - en
---


# kimdeokgi/all_dpo_model_test1





# **Introduction**

This model is test version, alignment-tuned model.



We utilize state-of-the-art instruction fine-tuning methods including direct preference optimization (DPO).

After DPO training, we linearly merged models to boost performance.