Edit model card

Yi-6b-dpo

Model Details

Datasets

Benchmark

Model Average Ko-ARC Ko-HellaSwag Ko-MMLU Ko-TruthfulQA Ko-CommonGen V2
hyeogi/Yi-6b-dpo-v0.2 (Ours) 52.63 41.72 52.96 46.69 52.38 69.42
hyeogi/Yi-6b-dpo-v0.1(Ours) 51.38 41.3 52.23 45.34 54.03 63.99
Minirecord/Mini_DPO_7b_01 50.47 48.29 54.68 46.7 47.78 54.9

image/png

Downloads last month
1,165
Safetensors
Model size
6.18B params
Tensor type
FP16
·