DUAL-GPO/zephyr-7b-ipo-qlora-v0-merged


lole25 committed on Sep 22

Commit 007761d • 1 Parent(s): d9690fd

Create README.md

Files changed (1)

README.md +5 -0

README.md ADDED

	@@ -0,0 +1,5 @@

+---
+license: apache-2.0
+---
+This model is a fine-tuned version of Zephyr-7B using DPO on the HuggingFaceH4/ultrafeedback_binarized dataset.