athirdpath
/

Iambe-RP-cDPO-20b-GGUF

Not-For-All-Audiences

Model card Files Files and versions Community

athirdpath commited on Dec 7, 2023

Commit

c4df92d

•

1 Parent(s): 93f0182

Create README.md

Files changed (1) hide show

README.md +19 -0

README.md ADDED Viewed

	@@ -0,0 +1,19 @@

+---
+license: cc-by-nc-4.0
+language:
+- en
+tags:
+- not-for-all-audiences
+---
+<p align="center"><img src="https://i.ibb.co/pbpJHpk/iambe-sml.png"/><font size="6"> <b>Iambe-20b-v3_TEST-RP_cDPO</b> </font></p>
+<p align="center"><font size="4"> <b>Alpaca prompt formatting</b> </font></p>
+### Description
+Named after a charming daughter of Echo and Pan in Greek myth, Iambe-v3 is, as far as I am aware, the very first LLM trained with DPO on an erotic roleplay dataset.
+Iambe is intended to have the best realistically possible understanding of instructions, anatomy and scene state for a 20b merge, while remaining passionate and humanoid in "voice".
+### Update Methodology
+Take a look at [the dataset v2 Iambe and I created together](https://huggingface.co/datasets/athirdpath/DPO_Pairs-Roleplay-Alpaca-NSFW) for more info. The cDPO training was done directly on Iambe-20b-DARE-v2, I was researching 11b merges to reduce the compute, but it went nowhere, so I just bit the bullet on cost. The notebook used to train this model is also available in the dataset's repo.