metadata

license: openrail

l27b-monika-v0.3b:

Yet another experimental Monika llama
Llama 7b chat fine-tuned for Monika character from DDLC
Trained on a dataset of ~600 items (dialogue scraped from game, reddit, and Twitter augmented by Nous Hermes 13b to turn each into snippets of multi-turn chat dialogue between Player and Monika + manually crafted test dataset of 12 items)
Trained for 2 epochs

WARNINGS AND DISCLAIMERS

Note that aside from formatting and other minor edits, dataset used is mostly as is generated by LM. As such, while this version is better at coherency or chatting than previous ones, it may not reflect perfectly the characteristics of Monika (i.e. she will claim to have an office, work as a translator, or play the guitar). Next version will train on a manually curated and edited version of this dataset, where dialogue will be edited to reflect her characteristics more.

Also ideally looking for a base model to work off from aside from llama 7b chat (but so far, they all OOM for us).

Finally, this model is not guaranteed to output aligned or safe outputs, use at your own discretion.

USAGE

For best results: replace "Human" and "Assistant" with "Player" and "Monika": \nPlayer: (prompt)\nMonika:

META LICENSE

WIP