metadata
library_name: transformers
license: llama3
base_model:
- nbeerbower/llama-3-Daredevil-Mahou-8B
datasets:
- flammenai/MahouMix-v1
Mahou-1.3-llama3-8B
Mahou is our attempt to build a production-ready conversational/roleplay LLM.
Future versions will be released iteratively and finetuned on conversational data from flammen.ai.
License
This model is based on Meta Llama-3-8B and is governed by the META LLAMA 3 COMMUNITY LICENSE AGREEMENT.
Chat Format
This model has been trained to use the ChatML format. Note the additional tokens in tokenizer_config.json.
<|im_start|>system
{{system}}<|im_end|>
<|im_start|>{{char}}
{{message}}<|im_end|>
<|im_start|>{{user}}
{{message}}<|im_end|>
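As a rough illustration of the template above, the following sketch assembles a prompt string in plain Python. The helper names (`format_turn`, `build_prompt`), the sample character and messages, and the choice to end the prompt with an open turn for the next speaker are assumptions made for this example, not part of the model's tooling.

```python
# Minimal sketch: build a ChatML prompt string by hand.
# All helper names and sample text here are hypothetical.

def format_turn(role: str, content: str) -> str:
    """Wrap one message in ChatML start/end tokens."""
    return f"<|im_start|>{role}\n{content}<|im_end|>\n"


def build_prompt(system: str, turns: list[tuple[str, str]], next_speaker: str) -> str:
    """Join the system message and prior turns, then open a turn for next_speaker."""
    parts = [format_turn("system", system)]
    for speaker, message in turns:
        parts.append(format_turn(speaker, message))
    # Assumption: leave the final turn open so the model writes the reply.
    parts.append(f"<|im_start|>{next_speaker}\n")
    return "".join(parts)


prompt = build_prompt(
    system="You are Mahou, a cheerful mage.",
    turns=[
        ("Mahou", "*waves* hey there!"),
        ("User", "How was class today?"),
    ],
    next_speaker="Mahou",
)
print(prompt)
```

Here the role label of each turn is the speaker's name, matching the `{{char}}` and `{{user}}` placeholders in the template.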
Roleplay Format
- Speech without quotes.
- Actions in *asterisks*.
*leans against wall cooly* so like, i just casted a super strong spell at magician academy today, not gonna lie, felt badass.
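For clients that want to render actions and speech differently, a reply in this format could be split with a small regular expression. This is only a sketch: the function name `split_roleplay` is hypothetical, and it assumes actions never contain nested asterisks.

```python
import re

# Assumption: an action is any run of non-asterisk text between two asterisks.
ACTION_PATTERN = re.compile(r"\*([^*]+)\*")


def split_roleplay(text: str) -> tuple[list[str], str]:
    """Return (actions, speech) for a reply written in the roleplay format."""
    actions = [action.strip() for action in ACTION_PATTERN.findall(text)]
    # Remove the actions, then collapse leftover whitespace in the speech.
    speech = " ".join(ACTION_PATTERN.sub(" ", text).split())
    return actions, speech


actions, speech = split_roleplay(
    "*leans against wall cooly* so like, i just casted a super strong spell "
    "at magician academy today, not gonna lie, felt badass."
)
print(actions)
print(speech)
```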
SillyTavern (ST) Settings
- Use ChatML for the Context Template.
- Enable Instruct Mode.
- Use the Mahou preset.
- Recommended: Add newline as a stopping string:
["\n"]
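Outside of ST, the same effect can be approximated by truncating the generated text at the first stopping string. The sketch below is a plain-Python post-processing step; `apply_stop_strings` and the sample text are hypothetical, and many inference backends offer their own stop-sequence option that would be preferable when available.

```python
def apply_stop_strings(text: str, stop_strings: tuple[str, ...] = ("\n",)) -> str:
    """Cut text at the earliest occurrence of any stopping string."""
    cut = len(text)
    for stop in stop_strings:
        if not stop:
            continue  # ignore empty strings, which would match at index 0
        index = text.find(stop)
        if index != -1:
            cut = min(cut, index)
    return text[:cut]


raw_output = "*grins* yeah, it went great!\nUser: tell me more"
reply = apply_stop_strings(raw_output)
print(reply)
```

Stopping at a newline keeps each reply to a single line, which suits the short chat-style turns shown in the roleplay example. If a generation begins with a newline, this helper returns an empty string, so callers may want to strip leading whitespace first.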
Method
Finetuned for 10 epochs using an A100 on Google Colab.