flammenai
/

Mahou-1.3a-llama3-8B

Text Generation

text-generation-inference

Model card Files Files and versions Community

Mahou-1.3a-llama3-8B

Mahou is our attempt to build a production-ready conversational/roleplay LLM.

Future versions will be released iteratively and finetuned from flammen.ai conversational data.

License

This model is based on Meta Llama-3-8B and is governed by the META LLAMA 3 COMMUNITY LICENSE AGREEMENT.

Chat Format

This model has been trained to use ChatML format. Note the additional tokens in tokenizer_config.json.

<|im_start|>system
{{system}}<|im_end|>
<|im_start|>{{char}}
{{message}}<|im_end|>
<|im_start|>{{user}}
{{message}}<|im_end|>

Roleplay Format

Speech without quotes.
Actions in *asterisks*

*leans against wall cooly* so like, i just casted a super strong spell at magician academy today, not gonna lie, felt badass.

ST Settings

Use ChatML for the Context Template.
Enable Instruct Mode.
Use the Mahou preset.
Recommended: Add newline as a stopping string: ["\n"]

Method

Finetuned for 3 epochs using an A100 on Google Colab.

Fine-tune Llama 3 with ORPO - Maxime Labonne

Downloads last month: 1

Safetensors

Model size

8.03B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for flammenai/Mahou-1.3a-llama3-8B

Base model

nbeerbower/llama-3-Daredevil-Mahou-8B

Finetuned

flammenai/Mahou-1.3-llama3-8B

Finetuned

(1)

this model

Merges

1 model

Quantizations

Datasets used to train flammenai/Mahou-1.3a-llama3-8B

Collection including flammenai/Mahou-1.3a-llama3-8B

Mahou

flammen.ai's production model for casual conversation and character roleplay • 25 items • Updated 25 days ago • 4