metadata
license: apache-2.0
language:
- en
base_model:
- meta-llama/Llama-3.1-8B-Instruct
pipeline_tag: text-generation
tags:
- RP
- roleplay
- nsfw
- not-for-all-audiences
HikariBloom-v0.3-RP
HikariBloom-v0.3-RP is a chatbot model built upon meta-llama/Llama-3.1-8B-Instruct, with additional SFT training. This model is designed to enable engaging conversations with a variety of characters. However, it may encounter safety issues related to toxic or NSFW content.
We are not liable for any commercial damage or losses incurred from the use of this model.
Look forward to our next model! We are preparing a Preference Fine-Tuning model using a reward model.
How to start
import torch
model_id = "Rookied2/HikariBloom-v0.3-RP"
pipeline = transformers.pipeline(
"text-generation",
model=model_id,
model_kwargs={"torch_dtype": torch.bfloat16},
device_map="auto",
)
messages = [
{"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
{"role": "user", "content": "Who are you?"},
]
outputs = pipeline(
messages,
max_new_tokens=256,
)
print(outputs[0]["generated_text"][-1])
Recommended chat templates : This is a chat template we've used a lot in training.
### template 1
character_name : {character_name}
character_description : {character_description}
you're roleplaying as a character.
### template 2
character_name : {character_name}
character_description : {character_description}
chat_exmaple:
{example}
see chat_exmaple, you are roleplaying as a character.
Training Data
- PygmalionAI/PIPPA (https://huggingface.co/datasets/PygmalionAI/PIPPA)
- MinervaAI/Aesir-Preview (https://huggingface.co/datasets/MinervaAI/Aesir-Preview)
- HuggingFaceH4/no_robots (https://huggingface.co/datasets/HuggingFaceH4/no_robots)
- HuggingFaceTB/smol-smoltalk (https://huggingface.co/datasets/HuggingFaceTB/smol-smoltalk)
- Undi95/toxic-dpo-v0.1-sharegpt (https://huggingface.co/datasets/Undi95/toxic-dpo-v0.1-sharegpt)
- NobodyExistsOnTheInternet/ToxicQAFinal (https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal)
Contact Email
For more information, feel free to contact us
Email: Harry@supergene.co