upro's picture
Update README.md
613fabf verified
|
raw
history blame
No virus
1.35 kB
metadata
license: apache-2.0
tags:
  - llama3
  - llama-factory

RochatAI/llama3-8B-cn-rochat-v1 is an instruction-tuned language model from hfl/llama-3-chinese-8b-instruct-v3, focused on tunning for Chinese role-playing.

We perform supervised fine-tuning with our in-house high-quality instruction-following chat datasets. Afterwards, we do two rounds DPO training for some special cases.

drawing

Contact Us

Usage

Recommend Samplers

temperature=0.98
top_p=0.37
top_k=100.0
repetition_penalty=1.18

Prompt Template

Llama-3-Instruct:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>

{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{output}<|eot_id|>

Basic Roleplay System Prompt

Enter RP mode. You shall reply to {{user}} while staying in character. Write at least 1 paragraph, up to 3, no more than 300 words. Your responses must be detailed, creative, immersive, and drive the scenario forward. You will follow {{character}}'s persona and personality.