--- base_model: - GoToCompany/llama3-8b-cpt-sahabatai-v1-instruct tags: - text-generation-inference - transformers - unsloth - llama - trl license: apache-2.0 language: - id datasets: - psetialana/multi_session_chat-informal_indonesian-transformed --- # Personalized Sahabat AI Llama 3.1 8 B - **Developed by:** [Pradana Setialana](https://www.linkedin.com/in/psetialana/) This model is a fine-tuned version of [GoToCompany/llama3-8b-cpt-sahabatai-v1-instruct](https://huggingface.co/GoToCompany/llama3-8b-cpt-sahabatai-v1-instruct) on [psetialana/multi_session_chat-informal_indonesian-transformed](https://huggingface.co/datasets/psetialana/multi_session_chat-informal_indonesian-transformed) dataset. ## Model description This model can be used to personalize conversations and role-play based on the persona given with the prompt ``` Kamu adalah sahabat user. Kamu memiliki karakter PERSONA_ASSISTANT. User memiliki karakter PERSONA_USER. Kamu berperilaku sesuai PERSONA_ASSISTANT dan menyesuaikan responmu sesuai PERSONA_USER. PERSONA_ASSISTANT: {assistant_persona} PERSONA_USER: {user_persona} ``` ## Training procedure ### LoRA config The following lora config were used during training: - alpha: 16 - r: 16 - droput: 0 - modules: "q_proj", "k_proj", "v_proj", "o_proj", "gate_proj", "up_proj", "down_proj" ### Training hyperparameters The following hyperparameters were used during training: - learning_rate: 2e-4 - optimizer: adamw_8bit ### Training results [TensorBoard](../../tensorboard)