A model retrained by removing the last 10 layers from the original Llama-3.1-8B-Instruct model.
To retrain the knowledge held by the original language model, we conducted broad fine-tuning to revive its extensive knowledge base. Following this, we applied refined fine-tuning using high-quality datasets to enhance the model's internal and linguistic representations, thereby improving its reliability.
after training the model on a specific task, we merged the pre-trained model with the task-trained model.
import transformers
import torch
model_id = "kikikara/ko-llama-3.1-5b-instruct-FrankenMerging"
pipeline = transformers.pipeline(
"text-generation",
model=model_id,
model_kwargs={"torch_dtype": torch.bfloat16},
device_map="auto",
)
messages = [
{"role": "system", "content": "๋น์ ์ ํ๊ตญ์ด ai ๋ชจ๋ธ์
๋๋ค."},
{"role": "user", "content": "์ธ์์ ์๋ฏธ๋ ๋ญ์ผ?"},
]
outputs = pipeline(
messages,
max_new_tokens=256,
)
print(outputs[0]["generated_text"][-1])
- Downloads last month
- 10
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.