Model Card for LLaMA3-ENG-KO-8B-SL

Model Details

Model Card: LLaMA3-ENG-KO-8B-SL with Fine-Tuning

Model Overview

Model Name: LLaMA3-ENG-KO-8B-SL

Model Type: Transformer-based Language Model

Model Size: 8 billion parameters

By: 4yo1

Languages: English and Korean

Model Description

LLaMA3-ENG-KO-8B-SL is a language model pre-trained on a diverse corpus of English and Korean texts. Its fine-tuning approach adapts the model to specific tasks or datasets by adding only a small number of parameters, which keeps it efficient for specialized applications.
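
The idea of adding only a small number of parameters can be made concrete with a little arithmetic. The sketch below assumes a low-rank adapter scheme (LoRA-style). The card does not name the fine-tuning method, so the function name `lora_extra_params`, the 4096 x 4096 matrix size, and the rank of 8 are illustrative assumptions, not details of this model.

```python
def lora_extra_params(d_out: int, d_in: int, rank: int) -> int:
    """Parameters added by a rank-`rank` adapter on a d_out x d_in weight matrix.

    The adapter consists of two factors: B (d_out x rank) and A (rank x d_in).
    """
    return rank * (d_out + d_in)


# Illustrative numbers only (assumptions, not values stated in this card).
d_out, d_in, rank = 4096, 4096, 8

full = d_out * d_in                           # parameters in the frozen matrix
extra = lora_extra_params(d_out, d_in, rank)  # parameters the adapter trains

print(f"full matrix: {full:,} parameters")
print(f"adapter:     {extra:,} parameters ({extra / full:.2%} of the matrix)")
```

Under these assumptions the adapter trains well under one percent of the parameters in the matrix it modifies, which is the sense in which such fine-tuning is parameter-efficient.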

How to use: sample code

from transformers import AutoConfig, AutoModel, AutoTokenizer

# Load the configuration, weights, and tokenizer from the same Hub repository.
config = AutoConfig.from_pretrained("4yo1/llama3-eng-ko-8b-sl")
model = AutoModel.from_pretrained("4yo1/llama3-eng-ko-8b-sl")
tokenizer = AutoTokenizer.from_pretrained("4yo1/llama3-eng-ko-8b-sl")
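
The snippet above loads the model but does not produce text. A minimal generation sketch might look like the following. It assumes the checkpoint loads as a causal language model through `AutoModelForCausalLM`, which the card does not state explicitly. The helper names `pick_device_and_dtype` and `generate_text`, the prompt, and the 64-token limit are hypothetical choices for illustration.

```python
def pick_device_and_dtype(cuda_available: bool) -> tuple[str, str]:
    """Use FP16 on a GPU; fall back to FP32 on CPU, where half precision is poorly supported."""
    return ("cuda", "float16") if cuda_available else ("cpu", "float32")


def generate_text(
    prompt: str,
    model_id: str = "4yo1/llama3-eng-ko-8b-sl",
    max_new_tokens: int = 64,
) -> str:
    """Generate a continuation of `prompt` (downloads the weights on first use)."""
    # Imported lazily so that defining the function does not require the libraries.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    device, dtype_name = pick_device_and_dtype(torch.cuda.is_available())
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # Assumption: the repository holds a causal LM checkpoint.
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=getattr(torch, dtype_name)
    ).to(device)

    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text("Translate to Korean: Good morning.")` would return the prompt followed by the model's continuation. An 8B-parameter model in FP16 needs roughly 16 GB of memory for the weights alone, so a GPU is advisable.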

datasets:

  • 4yo1/llama3_test1

license: mit

Safetensors: model size 8.03B params, tensor type FP16