Model Details

Model Card: LLaMA3-ENG-KO-8B-SL2 with Fine-Tuning

Model Overview

Model Name: LLaMA3-ENG-KO-8B-SL2

Model Type: Transformer-based Language Model

Model Size: 8 billion parameters

Developed by: 4yo1

Languages: English and Korean

Model Description

LLaMA3-ENG-KO-8B-SL2 is a language model pre-trained on a diverse corpus of English and Korean texts and then fine-tuned. The fine-tuning approach adapts the model to specific tasks or datasets with only a small number of additional parameters, which makes it efficient for specialized applications.
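To give a rough sense of what "a small number of additional parameters" can mean, here is a back-of-the-envelope sketch. It assumes a LoRA-style low-rank adapter on the query and value projections, which this card does not name as the actual method, and it assumes the standard Llama 3 8B shapes (hidden size 4096, key/value width 1024, 32 layers). The rank of 8 and the helper names are illustrative only.

```python
# Rough parameter count for a hypothetical LoRA-style adapter.
# Assumptions (not stated in this card): adapters on q_proj and v_proj only,
# standard Llama 3 8B shapes, rank 8.

HIDDEN = 4096        # model hidden size (assumed)
KV_DIM = 1024        # key/value projection width under grouped-query attention (assumed)
NUM_LAYERS = 32      # number of transformer layers (assumed)
RANK = 8             # illustrative adapter rank
BASE_PARAMS = 8.03e9 # parameter count reported for this model


def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """A rank-r adapter adds two matrices: (rank x d_in) and (d_out x rank)."""
    return rank * (d_in + d_out)


def adapter_params(rank: int = RANK) -> int:
    """Total extra parameters when adapting q_proj and v_proj in every layer."""
    q_proj = lora_param_count(HIDDEN, HIDDEN, rank)
    v_proj = lora_param_count(HIDDEN, KV_DIM, rank)
    return NUM_LAYERS * (q_proj + v_proj)


extra = adapter_params()
print(f"{extra:,} adapter parameters "
      f"({100 * extra / BASE_PARAMS:.3f}% of the base model)")
```

Under these assumptions the adapter is a small fraction of a percent of the base model, which is the kind of efficiency the description refers to.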

How to Use: Sample Code

from transformers import AutoConfig, AutoModel, AutoTokenizer

config = AutoConfig.from_pretrained("4yo1/llama3-eng-ko-8b-sl2")
model = AutoModel.from_pretrained("4yo1/llama3-eng-ko-8b-sl2")
tokenizer = AutoTokenizer.from_pretrained("4yo1/llama3-eng-ko-8b-sl2")
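The snippet above loads the bare model without a language-modeling head. For text generation, a minimal sketch along the following lines may be more convenient. It assumes the checkpoint loads with `AutoModelForCausalLM`, that `torch` and `accelerate` are installed (needed for `device_map="auto"`), and that there is enough memory for the FP16 weights. The helper names and default settings are hypothetical, not part of this repository.

```python
MODEL_ID = "4yo1/llama3-eng-ko-8b-sl2"


def build_generation_kwargs(max_new_tokens: int = 64) -> dict:
    """Illustrative defaults: greedy decoding with a cap on new tokens."""
    return {"max_new_tokens": max_new_tokens, "do_sample": False}


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Load the model and return only the newly generated text for `prompt`."""
    # Imported lazily so the helpers above work without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,  # matches the FP16 weights listed for this model
        device_map="auto",          # assumption: requires the accelerate package
    )

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, **build_generation_kwargs(max_new_tokens))

    # Drop the prompt tokens so only the continuation is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("안녕하세요, 오늘 날씨는")` downloads the weights on first use and returns the model's continuation of the prompt.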

datasets:

  • 4yo1/llama3_test1

license: mit

Weights Format: Safetensors

Exact Model Size: 8.03B params

Tensor Type: FP16