Model Card for llama3-pre1-ds-lora3

Model Details

Model Card: llama3-pre1-ds-lora3 with Fine-Tuning

Model Overview

Model Name: llama3-pre1-ds-lora3

Model Type: Transformer-based Language Model

Model Size: 8 billion parameters

Developed by: 4yo1

Languages: English and Korean

Model Description

llama3-pre1-ds-lora3 is a language model pre-trained on a diverse corpus of English and Korean texts and then fine-tuned. The fine-tuning approach (LoRA, judging by the model name) adapts the model to specific tasks or datasets by training only a small number of additional parameters, which makes it efficient for specialized applications.
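To give a feel for what "a small number of additional parameters" can mean, here is a rough back-of-the-envelope sketch. It assumes the fine-tuning is LoRA (suggested only by the model name), that the base model has Llama-3-8B-style projection shapes, that adapters sit on the query and value projections only, and that the rank is 8. None of these settings are stated in this card; they are illustrative assumptions.

```python
def lora_param_count(shapes, rank):
    """Extra parameters when each (d_in, d_out) weight gets a rank-`rank` LoRA adapter."""
    # Each adapter consists of two matrices: A (rank x d_in) and B (d_out x rank).
    return sum(rank * (d_in + d_out) for d_in, d_out in shapes)


# Assumptions (not stated in the card): Llama-3-8B-style shapes, q_proj and v_proj adapted.
NUM_LAYERS = 32
PER_LAYER_SHAPES = [(4096, 4096), (4096, 1024)]  # q_proj, v_proj
RANK = 8  # hypothetical LoRA rank
BASE_PARAMS = 8.03e9  # parameter count listed for this model

extra = NUM_LAYERS * lora_param_count(PER_LAYER_SHAPES, RANK)
fraction = extra / BASE_PARAMS
print(f"{extra:,} adapter parameters ({fraction:.4%} of the base model)")
```

Under these assumptions the adapters add a few million parameters, well under 0.1% of the base model. A different rank or set of target modules would change the figure proportionally.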

How to use: sample code

from transformers import AutoConfig, AutoModel, AutoTokenizer

model_id = "4yo1/llama3-pre1-ds-lora3"  # repository ID on the Hugging Face Hub

# AutoModel loads the base transformer without a language-modeling head.
config = AutoConfig.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id, config=config)
tokenizer = AutoTokenizer.from_pretrained(model_id)
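The snippet above loads the bare model, which returns hidden states rather than text. For text generation, something like the following sketch may be closer to what is needed. It assumes the repository's weights load through `AutoModelForCausalLM` and that bfloat16 is acceptable on the target hardware; the card confirms neither, and the helper name `generate_text` and its defaults are illustrative only.

```python
def generate_text(prompt, model_id="4yo1/llama3-pre1-ds-lora3", max_new_tokens=64):
    """Return a greedy continuation of `prompt` (sketch; assumes a causal-LM checkpoint)."""
    # Imported lazily so that defining the function does not require the libraries.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # Assumption: bfloat16 is acceptable; it halves memory relative to the stored F32 weights.
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)

    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example call (downloads the full checkpoint on first use):
# print(generate_text("Give me a simple recipe for kimchi fried rice."))
```

Greedy decoding (`do_sample=False`) is used here only to keep the output deterministic. Sampling parameters can be passed to `generate` instead.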

datasets:

  • recipes

license: mit

Safetensors model size: 8.03B params

Tensor type: F32
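As a rough guide to what these figures imply for hardware, the sketch below estimates the memory needed for the weights alone. It ignores activations, the KV cache, and framework overhead, and the BF16 row is a hypothetical alternative rather than a format this repository is stated to ship.

```python
# Bytes used per parameter for common tensor types.
BYTES_PER_PARAM = {"F32": 4, "BF16": 2, "F16": 2}


def weight_memory_gb(num_params, tensor_type):
    """Approximate size of the weights in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[tensor_type] / 1e9


NUM_PARAMS = 8.03e9  # parameter count listed for this model

for tensor_type in ("F32", "BF16"):
    print(f"{tensor_type}: {weight_memory_gb(NUM_PARAMS, tensor_type):.2f} GB")
```

At F32 the weights alone come to roughly 32 GB, so loading in a 16-bit type is a common way to fit a model of this size on a single accelerator.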