The license is cc-by-nc-4.0.

GAI-LLM/Llama-3-8B_classification

Model Details

Model Developers Donghoon Oh, Hanmin Myung, SuKyung Park (SK C&C G.AI Eng)

Input Models input text only.

Output Models generate text only.

Model Architecture
GAI-LLM/Llama-3-8B_classification is an auto-regressive language model based on the Llama 3 transformer architecture.

Base Model meta-llama/Meta-Llama-3-8B

Training Dataset

  • We combined open Korean datasets using a mixed strategy.
  • We trained on 8 × A100 80GB GPUs.

Model Benchmark

Implementation Code

### GAI-LLM/Llama-3-8B_classification
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

repo = "GAI-LLM/Llama-3-8B_classification"
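# Load the weights in half precision; device_map="auto" needs the accelerate
# package and spreads the layers across the available devices.
model = AutoModelForCausalLM.from_pretrained(
    repo,
    return_dict=True,
    torch_dtype=torch.float16,
    device_map="auto",
)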
tokenizer = AutoTokenizer.from_pretrained(repo)
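
The loading code above stops once the model and tokenizer are in memory. As a minimal sketch of the next step, the helper below runs greedy generation on a single prompt. The function name `generate_text`, the `max_new_tokens` default, and the example prompt are illustrative assumptions. This card does not document a prompt format, so the right input for this checkpoint may differ.

```python
def generate_text(model, tokenizer, prompt, max_new_tokens=64):
    """Greedy-decode a continuation of `prompt` (hypothetical helper, not from the model card)."""
    # Tokenize and move the tensors to the device that holds the model's first layers.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    # do_sample=False selects greedy decoding, so repeated calls give the same output.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    # generate() returns prompt + continuation; drop the prompt tokens before decoding.
    prompt_len = len(inputs["input_ids"][0])
    return tokenizer.decode(output_ids[0][prompt_len:], skip_special_tokens=True)


# Example call, assuming `model` and `tokenizer` were loaded as shown above:
# print(generate_text(model, tokenizer, "Hello, my name is"))
```

Slicing off the prompt tokens keeps only the newly generated text, which is usually what a caller wants from a causal language model.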

Model Size 8.03B params (Safetensors)

Tensor Type BF16