werty1248
/

Llama-3-Ko-8B-OpenOrca

Text Generation

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

Edit model card

Llama-3-Ko-OpenOrca

Model Details

Model Description

Original model: beomi/Llama-3-Open-Ko-8B

Dataset: kyujinpy/OpenOrca-KO

Training details

Training: Axolotl을 이용해 LoRA-8bit로 4epoch 학습 시켰습니다.

sequence_len: 4096
bf16

학습 시간: A6000x2, 6시간

Evaluation

0 shot kobest

Tasks	n-shot	Metric	Value		Stderr
kobest_boolq	0	acc	0.5021	±	0.0133
kobest_copa	0	acc	0.6920	±	0.0146
kobest_hellaswag	0	acc	0.4520	±	0.0223
kobest_sentineg	0	acc	0.7330	±	0.0222
kobest_wic	0	acc	0.4881	±	0.0141

5 shot kobest

Tasks	n-shot	Metric	Value		Stderr
kobest_boolq	5	acc	0.7123	±	0.0121
kobest_copa	5	acc	0.7620	±	0.0135
kobest_hellaswag	5	acc	0.4780	±	0.0224
kobest_sentineg	5	acc	0.9446	±	0.0115
kobest_wic	5	acc	0.6103	±	0.0137

License:

https://llama.meta.com/llama3/license

Downloads last month: 424

Safetensors

Model size

8.03B params

Tensor type

BF16

·

Finetuned from

Dataset used to train werty1248/Llama-3-Ko-8B-OpenOrca