werty1248's picture
Update README.md
ac40a26 verified
---
library_name: transformers
base_model: beomi/Llama-3-Open-Ko-8B
datasets:
- kyujinpy/OpenOrca-KO
pipeline_tag: text-generation
license: llama3
---
# Llama-3-Ko-OpenOrca
<!-- Provide a quick summary of what the model is/does. -->
## Model Details
### Model Description
<!-- Provide a longer summary of what this model is. -->
Original model: [beomi/Llama-3-Open-Ko-8B](https://huggingface.co/beomi/Llama-3-Open-Ko-8B) (2024.04.24 버전)
Dataset: [kyujinpy/OpenOrca-KO](https://huggingface.co/datasets/kyujinpy/OpenOrca-KO)
### Training details
Training: Axolotl을 이용해 LoRA-8bit로 4epoch 학습 시켰습니다.
- sequence_len: 4096
- bf16
학습 시간: A6000x2, 6시간
### Evaluation
- 0 shot kobest
| Tasks |n-shot| Metric |Value | |Stderr|
|----------------|-----:|--------|-----:|---|------|
|kobest_boolq | 0|acc |0.5021|± |0.0133|
|kobest_copa | 0|acc |0.6920|± |0.0146|
|kobest_hellaswag| 0|acc |0.4520|± |0.0223|
|kobest_sentineg | 0|acc |0.7330|± |0.0222|
|kobest_wic | 0|acc |0.4881|± |0.0141|
- 5 shot kobest
| Tasks |n-shot| Metric |Value | |Stderr|
|----------------|-----:|--------|-----:|---|------|
|kobest_boolq | 5|acc |0.7123|± |0.0121|
|kobest_copa | 5|acc |0.7620|± |0.0135|
|kobest_hellaswag| 5|acc |0.4780|± |0.0224|
|kobest_sentineg | 5|acc |0.9446|± |0.0115|
|kobest_wic | 5|acc |0.6103|± |0.0137|
### License:
[https://llama.meta.com/llama3/license](https://llama.meta.com/llama3/license)