daekeun-ml
committed on
Commit
•
00a4c6e
1
Parent(s):
a29fb54
Create README.md
README.md
ADDED
@@ -0,0 +1,25 @@
---
language:
- ko
tags:
- llama-2
- instruct
- instruction
pipeline_tag: text-generation
license: llama2
---

# Llama-2-ko-instruct-13B

### Model Details
- Base Model: [LLaMA-2-koen-13B](https://huggingface.co/beomi/llama-2-koen-13b)

### Datasets
- Based on the KOpen-platypus and KoAlpaca datasets, with some English-to-Korean translation data added. The translation data comes from AWS blog content that I translated myself.
- Extracted only sentences longer than 100 characters and removed near-duplicate sentences using KoSimCSE (daekeun-ml/KoSimCSE-supervised-kobigbird-roberta-large).
- Created category-specific prompts that encourage the model to answer even at the risk of hallucination, in preparation for future RLHF (Reinforcement Learning from Human Feedback) or DPO (Direct Preference Optimization) tuning.
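
The length filter and similarity-based deduplication described in the list above could look roughly like the sketch below. It is only an illustration under stated assumptions, not the exact preprocessing used for this model: `filter_and_dedup`, `char_bigram_embed`, and the `0.9` similarity threshold are hypothetical, and the character-bigram embedder is a toy stand-in so the example runs without downloading anything. In practice, `embed` would be replaced by a function that returns KoSimCSE sentence embeddings.

```python
import math
from collections import Counter
from typing import Callable, Dict, List

Vector = Dict[str, float]


def char_bigram_embed(text: str) -> Vector:
    """Toy stand-in embedder: character-bigram counts (NOT KoSimCSE)."""
    counts = Counter(text[i:i + 2] for i in range(len(text) - 1))
    return {bigram: float(n) for bigram, n in counts.items()}


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two sparse vectors."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def filter_and_dedup(
    sentences: List[str],
    embed: Callable[[str], Vector] = char_bigram_embed,
    min_chars: int = 100,        # from the README: keep sentences longer than 100 characters
    sim_threshold: float = 0.9,  # assumed value; the README does not state a threshold
) -> List[str]:
    """Keep long sentences, greedily dropping any that are too similar to one already kept."""
    kept: List[str] = []
    kept_vecs: List[Vector] = []
    for sentence in sentences:
        if len(sentence) <= min_chars:
            continue  # too short
        vec = embed(sentence)
        if any(cosine(vec, kv) >= sim_threshold for kv in kept_vecs):
            continue  # near-duplicate of an earlier kept sentence
        kept.append(sentence)
        kept_vecs.append(vec)
    return kept
```

The greedy pass compares each candidate against every sentence kept so far, which is quadratic in the worst case. That is fine for a sketch, but a large corpus would likely need batched embeddings and an approximate nearest-neighbor index instead.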

### License
- Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International Public License, under the LLAMA 2 COMMUNITY LICENSE AGREEMENT

This model was created as a personal experiment and is unrelated to the organization I work for.