RichardErkhov
commited on
Commit
โข
5aaade7
1
Parent(s):
92b417e
uploaded readme
Browse files
README.md
ADDED
@@ -0,0 +1,76 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
Quantization made by Richard Erkhov.
|
2 |
+
|
3 |
+
[Github](https://github.com/RichardErkhov)
|
4 |
+
|
5 |
+
[Discord](https://discord.gg/pvy7H8DZMG)
|
6 |
+
|
7 |
+
[Request more models](https://github.com/RichardErkhov/quant_request)
|
8 |
+
|
9 |
+
|
10 |
+
AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3 - GGUF
|
11 |
+
- Model creator: https://huggingface.co/AIFT/
|
12 |
+
- Original model: https://huggingface.co/AIFT/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3/
|
13 |
+
|
14 |
+
|
15 |
+
| Name | Quant method | Size |
|
16 |
+
| ---- | ---- | ---- |
|
17 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q2_K.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q2_K.gguf) | Q2_K | 2.24GB |
|
18 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.IQ3_XS.gguf) | IQ3_XS | 2.48GB |
|
19 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.IQ3_S.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.IQ3_S.gguf) | IQ3_S | 2.6GB |
|
20 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q3_K_S.gguf) | Q3_K_S | 2.59GB |
|
21 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.IQ3_M.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.IQ3_M.gguf) | IQ3_M | 2.69GB |
|
22 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q3_K.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q3_K.gguf) | Q3_K | 2.86GB |
|
23 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q3_K_M.gguf) | Q3_K_M | 2.86GB |
|
24 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q3_K_L.gguf) | Q3_K_L | 3.08GB |
|
25 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.IQ4_XS.gguf) | IQ4_XS | 3.18GB |
|
26 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q4_0.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q4_0.gguf) | Q4_0 | 3.32GB |
|
27 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.IQ4_NL.gguf) | IQ4_NL | 3.35GB |
|
28 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q4_K_S.gguf) | Q4_K_S | 3.34GB |
|
29 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q4_K.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q4_K.gguf) | Q4_K | 3.5GB |
|
30 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q4_K_M.gguf) | Q4_K_M | 3.5GB |
|
31 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q4_1.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q4_1.gguf) | Q4_1 | 3.66GB |
|
32 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q5_0.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q5_0.gguf) | Q5_0 | 4.0GB |
|
33 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q5_K_S.gguf) | Q5_K_S | 4.0GB |
|
34 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q5_K.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q5_K.gguf) | Q5_K | 4.09GB |
|
35 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q5_K_M.gguf) | Q5_K_M | 4.09GB |
|
36 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q5_1.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q5_1.gguf) | Q5_1 | 4.34GB |
|
37 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q6_K.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q6_K.gguf) | Q6_K | 4.72GB |
|
38 |
+
| [AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q8_0.gguf](https://huggingface.co/RichardErkhov/AIFT_-_AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3-gguf/blob/main/AIFT-ko-orca-plat-Yi-ko-6b-v1.2-dpo-3.Q8_0.gguf) | Q8_0 | 6.12GB |
|
39 |
+
|
40 |
+
|
41 |
+
|
42 |
+
|
43 |
+
Original model description:
|
44 |
+
---
|
45 |
+
license: cc-by-sa-4.0
|
46 |
+
---
|
47 |
+
<h1>orca-platypus - instruct-dpo-3 ๋ชจ๋ธ v1.2</h1>
|
48 |
+
|
49 |
+
<b><ํ์ต ๋ฐ์ดํฐ ๊ตฌ์ถ></b>
|
50 |
+
kyujinpy ๋์ด ๊ณต๊ฐํ์ KOR-OpenOrca-Platypus ๋ฐ์ดํฐ๋ฅผ ์ผ๋ถ ์ญ์ (์ํ๋ง) ๋ฐ ์ ์ ์์
์งํํ์ฌ ํ์ฉ.
|
51 |
+
๊ทธ ์ดํ ํด๋น ๋ฐ์ดํฐ๋ค์ ๋ณด๋ฉฐ ๊ด๋ จ ํ์คํฌ๋ฅผ ์ถ์ถํ์๊ณ ์ด๋ฅผ ๊ธฐ๋ฐ์ผ๋ก
|
52 |
+
ํด๋น ํ์คํฌ์ ๋ง์ถฐ์ NLP ๊ด๋ จ ์คํ์์ค ๋ฐ์ดํฐ๋ฅผ ํ์ฉํ์ฌ ํ์ต๋ฐ์ดํฐ๋ฅผ ์์ฒด์ ์ผ๋ก
|
53 |
+
์ญ์ฌ, ๊ณผํ, ์ํ, ๊ธฐ๊ณ๋
ํด, ๋ฆฌ๋ทฐ ๋ถ์ ๋ฌธ์ ๋ฅผ gpt๋ฅผ ํตํด์ ๊ตฌ์ถํ์๊ณ ,
|
54 |
+
aihub ์ผ๋ฐ์์ ๋ฐ ๊ธฐ๊ณ๋
ํด ๋ฐ์ดํฐ๋ฅผ ํ์ฉํ์ฌ ์ถ๊ฐ๋ก ํ์ต ๋ฐ์ดํฐ๋ฅผ ๊ตฌ์ถ(ํํ์ ๊ด๋ จ, ๊ธฐ๊ณ๋
ํด ๊ด๋ จ ๋ฐ ์์ฝ)
|
55 |
+
๊ฐ์ข
๋ธ๋ก๊ทธ์์ ์ญ์ฌ ๋ฐ ์์ ํด์ฆ๋ฅผ ์ฌ๋์ด ์ง์ ํ์ต๋ฐ์ดํฐ ํํ๋ก ๋ณ๊ฒฝ
|
56 |
+
AI2AI Challenge ๋ฐ์ดํฐ ํํ๋ฅผ ๋ณด๊ณ gpt๋ฅผ ํตํด ์ด๋ฑ ์์ค์ ๊ณผํ ์ํ ๋ฌธ์ ์ ํ์ ์ ์ 500๋ฌธ์
|
57 |
+
์์ด ๋ฒ์ญ ๋ฐ์ดํฐ ์ํ/ํ์ ๋ฐ์ดํฐ ํ์ต ๋ฐ์ดํฐ๋ก ํ์ฉ ์งํ
|
58 |
+
์ด ๋ฐ์ดํฐ 4๋ง๊ฐ ์ ๋ ์ฌ์ฉํ์์ต๋๋ค.
|
59 |
+
|
60 |
+
<br>
|
61 |
+
|
62 |
+
<DPOํ์ต ๋ฐ์ดํฐ>
|
63 |
+
DPO ๋ฐ์ดํฐ๋ CommonGen๊ณผ TruthfulQA์ ์ด์ ์ ๋ง์ถ์ด ์ฝ 17,000๊ฐ์ ๋ฐ์ดํฐ๋ฅผ ํ์ตํ์์ต๋๋ค.
|
64 |
+
+ ko-hh-rlhf ๋ฐ์ดํฐ์์ chosen ๋ฐ์ดํฐ๋ถ๋ถ์ ChatGPT๋ฅผ ํตํด ๋ณ๊ฒฝํ ๋ฐ์ดํฐ๋ฅผ ์ถ๊ฐ ํ์ตํ์์ต๋๋ค.
|
65 |
+
+ ko-hh-rlhf 59000์ฌ๊ฐ์ ๋ฐ์ดํฐ์ chosen ๋ฐ์ดํฐ๋ฅผ ๋ชจ๋ gpt-3.5๋ฅผ ํตํด ์ฌ์์ฑํ ํ ์ผ๋ถ ๋ฐ์ดํฐ๋ฅผ ํํฐ๋งํ์ฌ ์ญ์ ์งํํ์์ต๋๋ค.
|
66 |
+
<br>
|
67 |
+
+ TruthfulQA ๊ด๋ จ ๋ฌธ์ ์ถ๊ฐ๋ฅผ ์งํํ์์ต๋๋ค.(์์ค ๊ด๋ จ ์ฐธ๊ฑฐ์ง ๋ฌธ์ )
|
68 |
+
+ ๊ธฐ๊ณ๋
ํด ๊ด๋ จ ํ์ต ๋ฐ์ดํฐ๋ฅผ ChatGPT๋ฅผ ํตํด์ ๋ต๋ณ์ ์ป์ด ํ์ต
|
69 |
+
+ ๋ฌธ๋ฒ๊ด๋ จ ํ์ต ๋ฐ์ดํฐ
|
70 |
+
<br>
|
71 |
+
###ํ์ต ๋ฐ์ดํฐ ํ์ผ์ ๋น๊ณต๊ฐ์
๋๋ค.
|
72 |
+
<br>
|
73 |
+
<b><ํ์ต></b>
|
74 |
+
ํ์ต์ LoRA๋ฅผ ์ฌ์ฉํ์ฌ A100 40G *2์์ ํ์ต์ ์งํํ์์ต๋๋ค.
|
75 |
+
|
76 |
+
|