Oh commited on
Commit
4eb49b0
·
1 Parent(s): a4907bb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +49 -0
README.md CHANGED
@@ -1,3 +1,52 @@
1
  ---
2
  license: cc-by-nc-4.0
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: cc-by-nc-4.0
3
+ language:
4
+ - ko
5
+ library_name: transformers
6
+ pipeline_tag: text-generation
7
  ---
8
+
9
+
10
+ **The license is `cc-by-nc-4.0`.**
11
+
12
+ # **GAI-LLM/polyglot-12.8b-mixed-v3**
13
+
14
+ ## Model Details
15
+
16
+ **Model Developers** Donghoon Oh, Hanmin Myung, Eunyoung Kim (SK C&C G.AI Eng)
17
+
18
+ **Input** Models input text only.
19
+
20
+ **Output** Models generate text only.
21
+
22
+ **Model Architecture**
23
+ GAI-LLM/polyglot-12.8b-mixed-v3 is an auto-regressive language model based on the polyglot transformer architecture.
24
+
25
+ **Base Model** [EleutherAI/polyglot-ko-12.8b](https://huggingface.co/EleutherAI/polyglot-ko-12.8b)
26
+
27
+ **Training Dataset**
28
+
29
+ - We combined Open Korean Dateset.
30
+ - We use A100 GPU 80GB * 8, when trianing.
31
+
32
+ # **Model Benchmark**
33
+
34
+ ## KO-LLM leaderboard
35
+ - Follow up as [Open KO-LLM LeaderBoard](https://huggingface.co/spaces/upstage/open-ko-llm-leaderboard).
36
+
37
+
38
+ # Implementation Code
39
+ ```python
40
+ ### GAI-LLM/polyglot-12.8b-mixed-v3
41
+ from transformers import AutoModelForCausalLM, AutoTokenizer
42
+ import torch
43
+
44
+ repo = "GAI-LLM/polyglot-12.8b-mixed-v3"
45
+ model = AutoModelForCausalLM.from_pretrained(
46
+ repo,
47
+ return_dict=True,
48
+ torch_dtype=torch.float16,
49
+ device_map='auto'
50
+ )
51
+ tokenizer = AutoTokenizer.from_pretrained(repo)
52
+ ```