minhtoan commited on
Commit
b6b7458
1 Parent(s): b1b1257

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +47 -0
README.md ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: vi
3
+ tags:
4
+ - vi
5
+ - vietnamese
6
+ - gpt2
7
+ - text-generation
8
+ - lm
9
+ - nlp
10
+ datasets:
11
+ - wikilinguage
12
+ widget:
13
+ - text: "Hoa quả và rau thường rẻ hơn khi vào mùa. "
14
+ ---
15
+
16
+ # GPT-2
17
+
18
+ Pretrained gpt3 small model (gpt-neo) on Vietnamese for text generation
19
+
20
+ # How to use the model
21
+
22
+ ~~~~
23
+ from transformers import GPT2Tokenizer, GPTNeoForCausalLM
24
+
25
+ tokenizer = GPT2Tokenizer.from_pretrained('minhtoan/gpt3-small-vietnamese')
26
+ model = GPTNeoForCausalLM.from_pretrained('minhtoan/gpt3-small-vietnamese')
27
+
28
+ text = "Hoa quả và rau thường rẻ hơn khi vào mùa"
29
+ input_ids = tokenizer.encode(text, return_tensors='pt')
30
+ max_length = 150
31
+
32
+ sample_outputs = model.generate(input_ids,pad_token_id=tokenizer.eos_token_id,
33
+ do_sample=True,
34
+ max_length=max_length,
35
+ min_length=max_length,
36
+ num_return_sequences=1)
37
+
38
+ for i, sample_output in enumerate(sample_outputs):
39
+ print(">> Generated text {}\n\n{}".format(i+1, tokenizer.decode(sample_output.tolist())))
40
+ print('\n---')
41
+ ~~~~
42
+
43
+
44
+ ## Author
45
+ `
46
+ Phan Minh Toan
47
+ `