ClassCat commited on
Commit
37b1b55
1 Parent(s): 2ecf9f0

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +83 -0
README.md ADDED
@@ -0,0 +1,83 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Hugging Face's logo
2
+ Hugging Face
3
+ Search models, datasets, users...
4
+ Models
5
+ Datasets
6
+ Spaces
7
+ Docs
8
+ Solutions
9
+ Pricing
10
+
11
+
12
+
13
+
14
+ ClassCat
15
+ /
16
+ gpt2-greek Copied
17
+ private
18
+ Text Generation
19
+ PyTorch
20
+ Transformers
21
+
22
+ cc100
23
+ Greek
24
+ gpt2
25
+ License:
26
+ cc-by-sa-4.0
27
+ Model card
28
+ Files and versions
29
+ Community
30
+ Settings
31
+ gpt2-greek
32
+ /
33
+ README.md
34
+ ClassCat's picture
35
+ ClassCat
36
+ Create README.md
37
+ 0765806
38
+ less than a minute ago
39
+ raw
40
+ history
41
+ blame
42
+ edit
43
+ delete
44
+ 965 Bytes
45
+ ---
46
+ language: el
47
+ license: cc-by-sa-4.0
48
+ datasets:
49
+ - cc100
50
+ widget:
51
+ - text: "Αυτό είναι ένα"
52
+ - text: "Ανοιξα την"
53
+ - text: "Ευχαριστώ για το"
54
+ - text: "Έχει πολύ καιρό που δεν έχουμε"
55
+ ---
56
+
57
+ ## Greek GPT2 model (Uncased)
58
+
59
+ ### Prerequisites
60
+
61
+ transformers==4.19.2
62
+
63
+ ### Model architecture
64
+
65
+ This model uses approximately half the size of GPT2 base model parameters.
66
+
67
+ ### Tokenizer
68
+
69
+ Using BPE tokenizer with vocabulary size 50,000.
70
+
71
+ ### Training Data
72
+
73
+ * Subset of [CC-100/el](https://data.statmt.org/cc-100/) : Monolingual Datasets from Web Crawl Data
74
+ * Subset of [oscar](https://huggingface.co/datasets/oscar)
75
+ * [wiki40b/el](https://www.tensorflow.org/datasets/catalog/wiki40b#wiki40bel) (French Wikipedia)
76
+
77
+ ### Usage
78
+
79
+ ```python
80
+ from transformers import pipeline
81
+ generator = pipeline('text-generation', model='ClassCat/gpt2-greek')
82
+ generator("Αυτό είναι ένα", max_length=50, num_return_sequences=5)
83
+ ```