pchatz committed on
Commit 8240f7c
1 Parent(s): 1388aee

Update README.md

Files changed (1)
  1. README.md +38 -0
README.md CHANGED
@@ -30,6 +30,44 @@ In order to use 'palobert-base-greek-social-media', the text needs to be pre-pro
  * convert to lowercase
  * remove all punctuation
 
+ ## Load Model
+
+ ```python
+ from transformers import AutoTokenizer, AutoModelForMaskedLM
+
+ tokenizer = AutoTokenizer.from_pretrained("pchatz/palobert-base-greek-social-media")
+
+ model = AutoModelForMaskedLM.from_pretrained("pchatz/palobert-base-greek-social-media")
+ ```
+ You can use this model directly with a pipeline for masked language modeling
+
+ ```python
+ from transformers import pipeline
+
+ fill = pipeline('fill-mask', model=model, tokenizer=tokenizer)
+ fill(f'μεσα {fill.tokenizer.mask_token} δικτυωσης')
+
+ [{'score': 0.8760559558868408,
+ 'token': 12853,
+ 'token_str': ' κοινωνικης',
+ 'sequence': 'μεσα κοινωνικης δικτυωσης'},
+ {'score': 0.020922638475894928,
+ 'token': 1104,
+ 'token_str': ' μεσα',
+ 'sequence': 'μεσα μεσα δικτυωσης'},
+ {'score': 0.017568595707416534,
+ 'token': 337,
+ 'token_str': ' της',
+ 'sequence': 'μεσα της δικτυωσης'},
+ {'score': 0.006678201723843813,
+ 'token': 1258,
+ 'token_str': 'τικης',
+ 'sequence': 'μεσατικης δικτυωσης'},
+ {'score': 0.004737381357699633,
+ 'token': 16245,
+ 'token_str': 'τερης',
+ 'sequence': 'μεσατερης δικτυωσης'}]
+ ```
 
  ## Evaluation on MLM and Sentiment Analysis tasks
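
The pre-processing steps visible in the README context of this hunk (convert to lowercase, remove all punctuation) could be sketched roughly as follows. This is a minimal illustration, not the author's code: the `preprocess` helper and its `strip_accents` flag are hypothetical names. Accent stripping is an assumption, inferred only from the unaccented example input (`μεσα ... δικτυωσης`); the lines shown in this diff do not list it, so it can be switched off. Treating "punctuation" as every character in a Unicode `P*` category is likewise an assumption.

```python
import unicodedata


def preprocess(text: str, strip_accents: bool = True) -> str:
    """Hypothetical helper: lowercase, optionally drop accents, remove punctuation."""
    text = text.lower()
    if strip_accents:
        # Assumption: diacritics are removed, as the unaccented README example suggests.
        # NFD splits an accented letter into base letter + combining mark (category Mn).
        decomposed = unicodedata.normalize("NFD", text)
        text = "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")
    # Assumption: "punctuation" means any character whose Unicode category starts with "P".
    text = "".join(ch for ch in text if not unicodedata.category(ch).startswith("P"))
    # Collapse whitespace runs left behind by removed characters.
    return " ".join(text.split())


print(preprocess("Μέσα Κοινωνικής Δικτύωσης!"))  # μεσα κοινωνικης δικτυωσης
```

If used, the helper would be applied to the raw text before inserting `fill.tokenizer.mask_token`, so that the mask token itself is never touched by the cleanup.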