kykim committed on
Commit
8ea2767
1 Parent(s): abc96ec

Create README.md

Files changed (1)
  1. README.md +16 -0
README.md ADDED
@@ -0,0 +1,16 @@
+ ---
+ language: ko
+ ---
+
+ # BERT base model for Korean
+
+ * A 70GB Korean text dataset and a vocabulary of 42,000 lower-cased subwords were used
+ * See model performance and other Korean language models on [GitHub](https://github.com/kiyoungkim1/LM-kor)
+
+ ```python
+ # PyTorch only (via the transformers library)
+ from transformers import BertTokenizerFast, EncoderDecoderModel
+
+ tokenizer = BertTokenizerFast.from_pretrained("kykim/bertshared-kor-base")
+ model = EncoderDecoderModel.from_pretrained("kykim/bertshared-kor-base")
+ ```
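
The README snippet stops after loading the tokenizer and model. As a rough sketch of how the loaded `EncoderDecoderModel` might then be used for text generation, the helper below wraps loading and beam-search decoding in one function. The function name `generate_text`, the defaults `max_length=64` and `num_beams=4`, and the choice of `[CLS]`/`[SEP]` as decoder start and end tokens are assumptions made for illustration; none of them come from the README. The README also does not say whether this checkpoint gives useful output without fine-tuning.

```python
# Sketch only: the generation settings below are illustrative assumptions,
# not values taken from the model card.
MODEL_NAME = "kykim/bertshared-kor-base"


def generate_text(text, model_name=MODEL_NAME, max_length=64, num_beams=4):
    """Encode `text`, run beam-search generation, and return the decoded string."""
    # Imported lazily so this module can be loaded without PyTorch installed.
    import torch
    from transformers import BertTokenizerFast, EncoderDecoderModel

    tokenizer = BertTokenizerFast.from_pretrained(model_name)
    model = EncoderDecoderModel.from_pretrained(model_name)
    model.eval()  # disable dropout for inference

    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        output_ids = model.generate(
            inputs["input_ids"],
            attention_mask=inputs["attention_mask"],
            max_length=max_length,
            num_beams=num_beams,
            # Assumption: BERT-style special tokens mark the start and end
            # of the decoder sequence.
            decoder_start_token_id=tokenizer.cls_token_id,
            eos_token_id=tokenizer.sep_token_id,
            pad_token_id=tokenizer.pad_token_id,
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text("...")` with a Korean input string downloads the checkpoint on first use, so it needs network access and both `torch` and `transformers` installed.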