tianyuz commited on
Commit
d35a68c
1 Parent(s): 36c6328

update readme

Browse files
Files changed (3) hide show
  1. README.md +3 -1
  2. config.json +1 -1
  3. tokenizer_config.json +1 -1
README.md CHANGED
@@ -12,13 +12,15 @@ license: mit
12
  datasets:
13
  - cc100
14
  - wikipedia
 
 
15
  ---
16
 
17
  # japanese-gpt2-small
18
 
19
  ![rinna-icon](./rinna.png)
20
 
21
- This repository provides a small-sized Japanese GPT-2 model. The model is provided by [rinna](https://corp.rinna.co.jp/).
22
 
23
  # How to use the model
24
 
 
12
  datasets:
13
  - cc100
14
  - wikipedia
15
+ widget:
16
+ - text: "生命、宇宙、そして万物についての究極の疑問の答えは"
17
  ---
18
 
19
  # japanese-gpt2-small
20
 
21
  ![rinna-icon](./rinna.png)
22
 
23
+ This repository provides a small-sized Japanese GPT-2 model. The model was trained using code from Github repository [rinnakk/japanese-pretrained-models](https://github.com/rinnakk/japanese-pretrained-models) by [rinna Co., Ltd.](https://corp.rinna.co.jp/)
24
 
25
  # How to use the model
26
 
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "../../../code/huggingface_models/japanese-gpt2-small-huggingface",
3
  "activation_function": "gelu_new",
4
  "architectures": [
5
  "GPT2LMHeadModel"
 
1
  {
2
+ "_name_or_path": "rinna/japanese-gpt2-small",
3
  "activation_function": "gelu_new",
4
  "architectures": [
5
  "GPT2LMHeadModel"
tokenizer_config.json CHANGED
@@ -1 +1 @@
1
- {"eos_token": "</s>", "unk_token": "<unk>", "pad_token": "[PAD]", "extra_ids": 0, "additional_special_tokens": [], "bos_token": "<s>", "cls_token": "[CLS]", "sep_token": "[SEP]", "mask_token": "[MASK]", "do_lower_case": true}
 
1
+ {"eos_token": "</s>", "unk_token": "<unk>", "pad_token": "[PAD]", "extra_ids": 0, "additional_special_tokens": [], "sp_model_kwargs": {}, "bos_token": "<s>", "cls_token": "[CLS]", "sep_token": "[SEP]", "mask_token": "[MASK]", "do_lower_case": true, "tokenizer_class": "T5Tokenizer"}