rinna
/

japanese-gpt2-xsmall

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

tianyuz commited on Aug 23, 2021

Commit

e2dac72

•

1 Parent(s): 80dd97b

update readme

Files changed (2) hide show

README.md +3 -1
config.json +2 -2

README.md CHANGED Viewed

@@ -12,13 +12,15 @@ license: mit
 datasets:
 - cc100
 - wikipedia
 ---
 # japanese-gpt2-xsmall
 ![rinna-icon](./rinna.png)
-This repository provides an extra-small-sized Japanese GPT-2 model. The model is provided by [rinna](https://corp.rinna.co.jp/).
 # How to use the model

 datasets:
 - cc100
 - wikipedia
+widget:
+- text: "生命、宇宙、そして万物についての究極の疑問の答えは"
 ---
 # japanese-gpt2-xsmall
 ![rinna-icon](./rinna.png)
+This repository provides an extra-small-sized Japanese GPT-2 model. The model was trained using code from Github repository [rinnakk/japanese-pretrained-models](https://github.com/rinnakk/japanese-pretrained-models) by [rinna Co., Ltd.](https://corp.rinna.co.jp/)
 # How to use the model

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "../../huggingface_models/japanese-gpt2-xsmall/",
   "activation_function": "gelu_new",
   "architectures": [
     "GPT2LMHeadModel"
@@ -12,7 +12,7 @@
   "initializer_range": 0.02,
   "layer_norm_epsilon": 1e-05,
   "model_type": "gpt2",
-  "n_ctx": 768,
   "n_embd": 512,
   "n_head": 8,
   "n_inner": 2304,

 {
+  "_name_or_path": "rinna/japanese-gpt2-xsmall",
   "activation_function": "gelu_new",
   "architectures": [
     "GPT2LMHeadModel"
   "initializer_range": 0.02,
   "layer_norm_epsilon": 1e-05,
   "model_type": "gpt2",
+  "n_ctx": 1024,
   "n_embd": 512,
   "n_head": 8,
   "n_inner": 2304,