# Inference with FasterTransformer

Since version 5.1, [NVIDIA FasterTransformer](https://github.com/NVIDIA/FasterTransformer) supports both GPT-NeoX inference and a variety of soft prompts (including prefix-tuning). The pretrained model and prefix weights released in this repo have been verified to work with FasterTransformer 5.1.
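The FasterTransformer conversion and runtime steps themselves are documented in the FasterTransformer repository linked above. As a quick sanity check of the released checkpoint before converting it, the model can be run with the standard Hugging Face `transformers` generation API. Below is a minimal sketch; the `use_fast=False` flag, the prompt, and the sampling parameters are illustrative assumptions, not part of the FasterTransformer workflow:

~~~python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

# Load the released checkpoint from the Hugging Face Hub.
# use_fast=False is assumed here because the repo ships a
# SentencePiece-based slow tokenizer.
tokenizer = AutoTokenizer.from_pretrained(
    "rinna/japanese-gpt-neox-small", use_fast=False
)
model = AutoModelForCausalLM.from_pretrained("rinna/japanese-gpt-neox-small")
model.eval()

# Encode a short Japanese prompt and sample a continuation.
inputs = tokenizer("こんにちは、", return_tensors="pt")
with torch.no_grad():
    output_ids = model.generate(
        **inputs,
        max_new_tokens=50,   # illustrative length
        do_sample=True,
        temperature=0.8,     # illustrative sampling settings
        pad_token_id=tokenizer.pad_token_id,
    )
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
~~~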
# How to cite
~~~
@misc{rinna-japanese-gpt-neox-small,
    title = {rinna/japanese-gpt-neox-small},
    author = {Zhao, Tianyu and Sawada, Kei},
    url = {https://huggingface.co/rinna/japanese-gpt-neox-small},
}

@inproceedings{sawada2024release,
    title = {Release of Pre-Trained Models for the {J}apanese Language},
    author = {Sawada, Kei and Zhao, Tianyu and Shing, Makoto and Mitsui, Kentaro and Kaga, Akio and Hono, Yukiya and Wakatsuki, Toshiaki and Mitsuda, Koh},
    booktitle = {Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)},
    month = {5},
    year = {2024},
    url = {https://arxiv.org/abs/2404.01657},
}
~~~
# License
[The MIT license](https://opensource.org/licenses/MIT)