Update README.md

README.md (CHANGED)

@@ -38,13 +38,13 @@ GPT-J-6B, based on EleutherAI's Mesh Transformer JAX codebase

## Instructions

-We recommend to use finetuneanon's transformer codebase for inferencing as split checkpoint loads up a lot faster than monolithic checkpoint supported by HuggingFace Transformers repository.
+We recommend using finetuneanon's forked transformer codebase for inference, as the split checkpoint loads much faster than the monolithic checkpoint supported by the HuggingFace Transformers repository.
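
As a minimal sketch of what loading might look like with the fork installed in place of the stock `transformers` package (the local checkpoint path and the use of `GPTNeoForCausalLM` are assumptions for illustration, not this repository's documented API):

```python
# Minimal sketch, assuming finetuneanon's transformers fork is installed in
# place of the stock package and the split checkpoint has been downloaded
# to a local directory (hypothetical path).
from transformers import AutoTokenizer, GPTNeoForCausalLM

model = GPTNeoForCausalLM.from_pretrained("./gptj-japanese-split").eval()
tokenizer = AutoTokenizer.from_pretrained("./gptj-japanese-split")
```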
The tokenizer still uses 50256 as the <|endoftext|> substitute. Therefore, 50256 should be excluded during inference.
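
One way to do this with the standard `generate()` API is to ban the token ID outright (a sketch only; the prompt and sampling settings are placeholders):

```python
# Exclude token 50256 (the <|endoftext|> substitute) while sampling.
input_ids = tokenizer("こんにちは、", return_tensors="pt").input_ids
output = model.generate(
    input_ids,
    do_sample=True,
    max_length=60,
    bad_words_ids=[[50256]],  # never emit the <|endoftext|> substitute
)
print(tokenizer.decode(output[0]))
```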
## Datasets

-Lack of quality Japanese corpus
+The lack of a quality Japanese corpus was one of the major challenges when we trained the model. We aimed to compile well-formatted corpora outside of Common Crawl.
The dataset is normalized and sanitized against leading and trailing spaces and excessive CR/LF repetitions.
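
A rough sketch of this kind of sanitization (assumed logic for illustration, not the actual preprocessing script used to build the dataset):

```python
import re

def sanitize(text: str) -> str:
    # Normalize CRLF/CR to LF, then strip leading/trailing spaces per line.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    lines = [line.strip() for line in text.splitlines()]
    # Collapse runs of three or more newlines (excessive blank lines) to two.
    return re.sub(r"\n{3,}", "\n\n", "\n".join(lines))
```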