stefan-it committed on
Commit
fda8864
1 Parent(s): bc800a9

readme: minor fixes

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -6,7 +6,7 @@ license: mit
 ---
 
 # German GPT-2 model
-In this repository we release (yet another) GPT-2 model, that was trained on ~90 GB from the ["German colossal, clean Common Crawl corpus" (GC4)](https://german-nlp-group.github.io/projects/gc4-corpus.html).
+In this repository we release (yet another) GPT-2 model, that was trained on ~90 GB from the ["German colossal, clean Common Crawl corpus"](https://german-nlp-group.github.io/projects/gc4-corpus.html) (GC4).
 
 The model is meant to be an entry point for fine-tuning on other texts, and it is definitely not as good or "dangerous" as the English GPT-3 model. We do not plan extensive PR or staged releases for this model 😉
 
@@ -85,8 +85,8 @@ This results in a total training corpus size of 90GB.
 
 # Training Details
 
-We use the recently re-trained `dbmdz/german-gpt2` (version 2!) model as back-bone model.
-Thus, the tokenizer and vocab is the same as used in the `dbmdz/german-gpt2` model.
+We use the recently re-trained `dbmdz/german-gpt2` ([version 2](https://huggingface.co/dbmdz/german-gpt2)!)
+model as back-bone model. Thus, the tokenizer and vocab is the same as used in the `dbmdz/german-gpt2` model.
 
 The model was trained on a v3-8 TPU, with the following parameters:
 
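As a usage sketch only (not part of this commit): since the README describes the model as an entry point for fine-tuning and states that the tokenizer and vocab match `dbmdz/german-gpt2`, loading it with 🤗 Transformers might look like the following. The repo id `stefan-it/german-gpt2-larger` is an assumption about this repository's model name; substitute the actual id.

```python
# Minimal sketch: load the German GPT-2 model and its tokenizer for generation.
# The repo id below is an assumption, not taken from the commit; the README only
# confirms that the tokenizer/vocab are shared with `dbmdz/german-gpt2`.
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

model_id = "stefan-it/german-gpt2-larger"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Quick generation check; the same model/tokenizer pair can be passed to a
# fine-tuning setup (e.g. Trainer) for adapting the model to other texts.
generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
print(generator("Heute ist sehr schönes Wetter in", max_length=40)[0]["generated_text"])
```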