crumb commited on
Commit
421585d
1 Parent(s): d9ce65c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -1
README.md CHANGED
@@ -3,7 +3,10 @@ license: mit
3
  language:
4
  - en
5
  ---
6
- The smallest GPT-2 finetuned on approximately 2.23B tokens (almost the 2.48B needed to 'chinchilla-optimally' pretrain it!) consisting of 1.3B from common crawl sites from 2023, 540M from ArXiv, and 390M from GitHub.
 
 
 
7
 
8
  *(from GPT-2 model card)*
9
 
 
3
  language:
4
  - en
5
  ---
6
+
7
+ # GPT2(023) Model Card
8
+
9
+ This is the smallest GPT-2 model (124m) from OpenAi finetuned on approximately 2.23B tokens (almost the 2.48B needed to 'chinchilla-optimally' pretrain it!) consisting of 1.3B from common crawl sites from 2023, 540M from ArXiv, and 390M from GitHub.
10
 
11
  *(from GPT-2 model card)*
12