catherinearnett
commited on
Commit
•
df698f1
1
Parent(s):
a448fb1
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -23,7 +23,6 @@ Details for this model specifically:
|
|
23 |
* Architecture: gpt2
|
24 |
* Parameters: 124770816
|
25 |
* Maximum sequence length: 512 tokens
|
26 |
-
* Training text data (raw): [XXXX]
|
27 |
* Training tokens: 12B
|
28 |
* Vocabulary size: 50000
|
29 |
* Compute cost: ~9 NVIDIA A6000 GPU hours
|
|
|
23 |
* Architecture: gpt2
|
24 |
* Parameters: 124770816
|
25 |
* Maximum sequence length: 512 tokens
|
|
|
26 |
* Training tokens: 12B
|
27 |
* Vocabulary size: 50000
|
28 |
* Compute cost: ~9 NVIDIA A6000 GPU hours
|