EleutherAI/gpt-neo-125m


lg committed on May 20, 2021

Commit 5e6c180 • 1 Parent(s): 8741c10

Update README.md

Files changed (1)

README.md +1 -1

README.md CHANGED

@@ -23,7 +23,7 @@ GPT-Neo 125M was trained on the Pile, a large scale curated dataset created by E
 ## Training procedure
-This model was trained for 572,300 steps on the Pile. It was trained as a masked autoregressive language model, using cross-entropy loss.
+This model was trained on the Pile for 300 billion tokens over 572,300 steps. It was trained as a masked autoregressive language model, using cross-entropy loss.
 ## Intended Use and Limitations
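
As a quick sanity check on the figures this commit adds, the average number of tokens consumed per optimizer step can be derived from the two numbers in the new sentence. This is only a back-of-the-envelope sketch: the variable names are illustrative, "300 billion" is treated as an exact count, and the README does not state the actual batch size or sequence length, so the result is an average rather than a documented setting.

```python
# Figures taken from the updated README sentence.
TOTAL_TOKENS = 300_000_000_000  # "300 billion tokens" (assumed exact here)
TOTAL_STEPS = 572_300           # "572,300 steps"

# Average tokens processed per training step implied by those figures.
tokens_per_step = TOTAL_TOKENS / TOTAL_STEPS

# For comparison only: the nearest power of two. This is an observation,
# not a claim about the configured batch size.
nearest_power_of_two = 2 ** round(tokens_per_step).bit_length()
if abs(nearest_power_of_two // 2 - tokens_per_step) < abs(nearest_power_of_two - tokens_per_step):
    nearest_power_of_two //= 2

print(f"average tokens per step: {tokens_per_step:,.0f}")
print(f"nearest power of two:    {nearest_power_of_two:,}")
```

The implied average is a little over half a million tokens per step, which is consistent with the two figures describing the same training run.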