mia naomi committed "Update README.md" (commit 23fe965, parent: 2e9561e)

README.md CHANGED
@@ -38,7 +38,9 @@ This checkpoint was afterwards finetuned on [tiny_shakespeare](https://huggingfa
 | batch size | 8 |
 | context length (tokens) | 256 |
 
-Trained on 1 Tesla T4 ([google colab](https://colab.research.google.com/))
+Trained on 1 Tesla T4 from [google colab](https://colab.research.google.com/)
+
+```TrainOutput(global_step=147, training_loss=1.665000240818984, metrics={'train_runtime': 2828.7347, 'train_samples_per_second': 0.417, 'train_steps_per_second': 0.052, 'total_flos': 1555992281088.0, 'train_loss': 1.665000240818984, 'epoch': 1.0})```
 
 A good starting point to finetune your own gpt-j-6b would be [hivemind's 8bit training code](https://huggingface.co/hivemind/gpt-j-6B-8bit).
 
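As a quick sanity check, the figures in the `TrainOutput` line added by this commit are internally consistent with the hyperparameter table: throughput times runtime recovers the step count, and the ratio of sample throughput to step throughput recovers the batch size. A minimal sketch (the tolerance values are my own, and it assumes no gradient accumulation):

```python
# Figures reported in the TrainOutput added by this commit.
runtime_s = 2828.7347       # train_runtime
steps_per_s = 0.052         # train_steps_per_second
samples_per_s = 0.417       # train_samples_per_second
global_step = 147
batch_size = 8              # from the hyperparameter table above

# Steps implied by throughput * runtime should match global_step.
implied_steps = runtime_s * steps_per_s
assert abs(implied_steps - global_step) < 1

# Samples per optimizer step should match the batch size
# (assuming no gradient accumulation).
implied_batch = samples_per_s / steps_per_s
assert abs(implied_batch - batch_size) < 0.1

print(round(implied_steps, 1), round(implied_batch, 2))
```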