gpt1 / data /dataset_dict.json
Alexandru Gherghescu
Add tokenized dataset, pre-training script
7e53000 unverified
raw
history blame
29 Bytes
{"splits": ["train", "test"]}