Are there datasets available?

#1
by Sciumo - opened

It would be very helpful to have at least a sample of the datasets.
Thanks.

i found there is a dataset under the same user

Ah yes. thanks.
https://huggingface.co/datasets/roneneldan/TinyStories
TinyStories, a synthetic dataset of short stories that only contain words that a typical 3 to 4-year-olds usually understand, generated by GPT-3.5 and GPT-4. ~2Gb

Sciumo changed discussion status to closed

Sign up or log in to comment