from datasets import load_dataset dataset = load_dataset("openwebtext")