Datasets documentation

Overview

You are viewing main version, which requires installation from source. If you'd like regular pip install, checkout the latest stable version (v3.2.0).
Hugging Face's logo
Join the Hugging Face community

and get access to the augmented documentation experience

to get started

Overview

Welcome to the 🤗 Datasets tutorials! These beginner-friendly tutorials will guide you through the fundamentals of working with 🤗 Datasets. You’ll load and prepare a dataset for training with your machine learning framework of choice. Along the way, you’ll learn how to load different dataset configurations and splits, interact with and see what’s inside your dataset, preprocess, and share a dataset to the Hub.

The tutorials assume some basic knowledge of Python and a machine learning framework like PyTorch or TensorFlow. If you’re already familiar with these, feel free to check out the quickstart to see what you can do with 🤗 Datasets.

The tutorials only cover the basic skills you need to use 🤗 Datasets. There are many other useful functionalities and applications that aren’t discussed here. If you’re interested in learning more, take a look at Chapter 5 of the Hugging Face course.

If you have any questions about 🤗 Datasets, feel free to join and ask the community on our forum.

Let’s get started! 🏁

< > Update on GitHub