Each dataset should have a dataset card to promote responsible usage and inform users of any potential biases within the dataset. This idea was inspired by the Model Cards proposed by Mitchell et al., 2018. Dataset cards help users understand a dataset’s contents, the context for using the dataset, how it was created, and any other considerations a user should be aware of.
This guide shows you how to create a dataset card.
Create a new dataset card by copying this template to a README.md file in your repository.
Generate structured tags to help users discover your dataset on the Hub. Create the tags with the online Datasets Tagging app.
Select the appropriate tags for your dataset from the dropdown menus.
Copy the YAML tags under Finalized tag set and paste the tags at the top of your README.md file.
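To make the placement of the tags concrete, here is a minimal sketch of a YAML tag block sitting at the very top of a card, delimited by `---` lines. The helper names (`build_front_matter`, `add_tags_to_card`), the dataset name, and the tag values are hypothetical and chosen for illustration only. In practice you would paste the tags generated by the tagging app rather than build them by hand.

```python
def build_front_matter(tags: dict[str, list[str]]) -> str:
    """Render a dict of tag lists as a YAML block delimited by '---' lines."""
    lines = ["---"]
    for key, values in tags.items():
        lines.append(f"{key}:")
        for value in values:
            lines.append(f"- {value}")
    lines.append("---")
    return "\n".join(lines) + "\n"


def add_tags_to_card(card_text: str, tags: dict[str, list[str]]) -> str:
    """Place the YAML tag block at the very top of the card text."""
    return build_front_matter(tags) + "\n" + card_text.lstrip("\n")


# Hypothetical tag values, for illustration only.
example_tags = {
    "language": ["en"],
    "license": ["mit"],
    "task_categories": ["text-classification"],
}

card = add_tags_to_card("# Dataset Card for my_dataset\n", example_tags)
print(card)
```

The resulting text is what the README.md file would contain: the tag block first, then the body of the card.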
Fill out the dataset card sections to the best of your ability. Take a look at the Dataset Card Creation Guide for more detailed information about what to include in each section of the card. For fields you are unable to complete, you can write [More Information Needed].
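If you want to keep track of which sections still need work, a small script can list the sections that still contain the placeholder. The sketch below assumes the card uses Markdown headings that start with `#`. The function name `incomplete_sections` and the sample card are hypothetical.

```python
PLACEHOLDER = "[More Information Needed]"


def incomplete_sections(card_text: str) -> list[str]:
    """Return the headings of sections whose body still contains the placeholder."""
    pending = []
    current = None  # heading of the section currently being scanned
    for line in card_text.splitlines():
        if line.startswith("#"):
            current = line.lstrip("#").strip()
        elif PLACEHOLDER in line and current is not None and current not in pending:
            pending.append(current)
    return pending


# Hypothetical card content, for illustration only.
sample_card = """# Dataset Card for my_dataset
## Dataset Summary
A small demo dataset.
## Licensing Information
[More Information Needed]
"""

print(incomplete_sections(sample_card))
```

Running a check like this before committing gives a quick list of the fields you were unable to complete.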
Once you’re done filling out the dataset card, commit the changes to the README.md file, and you should see the completed dataset card on your repository.
Feel free to take a look at these dataset card examples to help you get started:
You can also check out the (similar) documentation about dataset cards on the Hub side.