← All terms

Datasheets for datasets

Also known as: Dataset documentation, Data cards

A standardized documentation framework proposed by Gebru et al. that accompanies machine learning datasets with information about their creation, composition, intended use, and limitations. For accessibility, datasheets help surface representation gaps — such as whether people with disabilities are included in training data — and make transparent the labeling decisions and biases that affect how AI systems perceive and describe the world.

Category: artificial intelligence · ethics · data science

Related: Algorithmic bias · Differential privacy · Explainable AI

Sources