Datasheets for datasets
Also known as: Dataset documentation, Data cards
A standardized documentation framework proposed by Gebru et al. that accompanies machine learning datasets with information about their creation, composition, intended use, and limitations. For accessibility, datasheets help surface representation gaps — such as whether people with disabilities are included in training data — and make transparent the labeling decisions and biases that affect how AI systems perceive and describe the world.
Category: artificial intelligence · ethics · data science
Related: Algorithmic bias · Differential privacy · Explainable AI