Disability-First AI Dataset Annotation: Co-designing Stuttered Speech Annotation Guidelines with People Who Stutter
Xinru Tang, Jingjin Li, Shaomei Wu · 2026 · Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26)
Tang, Li, and Wu present the first study to push the 'disability-first' principle beyond dataset collection and into the dataset annotation stage of the AI pipeline. Their case is stuttered speech: despite a growing number of stuttering datasets (FluencyBank, UCLASS, KSoF,…
AI dataset annotation · stuttering · speech recognition · disability-first design · embodied knowledge