Enhancing Walk-Light Detector Usage for the Visually Impaired: A Comparison of VR Exploration and Verbal Instructions

Jonggi Hong, James Coughlan · 2024 · Proceedings of the 21st International Web for All Conference (W4A '24) · doi:10.1145/3677846.3677849

Summary

This paper from Stevens Institute of Technology and Smith-Kettlewell Eye Research Institute compares two methods for teaching people with visual impairments (PVI) to use a walk-light detector smartphone app at traffic intersections: virtual reality (VR) exploration versus traditional text/audio (TA) instructions. Camera aiming is a fundamental challenge for PVI using smartphone-based assistive apps — they must point the camera precisely at distant objects like walk lights without visual feedback. The researchers first conducted interviews with 10 PVI (ages 30-75, seven totally blind) to understand their camera use challenges, finding that image framing difficulties were the most common problem (7 of 10 participants). They then built a custom walk-light detector using a YOLO v2 object detection model trained on 21,208 images of walk lights in four states (Walk, Count down, Don't walk, Nothing). Preliminary experiments established that detection accuracy drops substantially when camera yaw deviates more than 10 degrees from straight-up, and that image blur from fast camera movement degrades performance. The VR training app, built with ARKit on iOS, placed virtual walk lights in a simulated intersection environment, providing audio and haptic feedback on camera orientation, speed, and image quality — allowing PVI to practice the physical skill of aiming a camera at a distant target in a safe indoor setting.

Key findings

Thirteen PVI participants (ages 39-76, nine totally blind) were divided into VR (n=7) and TA (n=6) groups for a between-subjects study. After indoor training, both groups used the walk-light detector at real intersections along a predetermined route with six trials. Quantitatively, there was no statistically significant difference in walk-light detection accuracy between groups — both achieved F1 scores around 0.98-0.99. However, qualitative differences were notable. The VR group moved their cameras more slowly during the scanning phase (31.3% of images at suitable speed vs 18.6% for TA), and had shorter waiting times after initial detection (M=19.8s vs M=38.9s, p=.022), suggesting better camera maintenance skills. The TA group had better camera orientation (93.6% properly oriented images vs 86.5% for VR), possibly because the VR app's random walk-light offsets encouraged more exploratory camera angles. Subjectively, 71.4% of VR participants felt they could not use the detector well without the training, compared to only 33.3% of TA participants — suggesting VR training created a deeper understanding of the task's demands. However, more TA participants found the training easy (66% strongly agreed vs 28% for VR), partly because the VR environment lacked real-world contextual cues like traffic sounds and ground textures, and the confined indoor space made VR navigation confusing.

Relevance

This study addresses a practical gap in assistive technology adoption: PVI need to learn camera-aiming skills that are inherently spatial and difficult to convey through verbal instructions alone. The VR approach shows promise for teaching these embodied skills — participants gained deeper understanding of camera manipulation even though the training was perceived as harder. For assistive technology developers, the key takeaway is that VR training can complement rather than replace verbal instructions, particularly for tasks requiring precise spatial awareness. The finding that camera yaw beyond 10 degrees significantly degrades object detection has direct design implications for any camera-based assistive app. The study also reveals an important tension in VR training design for PVI: virtual environments must closely match real-world parameters (distances, sizes, error rates) or the transfer of skills suffers. Smart glasses were the preferred form factor (8 of 10 interview participants), suggesting future walk-light detectors and similar tools should consider wearable designs that eliminate the camera-aiming burden entirely.

Tags: blindness · virtual reality · navigation · camera aiming · pedestrian safety · assistive technology training · orientation and mobility