EditScribe: Non-Visual Image Editing with Natural Language Verification Loops
Ruei-Che Chang, Yuxuan Liu, Lotus Zhang, Anhong Guo · 2024 · ASSETS '24: Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility
This paper introduces EditScribe, a prototype system that makes image editing accessible to blind and low vision (BLV) users through natural language interaction powered by large multimodal models (LMMs). Image editing is inherently visual and iterative: users need to see the…
blind and low vision · image editing · generative AI · large multimodal models · natural language interaction