Text-to-Image

Also known as: Text-to-Image Generation, T2I

An AI capability that generates visual images from natural language text descriptions (prompts). Text-to-image models like DALL-E, Midjourney, and Stable Diffusion have opened new creative possibilities for blind individuals by allowing them to create visual content through verbal description rather than visual manipulation. However, challenges remain around prompt engineering, verifying the accuracy and quality of generated images, and ensuring that outputs match the creators intent without visual confirmation.

Category: artificial intelligence · creative accessibility

Related: Generative AI · Vision-Language Model · Prompt Engineering · Blind Photography

Sources

https://doi.org/10.1145/3663547.3746345