- AI excels in complex tasks but struggles with simple spelling.
- Image and text generators prioritize recreating patterns.
- Resolving AI’s spelling issues is challenging due to the complexities of language and learning algorithms.
Acing SATs, failing spelling bees
AIs have demonstrated remarkable capabilities, from acing the SAT to defeating chess grandmasters and debugging code effortlessly. However, when pitted against middle schoolers in a spelling bee, AI falls short, unable to spell even simple words correctly.
Text-to-image generators like DALL-E often produce menus with misspelled items like “taao,” “burto,” and “enchida,” revealing a glaring weakness in AI’s abilities.
The math behind the madness
The underlying technology behind image and text generators differs, yet both struggle with intricate details like spelling. Image generators use diffusion models to reconstruct images from noise, while text generators employ large language models (LLMs) to match prompts with patterns in their latent space.
The algorithms prioritize recreating patterns that cover more pixels, leading to inaccuracies in smaller elements like text and fingers.
Language complexity takes the blame
Experts believe that resolving AI’s spelling issues will not be a quick fix. The English language’s intricacies, coupled with the numerous languages AI must learn, pose significant challenges. Some models, like Adobe Firefly, circumvent the problem by not generating text at all.
However, these guardrails can be easily bypassed with detailed prompts. LLMs, at their core, do not understand the concept of letters, even if they can generate impressive written content.