Machine learning generated alternative texts – too little, too soon

Note: This post is older than two years. It may still be totally valid, but things change and technology moves fast. Code based posts may be especially prone to changes...

Wouldn’t it be nice if a computer would just write a perfect alternative text for us whenever we include an image? There are some services promising this, but their output should still be manually checked and corrected! But in the future…

I’ve been looking into automatic solutions for generating the alternative text that every piece of graphical content must have. Although they keep getting better at recognizing, for example, the objects inside a picture, they add little value when it comes to contextual alternative text.

So my thesis is that authors are still the best source of good alternative text, and they should not let computer vision alone decide what their graphic is trying to say in the context of their content.

Authors of the future will still need to think about alternative texts; artificial intelligence will not be enough

We can easily imagine what it would be like if our authoring tool simply ran the article and image through a machine learning / artificial intelligence powered computer vision tool or API, and all of our graphical elements got the best possible alternative text auto-magically.

Such automatic tools can help authors when it comes to looking for synonyms or phrases with equal meaning, but the tool itself should not provide the whole alternative text for the author.
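To make the "assist, don't decide" idea a bit more concrete, here is a minimal sketch of how an authoring tool could treat a machine suggestion as a draft only. Everything in it is an assumption for illustration: `AltTextDraft`, `draft_alt_text`, `author_review` and `fake_describe_image` are hypothetical names, and the fake describer just stands in for whatever computer vision API a real tool might call.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class AltTextDraft:
    """A machine suggestion that stays a draft until the author reviews it."""
    image_path: str
    machine_suggestion: str
    author_text: Optional[str] = None  # set only by a human author

    @property
    def publishable_alt(self) -> Optional[str]:
        # Only the author's wording is ever published; None means "not reviewed yet".
        return self.author_text


def draft_alt_text(image_path: str, describe_image: Callable[[str], str]) -> AltTextDraft:
    """Ask a (hypothetical) vision service for a suggestion and wrap it as a draft."""
    return AltTextDraft(image_path=image_path,
                        machine_suggestion=describe_image(image_path))


def author_review(draft: AltTextDraft, final_text: str) -> AltTextDraft:
    """Store the wording the author decided on, with surrounding whitespace removed."""
    draft.author_text = final_text.strip()
    return draft


def fake_describe_image(image_path: str) -> str:
    # Stand-in for a real computer vision API: a generic label without any context.
    return "a person holding a phone"


if __name__ == "__main__":
    draft = draft_alt_text("team.jpg", fake_describe_image)
    print("Suggestion:", draft.machine_suggestion)
    author_review(draft, "Our support lead answering a customer call")
    print("Published alt:", draft.publishable_alt)
```

The design choice here is that the suggestion and the published text live in separate fields, so the tool can show the suggestion as a starting point but can never publish it without the author's decision.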

Alt text could perhaps be automated if and when our interfaces gain direct access to our thoughts

If we try to make an educated guess about future interfaces, nothing beats a direct connection between the human brain and the computer.

I will not go into the ethics and the possibilities of abuse, but if we take only the good from its possibilities, then we can agree that a computer able to read our minds could also automate the alternative texts on our graphical elements, if we let it.

Author: Bogdan Cerovac

I am an IAAP-certified Web Accessibility Specialist (since 2020) and was a Google-certified Mobile Web Specialist.

I work as a digital agency co-owner, web developer and accessibility lead.

After work, I am the sole entrepreneur behind IDEA-lab Cerovac (Inclusion, Diversity, Equity and Accessibility lab). Check out my Accessibility Services if you want me to help you with digital accessibility.

Also head of the expert council at Institute for Digital Accessibility A11Y.si (in Slovenian).

Living and working in Norway (🇳🇴), originally from Slovenia (🇸🇮), and love exploring the globe (🌐).

Nurturing the web since 1999, this blog since 2019.