Stones, water

Automating alt text generation for all blog images

Yesterday, I wrote about how I cleaned up a lot of old mess on this blog. That made me realize that I have not paid attention to creating alt text for all my images, that is, text that describes the content of each image. Writing it by hand is an enormous job when you have several thousand images on a blog like this. So I decided to ask CoPilot for help.

Getting AI help

CoPilot first suggested creating alt text based on the image file names. That could have been a good idea if the filenames had been descriptive. However, in general they are not, so I asked for a solution that would actually analyse the content of the images. After a series of iterations, we (CoPilot and I) ended up with a Python script that does the job. ...

December 31, 2025 · 3 min · 535 words · ARJ
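
Before generating any alt text, it helps to know which images lack it. The following is a minimal sketch of that first step only, not the script described in the post above. It assumes the posts are Markdown files that use the standard `![alt](path)` image syntax; the function name `find_images_missing_alt` and the sample file names are hypothetical.

```python
import re

# Matches Markdown image syntax: ![alt](path "optional title")
# Assumption: posts are plain Markdown; HTML <img> tags are not handled.
IMAGE_PATTERN = re.compile(r'!\[(?P<alt>[^\]]*)\]\((?P<path>[^)\s]+)[^)]*\)')


def find_images_missing_alt(markdown_text):
    """Return the paths of images whose alt text is empty or only whitespace."""
    return [
        match.group("path")
        for match in IMAGE_PATTERN.finditer(markdown_text)
        if not match.group("alt").strip()
    ]


# Hypothetical sample content, for illustration only.
sample = (
    "![](stones.jpg)\n"
    "![A hand drops a stone into water](drop.jpg)\n"
    '![ ](water.png "Title")'
)
print(find_images_missing_alt(sample))  # ['stones.jpg', 'water.png']
```

The resulting list of paths could then be handed to whatever tool analyses the image content.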

An object-action-context approach to writing alt text

I came across an interesting blog post by Alex Chen on how to write better image descriptions for web pages. They propose an “object-action-context” approach to writing image descriptions. I think such an approach could also be helpful for my sound actions project.

Adding better descriptions

I am nearing the end of my year-long project of recording one sound action daily. A sound action is a multimodal entity consisting of body motion and its resultant sound. My starting point is that when we only see a sound action, we can imagine its sound. If we only hear a sound action, we can imagine the body motion and objects involved in the interaction. Here is the playlist with all the recorded sound actions: ...

December 22, 2022 · 3 min · 482 words · ARJ
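
One way to picture the “object-action-context” idea is as a small template that joins the three parts into a sentence. This is only a sketch under my own assumptions: the class name `ImageDescription`, its field names, and the ordering of the parts are hypothetical and not taken from the blog post referred to above, and the example description is invented for illustration.

```python
from dataclasses import dataclass


@dataclass
class ImageDescription:
    """Hypothetical container for an object-action-context description."""
    obj: str           # what is shown, e.g. "a hand"
    action: str        # what it is doing, e.g. "drops a stone"
    context: str = ""  # where or how, e.g. "into a bowl of water"

    def to_alt_text(self) -> str:
        # Join the non-empty parts in object, action, context order.
        parts = [self.obj.strip(), self.action.strip(), self.context.strip()]
        sentence = " ".join(part for part in parts if part)
        if not sentence:
            return ""
        # Capitalize the first letter and end with a full stop.
        return sentence[0].upper() + sentence[1:] + "."


# Invented example, for illustration only.
description = ImageDescription("a hand", "drops a stone", "into a bowl of water")
print(description.to_alt_text())  # A hand drops a stone into a bowl of water.
```

Keeping the three parts separate makes it easy to see when one of them is missing from a description.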