by gostsamo 711 days ago
It is in my screen reader, but I use it mainly in the browser, for accessible descriptions of images without alt text. Translation from languages which I don't know is also nice.

Generally, I think that the LLM should be its own service and everything else should have an easy way to connect to it, but I'm a lowly user, not a product manager.


For accessibility it is truly awesome. Yes, it can be wrong and isn't perfect, but the alternative is having no knowledge of the image at all. Alt descriptions are often missing or not detailed enough to be useful for the vision impaired.

Having much better text-to-speech could also be nice for the blind. While screen readers are fine, I don't know how bothersome that robot voice is for longer texts.

I'm okay with robot voices, tbh. The neural voices need modern hardware, and there is latency even then, together with some artifacts. Especially when speeding through known screens, I prefer responsiveness over fidelity or other niceties.