Hacker News new | ask | show | jobs
by taskforcegemini 180 days ago
They are using OCR for selecting plain text?
3 comments

It's possible to use the Gemini "ask me about this screen" to OCR the selected area of the screenshot. I guess that might be more efficient in some contexts then trying to use the native text select.
On iPhone too, taking a screenshot is the single reliable way to select text.
It becomes possible. Getting the handles to move correctly is still often a frustrating experience.
At least it's not AI... yet.
Multi-modal LLMs like Gemini are better than traditional OCR in most ways.
It is a poor person, sitting in a 3rd world country, thanscribing the text in your clipboard. See Alexa for details. /s

I'm only half joking.

There’s an API (Actually People Implemented) for that.