Y
Hacker News
new
|
ask
|
show
|
jobs
by
taskforcegemini
180 days ago
They are using OCR for selecting plain text?
3 comments
aoeusnth1
180 days ago
It's possible to use the Gemini "ask me about this screen" to OCR the selected area of the screenshot. I guess that might be more efficient in some contexts then trying to use the native text select.
link
eastbound
180 days ago
On iPhone too, taking a screenshot is the single reliable way to select text.
link
throwaway894345
180 days ago
It becomes possible. Getting the handles to move correctly is still often a frustrating experience.
link
AlienRobot
180 days ago
At least it's not AI... yet.
link
xnx
180 days ago
Multi-modal LLMs like Gemini are better than traditional OCR in most ways.
link
hulitu
179 days ago
It is a poor person, sitting in a 3rd world country, thanscribing the text in your clipboard. See Alexa for details. /s
I'm only half joking.
link
doubled112
179 days ago
There’s an API (Actually People Implemented) for that.
link