by
tinyhouse
246 days ago
OCR is not a great name for these models. While they can do traditional OCR, such as digitizing a scanned PDF, they do so much more.
intalentive
246 days ago
> they do so much more
I'm not familiar. What else are they good for?
tinyhouse
246 days ago
They can take something like an image of a graph and provide a description of it. From my understanding, these are multimodal models with reasoning capabilities.