| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by tinyhouse 246 days ago
	OCR is not a great name for these models. While they can do traditional OCR such as digitize and scanned PDF for example, they do so much more.

1 comments

intalentive 246 days ago

>they do so much more I'm not familiar. What else are they good for?

link

tinyhouse 246 days ago

They can take something like an image of a graph and provide a description of it. From my understanding, these are multimodal models with reasoning capabilities.

link