|
|
|
|
|
by Etheryte
246 days ago
|
|
I'm not really sure if that's an accurate summary of the state of the art, [0] is a better overview. In short, SOTA multi-modal LLMs are the best option for handwriting, nearly anything is good at printed text, for printed media, specialty models from hyperscalers are slightly better than multi-modal LLMs. [0] https://research.aimultiple.com/ocr-accuracy/ |
|