|
|
|
|
|
by bonoboTP
446 days ago
|
|
LLMs for OCR is super risky because just as much as they can fix OCR mistakes, they can inadvertently "fix" correct stuff too and hallucinate instead. Its that xerox bug on steroids, where scanned pages would get their digits swapped by other digits... I'd want to see some proper hallucination analysis. |
|
https://arxiv.org/pdf/2405.15306
Most OCR pipelines like this, along with excellent commercial ones like doctly.ai, are focused on OCR for LLM consumption - while I’d like to be able to recreate the original scientific work that predates digital typesetting in modern typeset - for yes LLM but also to preserve and promote science of yore, much of which includes discoveries forgotten but relevant still to problems we face today.