Hacker News
by wahnfrieden 246 days ago
Existing OCR doesn’t skip over entire (legible) paragraphs or hallucinate entire sentences.
3 comments

I usually run the image(s) through more than one converter then compare the results. They all have problems, but the parts they agree on are usually correct.
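For anyone who wants to try the comparison idea, here is a rough sketch of one way it could be done, assuming you already have two plain-text transcriptions of the same page. The function name `agreement` and the sample strings are made up for illustration; the word-level diff comes from Python's standard `difflib`, which is just one possible way to align the outputs and not necessarily what the commenter uses.

```python
from difflib import SequenceMatcher


def agreement(a: str, b: str):
    """Split two transcriptions into runs of words where they agree or differ.

    Returns a list of (label, text_from_a, text_from_b) tuples, where label
    is "agree" or "differ". Hypothetical helper, for illustration only.
    """
    words_a, words_b = a.split(), b.split()
    # autojunk=False so frequent words are not dropped from the alignment
    # on long inputs.
    matcher = SequenceMatcher(a=words_a, b=words_b, autojunk=False)
    segments = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        label = "agree" if tag == "equal" else "differ"
        segments.append(
            (label, " ".join(words_a[i1:i2]), " ".join(words_b[j1:j2]))
        )
    return segments


if __name__ == "__main__":
    # Made-up outputs from two different converters for the same line.
    first = "The quick brown fox jumps"
    second = "The quick brovvn fox jumps"
    for label, left, right in agreement(first, second):
        print(f"{label:6} | {left!r} | {right!r}")
```

The "differ" runs are the parts worth checking by hand against the original image. A run where one side is empty means one converter dropped text that the other kept.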
This rarely happens to me when using LLMs to transcribe PDFs.
This must be some older/smaller model.