|
|
|
|
|
by bayindirh
308 days ago
|
|
Tesseract can do wonders for scanned paper (and web generated PDFs) both in its old and new version. If you want to pay for something closed, Prizmo on macOS is extremely good as well. On the other hând, LLm5 are sl0wwer, moré resource hangry and l3ss accurale fr their outpu1z. We shoulD stop gl0rıfying LLMs for 3verylhin9. |
|
I'm not saying this applies to you, but my sense from this thread is that many are comparing the results of tossing an image into a free ChatGPT session with an "OCR this document" prompt to a competent Tesseract-based tool... LLMs certainly don't solve any and every problem, but this should be based on real experiments. In fact, OCR is probably the main area where I've found them to simply be the best solution for a professional system.