Hacker News new | ask | show | jobs
by constantinum 335 days ago
LLMs are not yet there for complex and diverse document parsing use cases, especially at an enterprise scale (processing millions of pages).

Some of the reasons are:

Complex layouts, nested tables, tables spanning multiple pages, checkboxes, radio-buttons, off-oriented scans, controlling LLM costs, checking hallucinations, Human-in-the-loop integration, and privacy.

More on the issues: https://unstract.com/blog/why-llms-struggle-with-unstructure...