Hacker News new | ask | show | jobs
Do we still need OCR? An implementation of a pure vision-based agent (pageindex.ai)
7 points by mingtianzhang 237 days ago
1 comments

We discuss the limitations of the classic OCR pipeline and provide a pure vision-based RAG system for document analysis (https://github.com/VectifyAI/PageIndex/blob/main/cookbook/vi...)

Any feedback is welcome!