Do we still need OCR? An implementation of a pure vision-based agent

Y	Hacker News new \| ask \| show \| jobs

	Do we still need OCR? An implementation of a pure vision-based agent (pageindex.ai)
	7 points by mingtianzhang 237 days ago

1 comments

We discuss the limitations of the classic OCR pipeline and provide a pure vision-based RAG system for document analysis (https://github.com/VectifyAI/PageIndex/blob/main/cookbook/vi...)

Any feedback is welcome!