Hacker News new | ask | show | jobs
Structured OCR with GPT Vision (binal.pub)
1 points by binalpatel 915 days ago
1 comments

Here's a direct link to the code: https://github.com/caesarnine/llm-experiments/blob/main/3_st...

Just a Pydantic model to define an extraction schema + a simple prompt works reasonably well.