Hacker News new | ask | show | jobs
by spaceship__sun 747 days ago
Have you tried gpt4o?
1 comments

Or Gemini 1.5 Pro. The latest multimodal models, while still far from perfect, do seem to be getting better at image recognition and OCR.