Hacker News
by dhon_ 484 days ago
I've seen Gemini Flash 2 mention "in the OCR text" when responding to VQA tasks, which makes me question whether they have a traditional OCR process mixed into the pipeline.