Hacker News new | ask | show | jobs
by kibibu 618 days ago
Am I looking at the wrong table? It dominates everything on visual interpretation benchmarks.

Edit: specifically ocrbench and VQAv2