Hacker News
by kibibu 618 days ago
Am I looking at the wrong table? It dominates everything on visual interpretation benchmarks.
Edit: specifically OCRBench and VQAv2.