Hacker News new | ask | show | jobs
DatBench: Discriminative, faithful, and efficient VLM evaluations (arxiv.org)
18 points by circuithunter 168 days ago