Hacker News
VideoGameBench from Princeton: Can vision-language models play 90s video games? (vgbench.com)
6 points by ofirpress 387 days ago
1 comment

Wow, so without scaffolding the LLMs can't solve any of these games... Super cool work!