Hacker News new | ask | show | jobs
by DrewADesign 177 days ago
Yeah, but tests like that deliberately prod the boundaries of its capability rather than how well it does what it’s good at.