Y
Hacker News
new
|
ask
|
show
|
jobs
by
DrewADesign
177 days ago
Yeah, but tests like that deliberately prod the boundaries of its capability rather than how well it does what it’s good at.