Y
Hacker News
new
|
ask
|
show
|
jobs
by
tomasphan
71 days ago
I answered 8/10 correctly but mostly on instinct, for example betting that the Trump tweet is misleading. Opus 4.6 got 9/10 correct. You might need an internal time limit (don't show the user) and some strawberry questions.