Hacker News new | ask | show | jobs
by int_19h 425 days ago
Which SOTA LLM fails at tic-tac-toe?
1 comments

I don't know, but it's not a hard test, get the LLM to play a perfect game of tic-tac-toe against itself, look at the output and see if it goes wrong.