| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by dismalaf 350 days ago
	Tic-tac-toe is solved and a draw can be forced 100% of the time...

2 comments

miller24 350 days ago

That's exactly why it's so crazy that GPT-5 with Thinking still loses...

link

dismalaf 350 days ago

Ah, your first comment said "can't win". Which is different than "always loses".

link

miller24 350 days ago

Ah okay, well it will still lose some of the time, which is surprising. And it will lose in surprising way, e.g., thinking for 14 seconds and then making an extremely basic mistake like not seeing it already have two on a row and could just win.

link

HappMacDonald 350 days ago

.. and you can "program" a neural network — so simple it can be implemented by boxes full of marbles and simple rules about how to interact with the boxes — to learn by playing tictactoe until it always plays perfect games. This is frequently chosen as a lesson in how neural network training even works.

But I have a different challenge for you: train a human to play tictactoe, but never allow them to see the game visually, even in examples. You have to train them to play only by spoken words.

Point being that tictactoe is a visual game and when you're only teaching a model to learn from the vast sea of stream-of-tokens (similar to stream-of-phonemes) language, visual games like this aren't going to be well covered in the training set, nor is it going to be easy to generalize to playing them.

link

gowld 339 days ago

tic-tac toe is merely a visualization of a small arithmetic game "sum 3 digits to 15"

   618   
   753
   294

link

miller24 350 days ago

Well whatever your story is, I know with near certainty that no amount of scaffolding is going to get you from an LLM that can't figure out tic-tac-toe (but will confidently make bad moves) to something that can replace a human in an economically important job.

link

bwfan123 349 days ago

llm maximalists' apologies:

- but tokens are not letters - but humans fail too - just wait, we are on an S curve to AGI - but your prompt was incorrect - but I tried and here it works

Meanwhile, their claims:

- LLMs are performing at PhD levels. - AGI is around the corner - humanity will be wiped out - situational awareness report

link