Hacker News new | ask | show | jobs
by simianwords 105 days ago
why do you think this falsifies that it can't reason?
1 comments

i ran the benchmark without the valid moves tool as well as the three mistakes grace and gpt-5.4 holds well. it can achieve 1000 ELO which is much higher than my own.

this clearly tells me that GPT is good at chess, at least better than a normal person who has played ~30-40 games in their life.