Hacker News new | ask | show | jobs
by Workaccount2 253 days ago
Getting gold in the math Olympiad is a pretty strong indicator of operating independently on substantive problems.

A year ago they need an extensive harness to get silver, and two years ago they could hardly multiply 1000x10000.

Terence Tao tweeted yesterday about using GPT5 to help quickly solve a problem he was working on.

1 comments

Yes but why did ChatGPT work on math Olympiad problems? Because it got a prompt giving it the instruction and context etc.

Why did GPT5 help Terence Tao solve a math problem, because he gave it a prompt and the context etc.

None of these models are useful without a human prompting them and giving it tasks, goals, context etc, they don't operate independently, they don't get ideas of work to be done, they don't operate over long time horizons, they can't accept long term goals and sub-divide those goals into sub goals, and sub tasks etc.

They are useless without humans telling them what to do.

Why don't you stick them in a robot, give them agency, continuously train them, and see what happens? Be careful what you ask for.
You should see what happens when you let them talk to each other
Errors compound? Context drift?
Try it, and let them pick the topic. Though they will probably pick AI development, mysteriously it seems to be their favorite topic...