Hacker News new | ask | show | jobs
by jbergqvist 60 days ago
"Helped build itself" is a bit of a stretch here, it makes it sound as if the model was doing lasting self-improvements.

What the article describes is that the model was able to tweak to its own deployment harness (memory, skills, experimental loop etc) to improve performance on benchmarks. While impressive, it's not doing any modifications to its own weights by e.g. modifying the training code.

1 comments

By this standard, Claude "helped build itself" since Claude Code is 100% vibe coded. Not sure if this also applies somewhat to ChatGPT and Codex.