Hacker News new | ask | show | jobs
by bigstrat2003 19 days ago
Pretend? I don't have to pretend, I haven't seen any real improvement. I wouldn't let the models of today write code one bit more than the models of several years ago, because they still suck at it.
1 comments

Models several years ago would struggle to provide code that would compile, and need to be fed whatever errors were thrown to be able to resolve them.

Today's models often output working code. I've had OpenClaw instances one shot simple static web-page HTML, Apache installation, and deployment. It may not meet modern standards or be as secure as you'd like, but fundamentally this is an improvement from previous models.

Agreed, the "one-shot a static site" demo is the new "hello world" for agents. It's a real step up.
yep, they need something a bit more challenging than printing two words on the screen
:p