Hacker News new | ask | show | jobs
by janeway 1136 days ago
Seems like if it can eventually test that the output meets the criteria then it will excel.
1 comments

But when the code doesn't meet the requirements, the AI needs to know what's incorrect and what changes it needs to make, and that still requires a human. Unless you just put it into a loop and hope that it produces a working result eventually.
So what if you don't "just put it into a loop and hope" but actually make a complex AI agent with static code analysis capabilities, a graph DB, a work memory etc?

I'm doing just that and it works surprisingly well. Currently it's as good as people with 2-3 years of experience. Do you really believe it's not going to improve?

Now I'm making a virtual webcam so it has a face and you can talk to it on a Zoom meeting...

Do you have a presentable demo? LLM augmented by static code analysis sounds very interesting
I don't have GPT-4 API access yet... Using my ChatGPT Plus subscription so far. Will make a release once I get the API.