| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by janeway 1182 days ago
	Seems like if it can eventually test that the output meets the criteria then it will excel.

1 comments

RussianCow 1182 days ago

But when the code doesn't meet the requirements, the AI needs to know what's incorrect and what changes it needs to make, and that still requires a human. Unless you just put it into a loop and hope that it produces a working result eventually.

link

throwaway50606 1182 days ago

So what if you don't "just put it into a loop and hope" but actually make a complex AI agent with static code analysis capabilities, a graph DB, a work memory etc?

I'm doing just that and it works surprisingly well. Currently it's as good as people with 2-3 years of experience. Do you really believe it's not going to improve?

Now I'm making a virtual webcam so it has a face and you can talk to it on a Zoom meeting...

link

rapiz 1182 days ago

Do you have a presentable demo? LLM augmented by static code analysis sounds very interesting

link

throwaway50606 1182 days ago

I don't have GPT-4 API access yet... Using my ChatGPT Plus subscription so far. Will make a release once I get the API.

link