| HN Mirror

	by ftxbro 1159 days ago
	> "We evaluate SELF-DEBUGGING on code-davinci-002 in the GPT-3 model family"

	Putting aside the incongruity of Google researchers using an OpenAI model, I'm curious how GPT-4 would do in this situation. Its zero-shot coding attempts would probably be better, and maybe its self-criticism would be better too.

1 comment

Google's recent LLM agent paper also used ChatGPT.