by ftxbro 1159 days ago
> "We evaluate SELF-DEBUGGING on code-davinci-002 in the GPT-3 model family"

Putting aside the incongruity of Google researchers using an OpenAI model, I'm curious how GPT-4 would do in this situation. Its zero-shot coding attempts would probably be better, and maybe its self-criticisms would be better too.

1 comment

Google's recent LLM agent paper also used ChatGPT.