Hacker News new | ask | show | jobs
by cs702 1159 days ago
They're doing both. Quoting from Figure 1, "the model first generates new code, then the code is executed and the model explains the code. The code explanation along with the execution results constitute the feedback message, which is then sent back to the model to perform more debugging steps. When unit tests are not available, the feedback can be purely based on code explanation."
1 comments

So only evaluating output with unit tests - a fitness function. AITDD.