What? Coding is like the one thing that RL can do without any further human input because there is a testable provable ground truth; run the code.