https://arxiv.org/abs/2509.06503
They set up scoreable computational science problems and do search over solutions.