by astrange 537 days ago
It's not trained on its own output. You can generate an unlimited number of correctly worked-out math traces and train on those.
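
To make that concrete, here is a rough sketch of what generating such traces could look like. It is only an illustration: the function names (`addition_trace`, `traces`), the trace format, and the choice of column addition are all my assumptions, not how any particular model's training data is actually built. The point is just that each trace is checked against exact arithmetic, so the data is correct by construction rather than sampled from a model.

```python
import random


def addition_trace(a: int, b: int) -> str:
    """Return a step-by-step column-addition trace for a + b (hypothetical format)."""
    xs, ys = str(a)[::-1], str(b)[::-1]  # least-significant digit first
    steps, digits, carry = [], [], 0
    for i in range(max(len(xs), len(ys))):
        da = int(xs[i]) if i < len(xs) else 0
        db = int(ys[i]) if i < len(ys) else 0
        total = da + db + carry
        steps.append(
            f"{da} + {db} + carry {carry} = {total}: "
            f"write {total % 10}, carry {total // 10}"
        )
        digits.append(str(total % 10))
        carry = total // 10
    if carry:
        digits.append(str(carry))
    result = int("".join(reversed(digits)))
    # Verify the worked trace against exact integer arithmetic.
    assert result == a + b
    return "\n".join([f"{a} + {b}", *steps, f"answer: {result}"])


def traces(seed: int = 0):
    """Yield an endless stream of verified traces for random operands."""
    rng = random.Random(seed)
    while True:
        yield addition_trace(rng.randrange(10**6), rng.randrange(10**6))
```

For example, `print(addition_trace(58, 67))` prints the two column steps followed by `answer: 125`, and `next(traces())` keeps producing fresh verified examples for as long as you call it.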