Hacker News new | ask | show | jobs
by Davidzheng 346 days ago
Well Alphaproof used test-time-training methods to generate similar problems (alphazero style) for each question it encounters.