Y
Hacker News
new
|
ask
|
show
|
jobs
by
Davidzheng
346 days ago
Well Alphaproof used test-time-training methods to generate similar problems (alphazero style) for each question it encounters.