by whymauri 304 days ago
Papers have been running rollouts in which the model proposes N solutions and then self-reviews to choose the best one, all before anything reaches the verifier. So far, I think that's been counted as one pass.
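To make the counting concrete, here is a rough sketch of that kind of rollout. It is not taken from any particular paper: `propose`, `self_score`, and `verify` are hypothetical callables standing in for the model's sampler, the model's own review step, and the external verifier.

```python
from typing import Callable, List


def one_pass(
    propose: Callable[[str], str],            # hypothetical: samples one candidate solution
    self_score: Callable[[str, str], float],  # hypothetical: the model's own rating of a candidate
    verify: Callable[[str, str], bool],       # hypothetical: the external verifier
    problem: str,
    n: int,
) -> bool:
    """Sample n candidates, let the model pick its favourite, verify only that one."""
    # The model proposes N solutions.
    candidates: List[str] = [propose(problem) for _ in range(n)]
    # Self-review: the model, not the verifier, chooses the best candidate.
    best = max(candidates, key=lambda c: self_score(problem, c))
    # The verifier is called exactly once, which is why this gets counted as one pass.
    return verify(problem, best)
```

The verifier only ever sees one answer per problem, even though the model generated N of them internally, so the extra sampling shows up as compute rather than as additional passes.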