Hacker News new | ask | show | jobs
by taneq 697 days ago
Huh, so MCTS to find the ‘best’ token using a (relatively) small, quick language model? Sounds like an interesting approach to small model text generation too…