Hacker News new | ask | show | jobs
by bhy 937 days ago
My bet is a general search algorithm similar AlphaGo that uses LLM as world model, and heuristic search to find the right path to goal. There's already evidence in academics that this can significantly boost model performance [1].

[1] https://llm-mcts.github.io/