|
|
|
|
|
by bhy
937 days ago
|
|
My bet is a general search algorithm similar AlphaGo that uses LLM as world model, and heuristic search to find the right path to goal. There's already evidence in academics that this can significantly boost model performance [1]. [1] https://llm-mcts.github.io/ |
|