by hislaziness 597 days ago
As I understand it, the LLM uses the techniques of Searchformer (https://arxiv.org/abs/2402.14083) to do "slow thinking": it performs an A* search using a transformer.
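For anyone unfamiliar with A*, here is a minimal sketch of the classical algorithm on a small grid. It is not Searchformer's code: the function name `astar`, the grid encoding (0 = free, 1 = wall), and the `trace` format are assumptions of mine for illustration. As far as I can tell from the paper, the transformer is trained to emit this kind of step-by-step search trace as tokens before the final plan, but treat that reading as hedged.

```python
import heapq


def astar(grid, start, goal):
    """A* on a 4-connected grid with unit step costs.

    Returns (path, trace). path is a list of (row, col) cells from start
    to goal, or None if the goal is unreachable. trace is a hypothetical
    log of search steps, loosely in the spirit of an execution trace.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on this grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    frontier = [(h(start), 0, start)]  # entries are (f = g + h, g, cell)
    g_cost = {start: 0}
    parent = {start: None}
    closed = set()
    trace = []

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale heap entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g))

        if cell == goal:
            # Walk parent pointers back to the start.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace

        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < rows and 0 <= nc < cols):
                continue  # off the grid
            if grid[nr][nc] == 1:
                continue  # wall
            ng = g + 1
            if ng < g_cost.get(nxt, float("inf")):
                g_cost[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng))

    return None, trace
```

Calling `astar([[0, 0, 0], [1, 1, 0], [0, 0, 0]], (0, 0), (2, 0))` returns the path around the wall plus the list of "create" and "close" steps taken to find it.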