by xeckr 154 days ago
>As an example, ask an LLM to do some 10th grade math. Inspect the thinking process. It can regurgitate the process and the rules but cannot perform them.

It seems to me that the solution is just to use RL to train the language model to delegate the actual calculation to the appropriate tool.
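
To make the delegation idea concrete, here's a rough Python sketch of the tool side only. Everything in it is my own assumption for illustration: the `calculator` tool, the `TOOLS` registry, `run_tool_call`, and the `{"tool": ..., "input": ...}` call format are hypothetical and not any particular vendor's API. The RL part (training the model to emit such a call instead of doing the arithmetic in its head) is not shown.

```python
import ast
import operator

# Hypothetical tool: a small arithmetic evaluator that walks the AST
# instead of calling eval(), so only plain arithmetic is accepted.
_BIN_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.Pow: operator.pow,
}
_UNARY_OPS = {ast.USub: operator.neg, ast.UAdd: operator.pos}


def calculator(expression: str):
    """Evaluate an arithmetic expression outside the model."""

    def _eval(node):
        # Numeric literals only (bool is excluded even though it subclasses int).
        if (
            isinstance(node, ast.Constant)
            and isinstance(node.value, (int, float))
            and not isinstance(node.value, bool)
        ):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _BIN_OPS:
            return _BIN_OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY_OPS:
            return _UNARY_OPS[type(node.op)](_eval(node.operand))
        raise ValueError("unsupported expression")

    return _eval(ast.parse(expression, mode="eval").body)


# Hypothetical registry mapping tool names to callables.
TOOLS = {"calculator": calculator}


def run_tool_call(call: dict):
    """Dispatch an (assumed) model-emitted tool call to the matching tool."""
    return TOOLS[call["tool"]](call["input"])
```

In this setup the model would only need to produce something like `{"tool": "calculator", "input": "12 * (7 + 5)"}`, and the harness would feed the returned value back into the context. The model still has to set up the expression correctly, which is the part RL would be rewarding.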