Hacker News new | ask | show | jobs
by jeremyjh 27 days ago
> LLMs don't really create novel mathematics, they mostly "connect the dots".

That is not what the mathematicians are saying. I don't have the knowledge to evaluate this myself, but a number of mathematicians - for example, in the SP - are saying it goes further than that - they really do introduce novel ideas. Of course everything is based on and inspired by some previous work, but that is true of all human mathematics as well.

LLMs that have been trained through reinforcement learning on mathematics are NOT simply token predictors. Only base models can be accurately described that way. They have learned how to do mathematics. They have learned to do coding. Its really amazing we're three years into instruct models and such a large part of Hacker News still does not understand the most basic facts about this field.

1 comments

Reinforcement learning perturbs the model such that the token prediction process (inference) tends towards the desired result.