Hacker News new | ask | show | jobs
by silveraxe93 355 days ago
It's ironic how people write this without a shred of reasoning. This is just _wrong_. LLMs are not simply token prediction machines since GPT-3.

During pre-training, yeah they are. But there's a ton of RL being done on top after that.

If you want to argue that they can't reason, hey fair be my guest. But this argument keeps getting repeated as a central reason and it's just not true for years.