Hacker News new | ask | show | jobs
by imtringued 840 days ago
Reinforcement learning is a completely different strategy compared to how most LLMs work.