Hacker News new | ask | show | jobs
Reinforcement learning is all you need, for next generation language models (yuxili.substack.com)
5 points by zh217 1139 days ago