Hacker News new | ask | show | jobs
Avatarl: Training langauge models from scratch with pure RL (tokenbender.com)
2 points by krkartikay 309 days ago