Hacker News
Avatarl: Training language models from scratch with pure reinforcement learning (tokenbender.com)
2 points by haneefmubarak 307 days ago