Hacker News new | ask | show | jobs
Kimi K1.5: Scaling Reinforcement Learning with LLMs (arxiv.org)
4 points by anjneymidha 508 days ago