Hacker News new | ask | show | jobs
by WiSaGaN 520 days ago
If what here says is true: https://x.com/teortaxesTex/status/1880768996225769738, then R1 may as well just be the better model. You can scale up R1 with lower token count to achieve better than o1 high results.