Hacker News
by nodja 202 days ago
Because success for them doesn't mean it works; it means it works much better than what they currently have. If a 1% improvement comes at the cost of spending 10x more on training and 2x more on inference, then the run counts as a failure. (Numbers pulled out of my ass.)
1 comments

That makes sense. It's not that the training didn't complete or that it returned a moronic model; it's that the capabilities have plateaued.