Hacker News new | ask | show | jobs
by CuriouslyC 1119 days ago
Parameters don't have diminishing returns so much as we don't have enough (distinct) data to train models to use that many parameters efficiently.