by anandnair 483 days ago
We don't even need that example. The example is right in front of us. Take a model with fewer parameters and ask it to do the same complex thing that a larger model did. It will struggle.

Btw, I'm not saying it's just the number of parameters that matters.