by tedivm 849 days ago
The same model will not get better by having more processing power or time. However, that's not the full story.

Larger models generally perform better than smaller models (this is a generalization, but a good enough one for now). The problem is that larger models are also slower.

This ends up being a balancing act for model developers. A larger model could give better results, but the added latency may make for a worse user experience. Model size can also limit where the model can be deployed.
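
To put a rough number on the deployment point, here is a back-of-the-envelope sketch. It assumes 16-bit weights (2 bytes per parameter) and counts only the memory needed to hold the weights, ignoring activations and any caches, so real requirements are higher. The parameter counts and the 24 GB device are made-up illustrations, not tied to any particular model or hardware.

```python
def weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory needed just to hold the weights, in GB (1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


def fits(num_params: float, device_memory_gb: float, bytes_per_param: int = 2) -> bool:
    """True if the weights alone fit in the given device memory."""
    return weight_memory_gb(num_params, bytes_per_param) <= device_memory_gb


if __name__ == "__main__":
    # Hypothetical parameter counts and a hypothetical 24 GB device.
    for params in (1e9, 7e9, 70e9):
        size = weight_memory_gb(params)
        print(f"{params / 1e9:.0f}B params: ~{size:.0f} GB of weights, "
              f"fits in 24 GB: {fits(params, 24)}")
```

Storing weights at lower precision (a smaller bytes_per_param) shrinks the footprint, which is one reason the same size question can have different answers on different hardware.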