Hacker News new | ask | show | jobs
by ehsankia 1641 days ago
The word "Algo" here is definitely awkward. The point is though that what matters most here is the number of parameters, as those correlate quite closely with training and inference time. Storage space is pretty trivial, but TPU cycles are less so.
1 comments

Thanks for this. Rereading + your comment and I think I have a better understanding of why this is progress.