by ACCount37 126 days ago
By now, lifetime inference compute for mainstream models is more than 10x their training compute. The training cost is amortized further by things like base model reuse.
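To put rough numbers on that, here is a back-of-the-envelope sketch. The only figure taken from the comment is the 10x ratio, used here as the boundary case. The function name `training_share`, the `models_sharing_base` parameter, and the value 4 are hypothetical, and the sketch assumes all training compute is shared base-model compute split evenly across the derived models, which is a simplification.

```python
def training_share(train_compute: float,
                   inference_compute: float,
                   models_sharing_base: int = 1) -> float:
    """Fraction of one model's lifetime compute that goes to training.

    Assumption: training compute is split evenly across every model
    built on the same base, and each model sees the same inference load.
    """
    amortized_train = train_compute / models_sharing_base
    return amortized_train / (amortized_train + inference_compute)


# Units are arbitrary; only the ratio matters.
# Inference at 10x training is the boundary of the ">10x" claim.
share_single = training_share(1.0, 10.0)

# Hypothetical: the same base model reused for 4 derived models.
share_reused = training_share(1.0, 10.0, models_sharing_base=4)

print(f"no reuse:        {share_single:.1%}")  # 9.1%
print(f"4 models share:  {share_reused:.1%}")  # 2.4%
```

So at 10x, training is at most about 1/11 of lifetime compute, and any reuse of the base model pushes that share lower still.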