Hacker News new | ask | show | jobs
by mlyle 811 days ago
Coherency doesn't matter at all for a lot of cases. E.g. for inference, model weights are the vast majority of the memory use and are unchanging.