Hacker News new | ask | show | jobs
by Alex-Programs 191 days ago
I think using total parameters is fair, it correlates well with the RAM prerequisites to run it. Otherwise Kimi K2 would be "small" despite being a trillion parameters!
1 comments

VRAM doesn't matter if you are using API. Price and performance is what matters.