Hacker News new | ask | show | jobs
by TechDebtDevin 459 days ago
VRAM Required = Number of Parameters (in billions) × Number of Bytes per Parameter × Overhead[0].

[0]: https://twm.me/posts/calculate-vram-requirements-local-llms/

2 comments

Don’t forget to add a lot of extra space if you want a usable context size.
Wouldn't that be your overhead var
Thats neat! thanks