Hacker News new | ask | show | jobs
by tarruda 648 days ago
Since most LLMs are released as FP16, just the number of parameters is enough to know the total required GPU RAM.