Hacker News new | ask | show | jobs
by ekianjo 536 days ago
7b to 9b is usually what we call small. the rule of thumb is a model that you can run on a single GPU.