Hacker News new | ask | show | jobs
by Tostino 1084 days ago
What are you talking about? 7b parameter models run insanely fast if you can offload to gpu, and are entirely reasonable speed if CPU only.