Hacker News new | ask | show | jobs
by int_19h 839 days ago
Naturally, but you need that for GPUs as well, no? What is the actual difference when running, when measured per token generated?