Hacker News new | ask | show | jobs
by int_19h 836 days ago
$5M once, upfront. But given the significantly increased throughput, how fast does that pay for itself?
2 comments

You need computers for all of them and megawatts of power, power supplies, cooling, and power distribution.
Naturally, but you need that for GPUs as well, no? What is the actual difference when running, when measured per token generated?
Depends on power usage. I’m curious how power hungry those are compared to server/workstation cards.