by FuriouslyAdrift 28 days ago
I think the quote came out to $107k. 4 AMD MI300As. Around 60k tokens per second, 512GB of GPU memory.

https://www.gigabyte.com/Enterprise/GPU-Server/G383-R80-AAP1

1 comment

Which model are you running?