Y
Hacker News
new
|
ask
|
show
|
jobs
by
FuriouslyAdrift
28 days ago
I think the quote came out to $107k. 4 AMD MI300A's. Around 60k tokens per second, 512GB of GPU memory.
https://www.gigabyte.com/Enterprise/GPU-Server/G383-R80-AAP1
1 comments
disiplus
25 days ago
Which model are you running ?
link