|
|
|
|
|
by npn
10 days ago
|
|
I bought one AMD MI50 32GB back then when they were sold rather cheap (around $150-$170). it can easily generate over 70 tokens per second for gemma 4 26B moe model (q4). I have no doubt that we will have another wave of cheap retired server gpus just like before. And that is the time when everyone will have their own models at their home. Or we can just buy the newest medusa halo mini pc. they will be pretty decent, too, albeit pricey. |
|