Hacker News new | ask | show | jobs
by Tepix 236 days ago
> Blows away any consumer GPU.

Nah. Do you have 1st hand experience with Strix Halo? At less than 1600€ for a 128GB configuration it manages >45 tokens/s with gpt-oss 120b. Which is faster than DGX Spark at a fraction of the cost.

1 comments

Strix Halo has awful token prefill speed. Only suitable for very small contexts.