Hacker News new | ask | show | jobs
by hypercube33 63 days ago
Weird how you're leaving stuff like Strix Halo out. Also weird you think 128gb is the future with all of the research done to reduce that to something around 12GB being a target with all of these papers out now. I assume we'll end up with less general purpose models and more specific small ones swapped out for whatever work you are asking to do.
1 comments

Strix Halo hasn‘t got nearly enough bandwidth, its just 256bit.
It‘s sufficient for some MoE models.