Hacker News new | ask | show | jobs
by treesciencebot 921 days ago
> One of the biggest bottlenecks is memory bandwidth. That is also not cheap or simple to do.

This is precisely why people are trying to put logic into memory instead of just making the logic chips simpler. Compute being 10x faster doesn't mean much when you want real-time, near-zero latency in the current day (and potentially, future) ML workloads. Memory bandwith for low batches are much more important, and even though this chip comes with HBM3E (which is cutting edge), that by itself won't make this faster than H200/MI300X.