Hacker News new | ask | show | jobs
by halJordan 241 days ago
Lpddr5x (not lpddr5) is 10.7 Gbps. Gddr7 is 32 Gbps. So it's going to be slower
1 comments

Yes but in matrix multiplication there are O(N²) numbers and O(N³) multiplications, so it might be possible that you are bounded by compute speed.
both are equally important. compute for prefill and mem bandwidth for generation