Hacker News new | ask | show | jobs
by htk 7 days ago
The M1 Max from 2021 has better memory bandwidth. The M3 Max can be specced to 128GB.

Nothing new here, apart from being able to use CUDA on a less power hungry system.

1 comments

The M1 Max has an unusably slow GPU for inference. TTFT on real-world contexts can be over 10 minutes.

> Nothing new here, apart from being able to use CUDA on a less power hungry system.

CUDA has been running on ARM SOCs since the Tegra K1, 12 years ago. Nvidia is not new to ARM, nor is CUDA.