Hacker News new | ask | show | jobs
by tannhaeuser 87 days ago
For LLMs and other pure memory-bound workloads, but for eg. diffusion models their FPU SIMD performance is lacking.