Hacker News new | ask | show | jobs
by wolfgangK 321 days ago
Indeed, recent Flash Attention is a pain point for non CUDA.