Hacker News new | ask | show | jobs
Making FlashAttention-4 faster for inference (modal.com)
2 points by matt_d 2 hours ago