Y
Hacker News
new
|
ask
|
show
|
jobs
by
dandanua
27 days ago
The story of Flash Attention is the best manifestation of power and difficulty of GPU programming. This page gives a nice overview of it
https://aiwiki.ai/wiki/flash_attention