Hacker News
by fennecbutt 406 days ago
I thought flash attention was required for quantised KV?