Hacker News new | ask | show | jobs
by machinationu 171 days ago
speculative decoding is 1+1

transformer attention is integrals