Y
Hacker News
new
|
ask
|
show
|
jobs
by
steve-atx-7600
70 days ago
Inference from an LLM is O(tokens^2)
1 comments
halJordan
70 days ago
Only in the naive implementations of attention
link