|
|
|
|
|
by sdenton4
158 days ago
|
|
The quadratic attention problem seems to be largely solved by practical algorithmic improvements. (Iterations on flash attention, etc.) What's practically limiting context size IME is that results seem to get "muddy" and get off track when you have a giant context size. For a single-topic long session, I imagine you get a large number of places in the context which may be good matches for a given query, leading to ambiguous results. I'm also not sure how much work is being put into reinforcement in extremely large context inference, as it's presumably quite expensive to do and hard to reliably test. |
|
Perfect for a demo or work on a single self contained file.
Disastrous for a large code base with logic scattered all throughout it.