Hacker News new | ask | show | jobs
by mgraczyk 354 days ago
That doesn't matter, are you familiar with any theoretical results in which the computation is somehow limited in ways that practically matter when the context length is very long? I am not