Hacker News
by earth2mars 976 days ago
It would be interesting to see how the ring attention technique affects this. This may still be valid for cost reasons, but unlimited context is like in-memory computing vs. traditional computing. https://arxiv.org/pdf/2310.01889.pdf