Hacker News new | ask | show | jobs
StreamIndex: Memory-bounded compressed sparse attention via streaming top-k (arxiv.org)
4 points by OsamaJaber 33 days ago