by astrange 818 days ago
Facebook had a paper about "System 2 Attention" for LLMs, where the model first rewrites the input to drop the parts that would be distracting, and then answers using only the cleaned-up version.

https://arxiv.org/abs/2311.11829
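
Roughly, the idea is two calls instead of one. Here's a minimal sketch, not the paper's actual code: `llm` is a hypothetical stand-in for whatever text-in/text-out model call you have, and the prompt wording is my own assumption rather than the prompts from the paper.

```python
from typing import Callable


def answer_with_filtered_context(
    llm: Callable[[str], str],  # hypothetical: any function mapping prompt -> completion
    context: str,
    question: str,
) -> str:
    """Two-pass prompting: filter the context first, then answer from the filtered text."""
    # Pass 1: ask the model to rewrite the context, keeping only relevant material.
    # (Prompt wording is an assumption, not taken from the paper.)
    rewrite_prompt = (
        "Rewrite the following text, keeping only the parts that are relevant "
        "to answering the question. Leave out opinions and unrelated details.\n\n"
        f"Text: {context}\n\n"
        f"Question: {question}\n\n"
        "Relevant text:"
    )
    filtered_context = llm(rewrite_prompt)

    # Pass 2: answer using only the rewritten context, so the original
    # distracting text never appears in the prompt that produces the answer.
    answer_prompt = (
        f"Context: {filtered_context}\n\n"
        f"Question: {question}\n\n"
        "Answer:"
    )
    return llm(answer_prompt)
```

The point of the second call is that the distracting text isn't in the prompt at all, so the model can't attend to it. The cost is an extra model call per query.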