Hacker News new | ask | show | jobs
by doctoboggan 992 days ago
No, what I am specifically asking about is these sliding window attention techniques. As far as I understand it Claude 100K actually uses a 100k context window, and not a sliding window.