Hacker News new | ask | show | jobs
by imtringued 808 days ago
Most important? The idea that not every token needs the full context window should be an obvious optimization.
1 comments

that’s not the idea here