Hacker News
by jackxlau 3 days ago
In my own testing, peak performance usually happens within 15-20% of the intended context limit, although there are a few optimizations depending on the task quality.