Hacker News
by alexsubq 38 days ago
What do you want in a whitepaper that was not in our blog post? There is time to add more before the whitepaper is released.
1 comment

I'm not GP, but I would want a benchmark that actually tests the entire context window. A benchmark that only tests the first 128K tokens tells us effectively nothing about how well the model works at its full capacity.
That makes sense! We are working on that.