Hacker News
by olliepro 108 days ago
I bet they lack good long-context training data and need to start a flywheel of collecting it through their API (from willing customers).
1 comment

This would be my guess too. It can probably be generated synthetically or via agentic rollouts, but high-quality long-context examples, where the output meaningfully depends on long-range interactions, probably remain scarce.