Hacker News new | ask | show | jobs
by ml-anon 5 days ago
there is this bullshit term you hear paraded about the dwarkesh-adjacent circles: "capability overhang". Aside from being effectively meaningless jargon, there is a kernel of an idea that somehow the models are far more capable than what "normies" use them for.

Well, I think Siri AI puts this notion firmly to rest. Yes, if you have unlimited tokens and well-posed problems you can solve open Erdos problems. However, if you have meaningful real-world computational and reliability constraints then you better just stick to "summarize my messages and find the dogs in my photos".

And this isn't just Gemini, I can burn effectively unlimited Opus tokens and still get garbage code out or be run around in circles without very diligent oversight.