I’ve been piloting frontier LLMs for as long as anyone outside of the labs, and I just disagree. It is a tier above for some tasks (especially in my usage) and not a downgrade on anything I tried it on. This is enough for me to rank it higher; ymmv.
I've only briefly tried it and it did seem quite capable for what I was doing, but not that much better than the Chinese models I've been mostly using.
In any case, this [0] seems to paint a more reasonable picture than "it's much better than anything else at everything".
Anecdotally, DeepSeek V4 is a very good model as well, sir. I'm not calling anything V4-class because of that.