The vast gulf between open weights and frontier models that existed 6 months ago has suddenly disappeared?
It's far more likely you're just bad at assessing model output.