Have you done the reverse? In my experience, models will always find something to criticize in another model's work.
But I've had the best results with GPT 5.4.