What I've discovered is that different models are simply better at different things, and it's very hard to predict which one will be best at a given task. So no two people have the same experience, because no one is ever doing the same things the same way.