I think you will see much less "disagreement" between AIs than between doctors. I.e., subjectivity of diagnosis can be much more easily accounted for in an algorithm than in a person...
If you train all of them on the same data, you will get similar answers. That doesn't mean that those answers will be more right than a less sure doctor.