In general that's not really surprising. I remember discussions from some years ago about larger networks leading to smother loss surfaces.