This is their comparison point for actual radiologists. Citation number 6. It doesn't look comparable, though. Radiologists are around 90% specificity and sensitivity, which varies a good amount from the model's 77.3% and 87%, respectively.
This is not on this dataset though (right?), so not really a solid comparison point. Plus lik you mentioned, they seem to be doing worse than this benchmark.