The approach I've seen is to prompt for people with unusual names, they're often be only a single source image in the input data set that gets reproduced by the AI.
I've seen examples with the AI "generated" images and the source image side by side - I'll try and find them.