They control for the data being in-distribution.
Their dataset also has examples of the problem being solved.