Hacker News new | ask | show | jobs
by wlamond 3299 days ago
The data should have been split into train, validation, and test sets before augmentation was applied. As it is now, there are likely examples in the validation and test sets that are highly similar to examples in the training set. The goal of validation should be to measure performance on previously unseen patients, not previously unseen scans of patients you've already trained on.