|
|
|
|
|
by dragonwriter
1522 days ago
|
|
> I know you probably just want to be annoying, but really, there is a world of difference between completely synthetic and anonymized data. No, I’ve spent ~20 years in healthcare, with this issue as a frequently recurring issue. > No, in actual practice you don't scrub the stuff you actually need to test. In actual practice, the stuff you really need to test often overlaps with the stuff minimally required to scrub to legally deanonymize the data. The most common scenario I’ve seen trying to do this is both creating most of the work of generating synthetic data and failing to legally deidentify the source data. |
|