by ruborcalor 2292 days ago
I apologize for the confusion; I've since removed the typo.

The data is split only once, before the correlation test is used to select the features the model is trained on. As far as I can tell, no data snooping occurs, because the data is split into train and test sets before any decisions are made.
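
To make the ordering concrete, here is a rough sketch of "split first, then select features on the training rows only". It is not the code from the project: the synthetic data, the helper names (`pearson`, `split_once`, `select_features`), the 25% test fraction, and the choice of Pearson correlation with top-k selection are all assumptions made for illustration.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return 0.0 if sx == 0 or sy == 0 else cov / (sx * sy)


def split_once(rows, targets, test_fraction=0.25, seed=0):
    """Shuffle the row indices once and cut them into train and test parts."""
    idx = list(range(len(rows)))
    random.Random(seed).shuffle(idx)
    n_test = int(len(idx) * test_fraction)
    test_idx, train_idx = idx[:n_test], idx[n_test:]

    def pick(ids):
        return [rows[i] for i in ids], [targets[i] for i in ids]

    return pick(train_idx), pick(test_idx)


def select_features(train_rows, train_targets, k):
    """Rank features by |correlation| with the target, using training rows only."""
    n_features = len(train_rows[0])
    scores = []
    for j in range(n_features):
        column = [row[j] for row in train_rows]
        scores.append((abs(pearson(column, train_targets)), j))
    scores.sort(reverse=True)
    return sorted(j for _, j in scores[:k])


# Hypothetical synthetic data: 5 features, only feature 1 drives the target.
rng = random.Random(42)
rows = [[rng.gauss(0, 1) for _ in range(5)] for _ in range(200)]
targets = [2.0 * r[1] + 0.1 * rng.gauss(0, 1) for r in rows]

# Step 1: split once. Step 2: choose features from the training part only.
(train_rows, train_targets), (test_rows, test_targets) = split_once(rows, targets)
selected = select_features(train_rows, train_targets, k=1)

# The test rows are only ever projected onto the already-chosen features.
test_selected = [[row[j] for j in selected] for row in test_rows]
```

The point of the sketch is that `select_features` never receives `test_rows` or `test_targets`. Snooping would occur if the correlations were computed on the full `rows` before calling `split_once`.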