by muraiki
1230 days ago
"The training, validation and test data sets are generated without overlap from periods in sequence. Successive periods of 400, 12, 40, 40 and 12 h are used to sample, respectively, training, validation, and test data, with the two 12 h periods inserted as hiatus."

A common split is train/validate/test, but all three end up being used during model development: train to fit the model, validate to track intermediate loss, and test to compare models.
What you want is a fourth, held-out test set that isn't looked at until publication time.
This paper has two test sets, but they have different data properties, and it's not clear they were held out until publication.
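
To make that concrete, here is a rough sketch of a sequential split with gaps plus a final block that stays untouched. The block order (train, gap, validation, gap, test), the 1200 h record length, the 180 h holdout, and all function names are my own assumptions for illustration; the quoted passage doesn't pin down the exact layout, and this is not the paper's code.

```python
# Hypothetical layout: (block name, length in hours). None marks a hiatus
# whose samples are discarded so adjacent blocks don't share correlated data.
LAYOUT = [("train", 400), (None, 12), ("validation", 40), (None, 12), ("test", 40)]


def sequential_split(n_hours, blocks):
    """Assign hour indices 0..n_hours-1 to named blocks, cycling through `blocks`."""
    out = {name: [] for name, _ in blocks if name is not None}
    t = 0
    while t < n_hours:
        for name, length in blocks:
            end = min(t + length, n_hours)
            if name is not None:
                out[name].extend(range(t, end))
            t = end
            if t >= n_hours:
                break
    return out


def split_with_holdout(n_hours, blocks, holdout_hours, gap_hours):
    """Reserve the last `holdout_hours` as a fourth set, separated by a gap."""
    usable = n_hours - holdout_hours - gap_hours
    splits = sequential_split(usable, blocks)
    # The holdout indices are only returned here; the point is that nothing
    # should read the corresponding data until the final evaluation.
    splits["holdout"] = list(range(n_hours - holdout_hours, n_hours))
    return splits


# Assumed record length of 1200 h: two full 504 h cycles, a 12 h gap,
# then a 180 h holdout at the end.
splits = split_with_holdout(1200, LAYOUT, holdout_hours=180, gap_hours=12)
```

Putting the holdout at the very end of the record is just one choice. What matters is that it is fixed before any modelling starts and only evaluated once.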