by jononor
2399 days ago
Sure, that demonstrates the issue.
The problem is the use of train_test_split(X, yy, test_size=0.2..): it assumes independent samples, which does not hold for this dataset because some clips are cut from the same source audio file. A random split can therefore put slices of one recording in both the training and the test set. The easiest (and completely acceptable) fix is to use one of the predefined folds as validation data, one fold as the test set, and the remaining folds for training. Unfortunately this problem is quite common even in academic papers using this dataset, even though the dataset authors warn about it.

EDIT: There is one more issue with the UrbanSound8K folds: their difficulty differs quite a bit. So ideally one should report performance across all folds (mean/std or a boxplot). But this is a minor issue compared to the data leakage.

PS: Nice use of the Comet.ml platform for this, collaborating online on improving the experimental setup :)
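To make the fold-based split concrete, here is a minimal sketch, not the code from the experiment being discussed. It assumes you have one fold id per sample (in UrbanSound8K this would come from the `fold` column of the metadata CSV, as far as I recall); `split_by_fold` and `iter_fold_splits` are hypothetical helper names, and the toy `folds` array only stands in for the real metadata.

```python
import numpy as np


def split_by_fold(folds, val_fold, test_fold):
    """Build boolean masks (train, val, test) from predefined fold ids.

    Whole folds are assigned to one set, so clips cut from the same
    source recording never end up on both sides of the split
    (given that the folds themselves keep recordings together).
    """
    folds = np.asarray(folds)
    if val_fold == test_fold:
        raise ValueError("val_fold and test_fold must differ")
    val = folds == val_fold
    test = folds == test_fold
    train = ~(val | test)
    return train, val, test


def iter_fold_splits(folds):
    """Yield (test_fold, (train, val, test)) with each fold as test set once.

    The validation fold is simply the previous fold id (wrapping around);
    that choice is an assumption of this sketch, not a convention.
    """
    ids = np.unique(folds)
    for i, test_fold in enumerate(ids):
        val_fold = ids[i - 1]
        yield test_fold, split_by_fold(folds, val_fold, test_fold)


# Toy stand-in for the per-sample fold ids (10 folds, 3 samples each).
folds = np.repeat(np.arange(1, 11), 3)

# Single split: fold 9 for validation, fold 10 for testing, rest for training.
train, val, test = split_by_fold(folds, val_fold=9, test_fold=10)

# All ten splits, for reporting mean/std of the test score across folds.
splits = list(iter_fold_splits(folds))
```

The masks index straight into the feature and label arrays, e.g. X[train] and yy[train]. Training once per entry in `splits` and collecting the test scores gives the per-fold numbers for a mean/std or boxplot. If you would rather stay within scikit-learn, `LeaveOneGroupOut` with the fold ids passed as `groups` produces the same kind of train/test partition, though without a separate validation fold.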