| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by argonaut 3475 days ago

Two major errors: 1) This doesn't seem to be controlling overfitting on the right validation set. 2) There isn't a test set at all (separate from validation).

Using Keras' "validation_split" parameter will just randomly select a validation set. This is not the right thing to do when your data is image sequences, because you will get essentially identical data in training and validation.

Because of this, the numbers/plot here might as well be training accuracy numbers.

1 comments

mhanus 3475 days ago

Keras uses the end of the data set as validation, and only randomizes it if the "shuffle" argument is set to True [1].

[1]: https://keras.io/getting-started/faq/#how-is-the-validation-...

link

argonaut 3475 days ago

Except the second half of the data is the flipped of the first half (X = np.concatenate([X, X_flipped]))

link

wroscoe 3475 days ago

Well shit. Thanks for pointing that out. I'll revise.

link