Using the validation set's mean/std after normalization as an indicator of how well it matches the training set?

Was thinking about how we normalize both the training and validation sets using the mean/std of the training set, and was wondering …

After normalization, can we infer how well our validation set reflects the training data by how close its mean is to 0 and its variance is to 1?

If it is way off, that would seem to indicate that our training set does not reflect what is in the validation set, and so our model is likely to generalize poorly.
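
For concreteness, here's a minimal sketch of what I mean (all synthetic data; the shifted validation distribution is just an assumption I made up so the effect is visible):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: the validation set is drawn from a shifted,
# wider distribution than the training set.
X_train = rng.normal(loc=0.0, scale=1.0, size=(1000, 3))
X_valid = rng.normal(loc=0.5, scale=1.5, size=(200, 3))

# Normalize BOTH sets with the training set's statistics.
mean, std = X_train.mean(axis=0), X_train.std(axis=0)
X_train_n = (X_train - mean) / std
X_valid_n = (X_valid - mean) / std

# The normalized training set is ~mean 0 / std 1 by construction;
# the validation set deviates in proportion to the distribution shift.
print("valid mean after normalization:", X_valid_n.mean(axis=0))
print("valid std  after normalization:", X_valid_n.std(axis=0))
```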

Does that intuition hold any weight?

Not really. See the Intro to Machine Learning course for ideas on how to test this properly.
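
One idea along those lines (often called adversarial validation; this sketch is my own, not code from the course) is to label rows by which set they came from and check whether a classifier can tell the sets apart:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
X_train = rng.normal(0.0, 1.0, size=(1000, 3))  # synthetic stand-ins
X_valid = rng.normal(0.5, 1.5, size=(200, 3))

# 0 = training row, 1 = validation row. If a classifier can't beat
# chance (AUC ~0.5), the two sets look alike; AUC near 1.0 means
# they are easy to tell apart, i.e. the distributions differ.
X = np.vstack([X_train, X_valid])
y = np.r_[np.zeros(len(X_train)), np.ones(len(X_valid))]

clf = RandomForestClassifier(n_estimators=100, random_state=0)
auc = cross_val_score(clf, X, y, cv=5, scoring="roc_auc").mean()
print(f"train-vs-valid AUC: {auc:.3f}")
```

Unlike a mean/std check, this catches differences in shape and feature interactions, not just the first two moments.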


Wow, I had completely missed that this course existed! Thank you