You might be wondering why we must create a third dataset. Couldn't we just use the test set for this purpose? The idea is that when we test the model, it should look at data that it has truly never seen before. Even though the model doesn't use the validation set to update its weights, our model selection process is biased in favor of the validation set: we keep whichever model scores best on it, so that score is an optimistic estimate of real performance. Thus, we need three separate sets of data.
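To make the three-way split concrete, here is a minimal sketch in plain Python. The function name `three_way_split`, the 70/15/15 proportions, and the fixed seed are assumptions chosen for illustration, not values taken from the text.

```python
import random

def three_way_split(items, val_frac=0.15, test_frac=0.15, seed=0):
    """Shuffle items and cut them into disjoint train/validation/test lists.

    The fractions and seed are illustrative defaults (assumptions).
    """
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is reproducible

    n_test = round(len(shuffled) * test_frac)
    n_val = round(len(shuffled) * val_frac)

    test = shuffled[:n_test]                # touched once, for the final score
    val = shuffled[n_test:n_test + n_val]   # used to compare candidate models
    train = shuffled[n_test + n_val:]       # used to update the weights
    return train, val, test

train, val, test = three_way_split(range(100))
print(len(train), len(val), len(test))
```

In this setup, each candidate model would be fit on `train`, compared on `val`, and only the single chosen model would be scored on `test`, so the test score is not influenced by the selection step.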

Train vs. Validation vs. Test Set
