Validation
Get introduced to the importance of data splits and the process of cross-validation.
Since regularization is a method to fine-tune the subject model by introducing an additional penalty in the error function, we need to validate its impact. Several hyperparameters need to be set before optimizing the objective function. The hyperparameters include model , loss function , regularization function , and the scale of regularization . Validation is the process of testing the accuracy of the trained model, which also measures the validity of the hyperparameters.
Note: An accurate indicator of generalization is the performance of the trained model on
. unseen data This is the data that isn’t used in the training process.
Data splits
Where to get the unseen data for validation? One way is to hold out a percentage of available data and use the rest for training. Once the training is complete, the validation can be carried out on the subset of available data that was kept for validation, known as the hold-out set.
Note: The more popular term used for hold-out set is test set.
Get hands-on with 1400+ tech skills courses.