Skip to content

Overview

Description

This process of deciding whether the numerical results quantifying hypothesized relationships between variables are acceptable as descriptions of the data, is known as validation.

Generally, an error estimation for the model is made after training, better known as evaluation of residuals. In this process, a numerical estimate of the difference in predicted and original responses is done, also called the training error. However, this only gives us an idea about how well our model does on data used to train it. Now it's possible that the model is underfitting or overfitting the data. So, the problem with this evaluation technique is that it does not give an indication of how well the learner will generalize to an independent/unseen data set. Getting this idea about our model is known as Cross Validation.

Choosing the right loss function is crucial for effectively optimizing your model for the task at hand!

Popular functions:

Loss Function Typical Use Case Notes
Squared Error Loss Regression Sensitive to outliers due to squaring
Log Loss Binary Classification Optimized for probabilistic models and binary outcomes
Distortion Signal Processing/Compression Quality Measures fidelity or degradation between inputs and outputs
Cross-Entropy Loss Multi-Class Classification Focuses on probability distributions (often used with softmax output)