Population Loss

#Data #Model Selection

Consider a dataset with records $\{x_i, y_i\}$ and a model $\hat y_i = f(x_i)$. Suppose we know the actual generating process of the dataset, so that the joint probability density of the data is $p(x, y)$. The population loss is then defined over the whole assumed population,

$$ \begin{align} \mathcal L_{P} = \mathop{\mathbb{E}}_{p(x,y)}[ d(y, f(x))], \end{align} $$

where $d(y, f(x))$ is the distance defined between $y$ and $f(x)$.
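
When the generating process is known, the expectation above can be approximated by averaging $d(y, f(x))$ over a large number of samples drawn from $p(x, y)$. The following is a minimal sketch of that idea, not a definitive implementation. Everything specific in it is an assumption made for illustration: the generating process ($x$ uniform on $[0, 1]$, $y = 2x$ plus Gaussian noise with standard deviation 0.5), the squared error as the distance $d$, and the helper names `sample_population`, `squared_error`, and `estimate_population_loss`.

```python
import random


def sample_population(n, rng):
    """Draw n pairs (x, y) from an assumed generating process p(x, y)."""
    samples = []
    for _ in range(n):
        x = rng.uniform(0.0, 1.0)          # assumed marginal p(x)
        y = 2.0 * x + rng.gauss(0.0, 0.5)  # assumed conditional p(y | x)
        samples.append((x, y))
    return samples


def squared_error(y, y_hat):
    """One possible choice for the distance d(y, f(x))."""
    return (y - y_hat) ** 2


def estimate_population_loss(model, distance, n, seed=0):
    """Monte Carlo estimate of E_{p(x,y)}[d(y, f(x))]."""
    rng = random.Random(seed)
    samples = sample_population(n, rng)
    return sum(distance(y, model(x)) for x, y in samples) / n


def f(x):
    """A model that matches the noiseless part of the assumed process."""
    return 2.0 * x


def g(x):
    """A deliberately misspecified model, for comparison."""
    return x


loss_f = estimate_population_loss(f, squared_error, n=200_000)
loss_g = estimate_population_loss(g, squared_error, n=200_000)
print(loss_f, loss_g)
```

Under these assumptions the population loss of `f` is the noise variance, $0.5^2 = 0.25$, and that of `g` is $\mathbb{E}[x^2] + 0.25 = 1/3 + 0.25$, so the two estimates should land close to those values. In practice $p(x, y)$ is rarely known, which is why the loss is usually computed on a finite dataset instead.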

Planted: 2021-02-06 by L Ma;

Dynamic Backlinks to cards/machine-learning/measurement/population-loss:

Measures of Generalizability

To measure the generalization, we define a generalization error, $$ \begin{align} \mathcal G = …

Empirical Loss

The loss calculated on all the data points


L Ma (2021). 'Population Loss', Datumorphism, 02 April. Available at: https://datumorphism.leima.is/cards/machine-learning/measurement/population-loss/.