Project points to lower dimension
Scale the features
Goal: to standardize the features to avoid one dimension’s domination
$$ X_{new} = \frac{X_{old}-X_{mean}}{\text{STD}(\text{all xs})} $$
$$ X_{new} = \frac{X_{old}-X_{min}}{X_{max} - X_{mean}} $$
Standardization is good to use when the data follows a gaussian like distribution, and outliers would not affect this
Normalization is good to use when you know tat the distribution does not follow a
gaussian distribution
Suppose data is provided in some D-dimensional space, but it can be well explained in a M-dimensional subspace for M < D
We want to look at the projection with the highest sample variance, because this will be the most informative choice (least information will be lost)