the gaussian prior p(w)
loading python runtime...
peak density = 0.40
likelihood p(D|w)
drag data points
loading python runtime...
MLE ŵ = ·
posterior = prior × likelihood
— prior
— likelihood
— posterior
MAP·
MLE·
loading python runtime...
the same optimization, two views
regularization view: ∑(yᵢ − wxᵢ)² + λw²
bayesian view: −log p(w|D)
loading python runtime...
σ = 0.71
λ = σₙ²/σ²
min at ŵ = ·
playground: MLE vs MAP polynomial fit
drag points to refit
— MLE
— MAP (ridge)
loading python runtime...