Fitting a Hyperplane to Noisy Points

Question

You observe $n$ points in $\mathbb{R}^d$. They were generated by sampling points that lie on a single unknown hyperplane and then adding independent random noise.

1. Write the general equation of a hyperplane in $\mathbb{R}^d$.
2. Construct a loss function whose minimizer estimates that hyperplane.…

Accepted Answer

How to Think About It: A hyperplane is defined by a normal direction and an offset. Fitting it means finding the direction along which the points vary least: the noise-free points have zero spread along the plane's normal, so any spread observed in that direction comes from noise alone. This makes it a total-least-squares / PCA problem rather than an ordinary regression: there is no privileged 'response' coordinate, and the noise acts in all directions.

Part 1 -- Hyperplane equation: A hyperplane in $\mathbb{R}^d$ is $\{x : w^{\top}x = b\}$ with unit normal $w$ ($\lVert w \rVert = 1$) and offset $b$. Equivalently, $w^{\top}(x - x_0) = 0$ for any point $x_0$ on the plane.

Part 2 -- Loss function: Because $\lVert w \rVert = 1$, the perpendicular distance from point $x_i$ to the plane is $|w^{\top}x_i - b|$. Minimize the sum of squared orthogonal distances:

$$L(w, b) = \sum_{i=1}^n (w^{\top}x_i - b)^2 \quad \text{subject to } \lVert w \rVert = 1.$$

The unit-norm constraint is essential -- without it the trivial choice $w = 0$, $b = 0$ drives the loss to zero.

Part 3 -- Solution (PCA): Minimizing over $b$ first (set $\partial L / \partial b = 0$) gives $b = w^{\top}\bar{x}$, where $\bar{x}$ is the sample mean. Substituting, $L(w) = \sum_i…
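The "PCA" label on Part 3 points to the standard result that, with $b = w^{\top}\bar{x}$, the unit vector minimizing the sum of squared orthogonal distances is the eigenvector of the centered scatter matrix with the smallest eigenvalue. A minimal NumPy sketch under that assumption follows; the function name `fit_hyperplane`, the synthetic data, and the noise level are illustrative choices, not part of the original answer.

```python
import numpy as np

def fit_hyperplane(X):
    """Orthogonal-distance fit of {x : w^T x = b} with ||w|| = 1.

    X is an (n, d) array of points. Hypothetical helper for illustration.
    """
    X = np.asarray(X, dtype=float)
    x_bar = X.mean(axis=0)            # sample mean
    Xc = X - x_bar                    # centered points
    S = Xc.T @ Xc                     # d x d scatter matrix
    # eigh returns eigenvalues in ascending order for symmetric matrices,
    # so column 0 is the direction of least spread.
    _, eigvecs = np.linalg.eigh(S)
    w = eigvecs[:, 0]
    b = w @ x_bar                     # optimal offset for this w
    return w, b

# Synthetic check (assumed setup): points on a known plane plus small noise.
rng = np.random.default_rng(0)
n, d = 500, 3
w_true = np.array([1.0, 2.0, -2.0]) / 3.0   # unit normal
b_true = 0.5
Z = rng.normal(size=(n, d))
P = Z - np.outer(Z @ w_true - b_true, w_true)  # project onto the plane
X = P + 0.01 * rng.normal(size=(n, d))         # add isotropic noise

w_hat, b_hat = fit_hyperplane(X)
# (w, b) and (-w, -b) describe the same plane; align signs for comparison.
if w_hat @ w_true < 0:
    w_hat, b_hat = -w_hat, -b_hat
```

Because the pair $(w, b)$ is only determined up to a joint sign flip, any comparison against a reference normal has to align signs first, as the last lines do.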

Hints

Worked Solution

Intuition