Projections

When two vectors do not exist in the same column space, the best approximation of one in the other's columns space is called a projection.

Contents

Projections
1. Vectors
2. Matrices

Vectors

Given two vectors a and b, we can project b onto a to get the best possible estimate of the former as a multiple of the latter. This projection p has an error term e.

Take the multiple as x, so that p = ax. The error term can be characterized as b-p or b-ax.

a is orthogonal to e. Therefore, a^T(b-ax) = 0. This simplifies to x = (a^Tb)/(a^Ta). Altogether, the projection is characterized as p = a(a^Tb)/(a^Ta).

A matrix P can be defined such that p = Pb. The projection matrix is (aa^T)/(a^Ta). The column space of P (a.k.a. C(P)) is the line through a, and its rank is 1.

Incidentally, P is symmetric (i.e. P^T = P) and re-projecting does not change the result (i.e. P² = P).

Matrices

For problems like Ax = b where there is no solution for x, as in b does not exist in the column space of A, we can instead solve Ax = p where p estimates b with an error term e.

p is a linear combination of A: if there are two columns a₁ and a₂, then p = x₁a₁ + x₂a₂ and b = x₁a₁ + x₂a₂ + e.

e is orthogonal to the column space of A^T (a.k.a. C(A^T)), so A^T(b-Ax) = 0. Concretely in the same example, a₁^T(b-Ax) = 0 and a₂^T(b-Ax) = 0. More generally, this re-emphasizes that e is orthogonal in the null space of A^T (a.k.a. N(A^T)).

The solution for this all is x = (A^TA)^-1A^Tb. That also means that p = A(A^TA)^-1A^Tb.

A matrix P can be defined such that p = Pb. The projection matrix is A(A^TA)^-1A^T.

Note that if A were a square matrix, most of the above equations would cancel out. But we cannot make that assumption. This fundamentally means though that if b were in the column space of A, then P would be the identity matrix.

This should look familiar.

CategoryRicottone

LinearAlgebra/Projection

Projections

Vectors

Matrices