User:IssaRice/Linear algebra/Determinant of matrix equals product of eigenvalues

From Machinelearning

This is chapter 4 exercise 1.10 in Linear Algebra Done Wrong (p. 105).

Theorem. Let be a matrix. Then , where are the eigenvalues of (counting multiplicities).

Proof. We know that is an eigenvalue of if and only if . But is a polynomial in by equation 4.2 in the book. This means that is a root of the polynomial if and only if is an eigenvalue of .

Let's look at equation 4.2 from the book (p. 89) more closely:

The matrix has the form:

where the are entries containing just numbers. Thus when we use equation 4.2 to compute , we obtain the highest degree term (in terms of ) when we use the identity permutation (i.e. just go down the diagonal of the matrix), getting a degree term when multiplied out; in fact, the degree term will have coefficient . Every other permutation will take some of the diagonal entries and also some of the number-only entries, resulting in a term that, when multiplied out, gives a degree less than . This shows not only that is a polynomial in , but in fact a degree polynomial in .

So is an th degree polynomial, and its roots are exactly the eigenvalues of . By the fundamental theorem of algebra, has exactly complex roots, , which are also all the eigenvalues of (counting multiplicities). By the polynomial remainder theorem, this means that can be represented as . We want to show that .

Let's check the coefficient on the term. We already saw above that this equals . Multiplying out we get for the term. Thus the equation shows that as desired.

(What if we had written instead ? Then we would have gotten on the term. So we have the equation , since we already know that the coefficient on the term has to be . Yikes, are we in trouble? Thankfully not! We can take each of the n factors in and distribute it to each factor to get , so that we have , which is just the same as before!)

So finally we get that . Plugging in we get , which completes the proof. Alternatively, [TODO: this part I'm not sure about. idk what Treil had in mind here when he says to compare the terms without lambda. In particular, idk how to compute the constant term in det(A - lambda I).]

See also