Derivative of a quadratic form

Let $A \in M_{n,n}(\mathbb{R})$ be an $n \times n$ real matrix, and let $f : \mathbb{R}^{n} \to \mathbb{R}$ be defined by $f(x) = x^{T} A x$. On this page, we calculate the derivative of $f$.
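
To make the definition concrete, here is a minimal sketch in plain Python that evaluates $x^{T} A x$ as the double sum of $x_i A_{ij} x_j$. The function name `quadratic_form` and the sample matrix are illustrative assumptions, not part of the page; note that $A$ is deliberately not symmetric, since the definition does not require it.

```python
def quadratic_form(A, x):
    """Return x^T A x for a square matrix A (list of rows) and a vector x (list)."""
    n = len(x)
    # x^T A x is the sum over all i, j of x_i * A[i][j] * x_j
    return sum(x[i] * A[i][j] * x[j] for i in range(n) for j in range(n))


# Hypothetical sample data: a non-symmetric 2 x 2 matrix and a vector.
A = [[1.0, 2.0],
     [3.0, 4.0]]
x = [1.0, -1.0]

print(quadratic_form(A, x))  # 1 - 2 - 3 + 4, prints 0.0
```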

Understanding the problem

Straightforward method

Using the definition of the derivative

The derivative of $f$ at a point $x_{0}$ is the linear transformation $L$ such that:

$$\lim_{x \to x_{0},\; x \neq x_{0}} \frac{| f(x) - (f(x_{0}) + L(x - x_{0})) |}{| x - x_{0} |} = 0$$

Using our function, this is:

$$\lim_{x \to x_{0},\; x \neq x_{0}} \frac{| x^{T} A x - x_{0}^{T} A x_{0} - L(x - x_{0}) |}{| x - x_{0} |} = 0$$

Defining $h = x - x_{0}$, we have $x = x_{0} + h$, and the fraction inside the limit becomes

$$\frac{| (x_{0} + h)^{T} A (x_{0} + h) - x_{0}^{T} A x_{0} - L(h) |}{| h |}$$

Consider the subexpression $(x_{0} + h)^{T} A (x_{0} + h)$. Multiplication by the matrix $A$ is linear, so $A (x_{0} + h) = A x_{0} + A h$ and the subexpression equals $(x_{0} + h)^{T} (A x_{0} + A h)$. Since the transpose of a sum is the sum of the transposes, this is $(x_{0}^{T} + h^{T}) (A x_{0} + A h)$. Distributing the product gives $x_{0}^{T} A x_{0} + h^{T} A x_{0} + x_{0}^{T} A h + h^{T} A h$.
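
The four-term expansion can be sanity-checked numerically. The sketch below uses plain Python lists; the helper names (`mat_vec`, `dot`, `bilinear`) and the sample values of $A$, $x_{0}$ and $h$ are assumptions chosen only for illustration.

```python
def mat_vec(A, v):
    """Matrix-vector product A v, with A given as a list of rows."""
    return [sum(a * b for a, b in zip(row, v)) for row in A]


def dot(u, v):
    """Euclidean inner product u^T v."""
    return sum(a * b for a, b in zip(u, v))


def bilinear(u, A, v):
    """The scalar u^T A v."""
    return dot(u, mat_vec(A, v))


# Hypothetical sample data (A is not symmetric on purpose).
A = [[1.0, 2.0],
     [3.0, 4.0]]
x0 = [1.0, -1.0]
h = [0.5, 0.25]
x = [a + b for a, b in zip(x0, h)]  # x = x0 + h

# Left side: (x0 + h)^T A (x0 + h)
lhs = bilinear(x, A, x)
# Right side: x0^T A x0 + h^T A x0 + x0^T A h + h^T A h
rhs = (bilinear(x0, A, x0) + bilinear(h, A, x0)
       + bilinear(x0, A, h) + bilinear(h, A, h))

print(abs(lhs - rhs) < 1e-12)
```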

Now the fraction is

$$\frac{| x_{0}^{T} A x_{0} + h^{T} A x_{0} + x_{0}^{T} A h + h^{T} A h - x_{0}^{T} A x_{0} - L(h) |}{| h |} = \frac{| h^{T} A x_{0} + x_{0}^{T} A h + h^{T} A h - L(h) |}{| h |}$$

Consider the term $h^{T} A x_{0}$. It is a real number (a $1 \times 1$ matrix), so taking the transpose leaves it unchanged, and the transpose of a product reverses the order of the factors: $h^{T} A x_{0} = (h^{T} A x_{0})^{T} = x_{0}^{T} A^{T} h$.
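
As a quick check of this transpose step, the following sketch compares $h^{T} A x_{0}$ with $x_{0}^{T} A^{T} h$ for a non-symmetric matrix, and also shows that dropping the transpose gives a different number. The helper names and sample values are illustrative assumptions.

```python
def mat_vec(A, v):
    """Matrix-vector product A v, with A given as a list of rows."""
    return [sum(a * b for a, b in zip(row, v)) for row in A]


def dot(u, v):
    """Euclidean inner product u^T v."""
    return sum(a * b for a, b in zip(u, v))


def transpose(A):
    """Transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*A)]


# Hypothetical sample data (A is not symmetric on purpose).
A = [[1.0, 2.0],
     [3.0, 4.0]]
x0 = [1.0, -1.0]
h = [0.5, 0.25]

h_A_x0 = dot(h, mat_vec(A, x0))              # h^T A x0
x0_At_h = dot(x0, mat_vec(transpose(A), h))  # x0^T A^T h
x0_A_h = dot(x0, mat_vec(A, h))              # x0^T A h (no transpose)

print(h_A_x0 == x0_At_h)  # the two scalars agree
print(h_A_x0 == x0_A_h)   # differs here because A is not symmetric
```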

Now the fraction is

$$\frac{| x_{0}^{T} A^{T} h + x_{0}^{T} A h + h^{T} A h - L(h) |}{| h |} = \frac{| x_{0}^{T} (A^{T} + A) h + h^{T} A h - L(h) |}{| h |}$$
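
One way to finish from this last fraction, given here as a sketch: the map $h \mapsto x_{0}^{T} (A^{T} + A) h$ is linear in $h$, so it is a natural candidate for $L$. With that choice the numerator reduces to $| h^{T} A h |$, and by the Cauchy-Schwarz inequality $| h^{T} A h | \le | h | \, | A h | \le \| A \| \, | h |^{2}$, where $\| A \|$ denotes the operator norm. The fraction is then at most $\| A \| \, | h |$, which tends to $0$ as $h \to 0$. The Python snippet below checks this numerically along one direction; the function names, the sample data, and the use of the Frobenius norm as an easy upper bound for $\| A \|$ are assumptions made for illustration.

```python
import math


def mat_vec(A, v):
    """Matrix-vector product A v, with A given as a list of rows."""
    return [sum(a * b for a, b in zip(row, v)) for row in A]


def dot(u, v):
    """Euclidean inner product u^T v."""
    return sum(a * b for a, b in zip(u, v))


def f(A, x):
    """The quadratic form x^T A x."""
    return dot(x, mat_vec(A, x))


def candidate_L(A, x0, h):
    """Candidate derivative applied to h: x0^T (A^T + A) h."""
    n = len(x0)
    sym = [[A[j][i] + A[i][j] for j in range(n)] for i in range(n)]  # A^T + A
    return dot(x0, mat_vec(sym, h))


def remainder_ratio(A, x0, h):
    """|f(x0 + h) - f(x0) - L(h)| / |h| for the candidate L."""
    x = [a + b for a, b in zip(x0, h)]
    numerator = abs(f(A, x) - f(A, x0) - candidate_L(A, x0, h))
    return numerator / math.sqrt(dot(h, h))


# Hypothetical sample data (A is not symmetric on purpose).
A = [[1.0, 2.0],
     [3.0, 4.0]]
x0 = [1.0, -1.0]
direction = [0.5, 0.25]

# Frobenius norm of A, which is an upper bound for the operator norm.
frobenius = math.sqrt(sum(a * a for row in A for a in row))

ratios = []
for t in [1.0, 0.1, 0.01, 0.001]:
    h = [t * d for d in direction]  # shrink h toward 0 along one direction
    ratio = remainder_ratio(A, x0, h)
    ratios.append(ratio)
    print(t, ratio, ratio <= frobenius * math.sqrt(dot(h, h)) + 1e-12)
```

The printed ratios shrink roughly in proportion to $t$, which is consistent with the bound above. A check along a single direction is only evidence, not a proof; the inequality is what establishes the limit.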

Using the chain rule