What is a Hessian matrix?

Hessian matrix is crucial in machine learning and data science since it’s used to optimize functions. Hessian matrix is widely used in neural networks and other models.

If we take the second-order derivative of $f:R^n\to R$ , the resultant matrix is called a Hessian matrix. Since the derivative of a derivative is commutativedoesn’t depend on the order, Hessian matrices are symmetric.

H_f= \begin{bmatrix} \dfrac{\partial^2 f}{\partial x_1^2} & \dfrac{\partial^2 f}{\partial x_1\,\partial x_2} & \cdots & \dfrac{\partial^2 f}{\partial x_1\,\partial x_n} \\[2.2ex] \dfrac{\partial^2 f}{\partial x_2\,\partial x_1} & \dfrac{\partial^2 f}{\partial x_2^2} & \cdots & \dfrac{\partial^2 f}{\partial x_2\,\partial x_n} \\[2.2ex] \vdots & \vdots & \ddots & \vdots \\[2.2ex] \dfrac{\partial^2 f}{\partial x_n\,\partial x_1} & \dfrac{\partial^2 f}{\partial x_n\,\partial x_2} & \cdots & \dfrac{\partial^2 f}{\partial x_n^2} \end{bmatrix}

A Hessian matrix has several uses, including the following:

$2^{nd}$ order optimization
Testing for critical points

Code

We can evaluate a Hessian in several numerical computing libraries. Following are the respective functions/syntax of some of them:

What is a Hessian matrix?

Code

Test for critical points