Ch.10
Hessian Matrix: Second Derivatives and Curvature of Surfaces
Math diagram by chapter
Select a chapter to see its diagram below. View the flow of intermediate math at a glance.
Bowl: curves only down → minimum
Saddle: value ↑ this way, value ↓ that way
Orange direction: value goes up · Green direction: value goes down
Saddle: neither minimum nor maximum
Bowl curves only down → here is the minimum
Inverted bowl curves only up → here is the maximum
Saddle: one direction up, the other down → neither min nor max
The Hessian matrix is a square matrix of second-order partial derivatives of a scalar function. It encodes how much a surface curves at a point and is used to classify minima, maxima, and saddle points in optimization, and forms the basis of Newton's method and trust-region methods.
Hessian Matrix: Reading the Curvature of Surfaces
- Formula
- Symbol meaning = the number in the entry of the table—think of it as "differentiate once in , once in ." is the function, , are variable (axis) indices. Order does not matter, so and the matrix is symmetric.
- Formula (total entries)
- Symbol meaning = number of variables. With variables the Hessian is , so there are entries. E.g. 2 vars → 4, 3 vars → 9.
- Formula (independent entries)
- Symbol meaning = number of variables. By symmetry you only count the upper triangle, giving . E.g. 2 vars → 3, 3 vars → 6.
- Formula (rows/columns)
- Symbol meaning = number of variables. The Hessian is , so "how many rows? columns?" both are .
- FormulaEigenvalue test
- Symbol meaning = eigenvalue of the Hessian (curvature in each direction). All positive → bowl, minimum. All negative → dome, maximum. Mixed signs → up one way, down the other, saddle.
- Formula
- Symbol meaning = current point, = next point. = Hessian at that point, = its inverse. = gradient there. The formula "jumps toward the bottom" using both gradient and curvature.
- Formula
- Symbol meaning = current position, = next. = slope (1st derivative), = 2nd derivative (Hessian in 1D). For , (constant).
- Formula ()
- Symbol meaning = second derivative. is the coefficient of . For a quadratic, differentiating twice removes and leaves the constant .
- Formula (critical point)
- Symbol meaning = gradient (vector of 1st partials). = zero vector ("no gradient"). Where the gradient is zero is a candidate for min/max/saddle; use Hessian eigenvalues to tell which.
| Formula | Symbol meaning |
|---|---|
| = the number in the entry of the table—think of it as "differentiate once in , once in ." is the function, , are variable (axis) indices. Order does not matter, so and the matrix is symmetric. | |
| (total entries) | = number of variables. With variables the Hessian is , so there are entries. E.g. 2 vars → 4, 3 vars → 9. |
| (independent entries) | = number of variables. By symmetry you only count the upper triangle, giving . E.g. 2 vars → 3, 3 vars → 6. |
| (rows/columns) | = number of variables. The Hessian is , so "how many rows? columns?" both are . |
| Eigenvalue test | = eigenvalue of the Hessian (curvature in each direction). All positive → bowl, minimum. All negative → dome, maximum. Mixed signs → up one way, down the other, saddle. |
| = current point, = next point. = Hessian at that point, = its inverse. = gradient there. The formula "jumps toward the bottom" using both gradient and curvature. | |
| = current position, = next. = slope (1st derivative), = 2nd derivative (Hessian in 1D). For , (constant). | |
| () | = second derivative. is the coefficient of . For a quadratic, differentiating twice removes and leaves the constant . |
| (critical point) | = gradient (vector of 1st partials). = zero vector ("no gradient"). Where the gradient is zero is a candidate for min/max/saddle; use Hessian eigenvalues to tell which. |
Worked examples
문제
Read the instructions below, find the answer (integer), and enter it in the blank (?).