Newton’s Method
Explore Newton’s method to understand how leveraging second-order derivatives, captured in the Hessian, can speed up optimization. Learn the algorithm’s iterative process for finding function minima faster than gradient descent. This lesson helps you implement Newton’s method for non-convex functions using Python libraries, enhancing your machine learning optimization skills.
Second-order optimization algorithms
Newton’s method belongs to a class of optimization algorithms that leverage second-order information, such as the Hessian, to achieve faster and more efficient convergence. In contrast, gradient descent algorithms, such as Nesterov momentum, rely solely on first-order gradient information.
The idea of Newton’s method is to use the curvature information contained in the Hessian to build a more accurate local approximation of the function near the optimum.
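To make the contrast concrete, here is a minimal NumPy sketch on a small, hypothetical quadratic objective (the matrix `A` and vector `b` below are illustrative choices, not from this lesson). It compares a single gradient-descent step with a single Newton step; because the objective is quadratic, the Newton step, which uses the Hessian, lands exactly on the minimizer, while the gradient step only moves a short distance along the negative gradient.

```python
import numpy as np

# Illustrative quadratic objective: f(x) = 0.5 * x^T A x - b^T x
A = np.array([[3.0, 0.5],
              [0.5, 1.0]])   # positive-definite, so A is the (constant) Hessian
b = np.array([1.0, 2.0])

def grad(x):
    return A @ x - b          # first-order information: the gradient of f

def hess(x):
    return A                  # second-order information: the Hessian of f

x = np.zeros(2)               # starting point

# One gradient-descent step with a small learning rate
lr = 0.1
x_gd = x - lr * grad(x)

# One Newton step: subtract H^{-1} * gradient
# (solve the linear system rather than forming the inverse explicitly)
x_newton = x - np.linalg.solve(hess(x), grad(x))

print("gradient-descent step:", x_gd)
print("Newton step:          ", x_newton)
print("true minimizer:       ", np.linalg.solve(A, b))
```

Running this shows the Newton iterate coinciding with the true minimizer after a single update, which is exactly the benefit of using curvature information on (locally) quadratic objectives.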
Recall the second-order Taylor series expansion of our objective $f$ around the current point $x_t$:

$$f(x) \approx f(x_t) + \nabla f(x_t)^\top (x - x_t) + \frac{1}{2}(x - x_t)^\top \nabla^2 f(x_t)\,(x - x_t),$$
where