
Gradients of Vector-Valued Functions

Explore the concept of gradients for vector-valued functions, extending from scalar outputs to multi-dimensional outputs. Understand how to compute the Jacobian matrix using partial derivatives and apply the chain rule. Gain practical skills in calculating gradients for functions mapping vectors to vectors, supported by Python examples involving NumPy and SciPy.

Vector-valued functions and their gradients

A vector-valued function in mathematics is a function that takes one or more input variables and produces an output whose components form a vector. For example, the function $f(\theta) = \begin{bmatrix} \cos(\theta) \\ \sin(\theta) \end{bmatrix}$ is a vector-valued function that describes a circle in the 2D plane. The function takes an input $\theta \in [0, 2\pi]$ (the angle with the $x$-axis) and returns a 2D vector representing the coordinates on the unit circle, $x = \cos(\theta)$ and $y = \sin(\theta)$.
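As a quick sanity check, the circle function above can be sketched in NumPy; sampling a few angles confirms that every output lands on the unit circle (i.e., has norm 1):

```python
import numpy as np

def f(theta):
    """Vector-valued function tracing the unit circle: f(theta) = [cos(theta), sin(theta)]."""
    return np.array([np.cos(theta), np.sin(theta)])

# Sample a few angles in [0, 2*pi]; each output should have Euclidean norm 1.
for theta in np.linspace(0.0, 2.0 * np.pi, 5):
    point = f(theta)
    print(f"theta={theta:.3f}  f(theta)={point}  norm={np.linalg.norm(point):.6f}")
```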

So far, we have discussed the partial derivatives and gradients of functions that take one or more input variables and output a scalar value, i.e., $f: \mathbb{R}^m \rightarrow \mathbb{R}$. Now, we will generalize the notion of gradients to vector-valued functions $f: \mathbb{R}^m \rightarrow \mathbb{R}^n$, where $m \geq 1$ ...
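For a function $f: \mathbb{R}^m \rightarrow \mathbb{R}^n$, the gradient generalizes to the $n \times m$ Jacobian matrix of partial derivatives $\partial f_i / \partial x_j$. As a hedged sketch (the function $f(x) = [x_0^2 + x_1, \sin(x_1)]$ below is an illustrative choice, not one from the text), the Jacobian can be approximated numerically with central differences and compared against the analytic partial derivatives:

```python
import numpy as np

def f(x):
    # Illustrative map from R^2 to R^2: f(x) = [x0^2 + x1, sin(x1)].
    return np.array([x[0] ** 2 + x[1], np.sin(x[1])])

def numerical_jacobian(func, x, eps=1e-6):
    """Approximate the n x m Jacobian of func at x via central differences."""
    x = np.asarray(x, dtype=float)
    fx = func(x)
    J = np.zeros((fx.size, x.size))
    for j in range(x.size):
        step = np.zeros_like(x)
        step[j] = eps
        # Column j holds the partial derivatives of all outputs w.r.t. x_j.
        J[:, j] = (func(x + step) - func(x - step)) / (2 * eps)
    return J

x0 = np.array([1.0, 0.5])
J_num = numerical_jacobian(f, x0)
# Analytic Jacobian: [[2*x0, 1], [0, cos(x1)]]
J_exact = np.array([[2 * x0[0], 1.0], [0.0, np.cos(x0[1])]])
print(np.allclose(J_num, J_exact, atol=1e-5))
```

Each column of the Jacobian is the partial derivative of the full output vector with respect to one input variable, which is exactly the structure the finite-difference loop exploits.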