When gradient is small...

Optimization Fails because...

Untitled

How to find out we stuck at local minima or saddle point?

Tayler Series Approximation

圖2

圖2

圖3

圖3

Hessian

Untitled

Example

圖片解釋