Orthogonal transformations and gradient updates

We show that if the contour lines of a function are symmetric with respect to some rotation or reflection, then so is the evolution of gradient descent when minimising that function. Rotation of the space on which the function is evaluated effects a corresponding rotation of each of the points visited under gradient descent (similarly, for reflections).

This ultimately comes down to showing the following: if f: \mathbb{R}^N \to \mathbb{R} is the differentiable function being minimised and g is a rotation or reflection that preserves the contours of f, then

(1)   \begin{equation*} \nabla |_{g(u)} f = g ( \nabla |_u f) \end{equation*}

for all points u \in \mathbb{R}^N.


We consider below three one-dimensional examples that demonstrate that, even if the function f is symmetric with respect to all orthogonal transformations, it is necessary that the transformation g be orthogonal in order for the property (1) above to hold.

Leave a Reply

Your email address will not be published. Required fields are marked *