Problem
Mirror Descent
: Legendre function
: Legendre Dual with
: Bregman Divergence
Traditional Algorithm:
, where
and
Equivalent Expression:
The Mirror Descent algorithm can be equivalently expressed as:
Comparison with projected subgradient descent
projected subgradient descent algorithm:
and similarly, it can be equivalently expressed as:
So the difference between Mirror Descent and Projected Subgradient Descent is just it replace the regular 2-norm by a more general Bregman divergence function.
Convergence Rate for Mirror Descent
Assumption:
is
-strongly convex with respect to some norm
, namely
and
Then we have
Reference
- paper linked to: https://web.iem.technion.ac.il/images/user-files/becka/papers/3.pdf
- also this note: http://users.cecs.anu.edu.au/~xzhang/teaching/bregman.pdf