Negative Momentum for Improved Game Dynamics Gauthier Gidel* , - - PowerPoint PPT Presentation

negative momentum for improved game dynamics
SMART_READER_LITE
LIVE PREVIEW

Negative Momentum for Improved Game Dynamics Gauthier Gidel* , - - PowerPoint PPT Presentation

Negative Momentum for Improved Game Dynamics Gauthier Gidel* , Reyhane Askari Hemmat*, Mohammad Pezeshki, Gabriel Huang, Remi Lepriol, Simon Lacoste-Julien, Ioannis Mitliagkas *equal contribution Simple Min-max smooth game: Gradient dynamic:


slide-1
SLIDE 1

Negative Momentum for Improved Game Dynamics

Gauthier Gidel*, Reyhane Askari Hemmat*, Mohammad Pezeshki, Gabriel Huang, Remi Lepriol, Simon Lacoste-Julien, Ioannis Mitliagkas

*equal contribution

slide-2
SLIDE 2

Simple Min-max smooth game:

Gradient dynamic: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

slide-3
SLIDE 3

Simple Min-max smooth game:

Gradient dynamic: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

slide-4
SLIDE 4

Simple Min-max smooth game:

Gradient dynamic: Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

slide-5
SLIDE 5

Way to optimize bilinear games

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

slide-6
SLIDE 6

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Way to optimize bilinear games

slide-7
SLIDE 7

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Way to optimize bilinear games

slide-8
SLIDE 8

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Way to optimize bilinear games

slide-9
SLIDE 9

This talk

> >

(Improvements) (Improvements) Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

Way to optimize bilinear games

slide-10
SLIDE 10

General 2 player games:

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 Two players aim to minimize their respective cost functions:

slide-11
SLIDE 11

General 2 player games:

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 Two players aim to minimize their respective cost functions: Examples:

  • Simple class of zero-sum games: ( )
slide-12
SLIDE 12

General 2 player games:

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 Two players aim to minimize their respective cost functions: Examples:

  • Simple class of zero-sum games: ( )
  • Generative Adversarial Networks:

(non-saturating GAN from Goodfellow et al. 2014)

slide-13
SLIDE 13

General 2 player games:

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 Two players aim to minimize their respective cost functions: Dynamics of gradient based method depends on the gradient vector fields:

slide-14
SLIDE 14

General 2 player games:

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 Two players aim to minimize their respective cost functions: Dynamics of gradient based method depends on the gradient vector fields: And its associated Jacobian,

slide-15
SLIDE 15

Fixed point dynamics

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 Gradient method is defined as the repetition of the operator: Thus, the sequence computed is

slide-16
SLIDE 16

Fixed point dynamics

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 Gradient method is defined as the repetition of the operator: Thus, the sequence computed is We aim to converge to a Nash Equilibrium:

slide-17
SLIDE 17

Tuning the step size

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 Jacobian of our fixed point operator:

  • To have fixed point we need to be definite positive.
  • Thus, small enough step-size Eigenvalues in the unit disk.
  • Want to find optimal step-size.
slide-18
SLIDE 18

Fixed point dynamics

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  • Local convergence.
  • Stationary point may not be a Nash equilibrium. (See Adolphs et al. 2018)
  • But any Nash equilibrium is an stationary point.
  • In this talk: local results on stationary points.

Jacobian of our fixed point operator:

slide-19
SLIDE 19

Tuning the step size

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

slide-20
SLIDE 20

Negative Momentum

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 Fixed point operator requires a state augmentation : (because need previous iterates) Recall Polyak’s momentum :

slide-21
SLIDE 21

Negative Momentum

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

slide-22
SLIDE 22

Negative Momentum

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018

  • Fixed momentum.

(- 0.25)

  • Step-size is not fixed.
  • Helps when the eigenvalue

has large imaginary part.

slide-23
SLIDE 23

What happens in practice ?

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 Fashion MNIST:

slide-24
SLIDE 24

What happen in practice ?

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 CIFAR-10:

slide-25
SLIDE 25

Negative Momentum

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 To sum up:

  • Negative momentum seems to improve the behaviour of the “bad” eigenvalues.
  • If small enough seems to always help.
  • It also allows larger step-size.
slide-26
SLIDE 26

Thank you !

Gauthier Gidel, Workshop on learning and strategic behavior, August 22, 2018 If you are interested in that topic:

  • NIPS Workshop : Smooth Games Optimization and Machine Learning

Co-organized with: Simon Lacoste-Julien · Ioannis Mitliagkas · Vasilis Syrgkanis · Eva Tardos · Leon Bottou · Sebastian Nowozin Soon : Call for contributions !!!