

SLIDE 1

Enforcing constraints for interpolation and extrapolation in Generative Adversarial Networks

Panos Stinis (joint work with T. Hagge, A.M. Tartakovsky and E. Yeung)

Pacific Northwest National Laboratory. Supported by the "Deep Learning for Scientific Discovery Investment" and "PhILMs".

ICERM Scientific Machine Learning, January 2019

SLIDE 2

Statement of the problem

Generative Adversarial Networks (GANs) are becoming a popular machine-learning choice for training generators. There is a concerted effort in the machine-learning community: i) to expand the range of tasks to which learning can be applied and ii) to use methods from other disciplines to accelerate learning.

Task: We want to enforce given constraints on the output of a GAN generator, both for interpolation and for extrapolation (prediction). Given a time series, we wish to train GAN generators to represent the flow map of a system.

Remark: The cases of interpolation and extrapolation need to be treated differently.


SLIDE 3

Definition and basic properties of GANs

We are interested in training a generator to produce data from a given distribution (called the true data distribution). GANs formulate this task as a game between the generator and an associated discriminator (Goodfellow et al., 2014). The objective of the generator is to trick the discriminator into deciding that the generator-created samples actually come from the true data distribution. The best possible performance of the generator is to convince the discriminator half of the time that the samples it is generating come from the true distribution.


SLIDE 4

A two-player minimax game with value function V(D, G):

min_G max_D V(D, G) = E_{x∼p_data(x)}[log D(x)] + E_{z∼p_z(z)}[log(1 − D(G(z)))],

where p_data(x) is the true distribution and p_z(z) is the input distribution of the generator. The generator G and the discriminator D are assumed to be neural networks with parameters θ_g and θ_d respectively.

For a given generator G, the optimal discriminator D*_G(x) is given by

D*_G(x) = p_data(x) / (p_data(x) + p_g(x)),

where p_g(x) is the generator's output distribution.
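The closed-form optimal discriminator above can be evaluated directly whenever both densities are known. A minimal numpy sketch (not from the talk; the Gaussian densities are an illustrative assumption):

```python
import numpy as np

def gaussian_pdf(x, mu, sigma=1.0):
    """Normal density, used as a stand-in for p_data and p_g (illustrative)."""
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

def optimal_discriminator(x, p_data, p_g):
    """D*_G(x) = p_data(x) / (p_data(x) + p_g(x))."""
    return p_data(x) / (p_data(x) + p_g(x))

x = np.linspace(-5.0, 5.0, 1001)
p_data = lambda t: gaussian_pdf(t, 0.0)
p_g = lambda t: gaussian_pdf(t, 1.0)        # generator off by a shift

d_star = optimal_discriminator(x, p_data, p_g)
# When the generator matches the true density, D* collapses to 1/2 everywhere.
d_matched = optimal_discriminator(x, p_data, p_data)
```

Note that D* always lies strictly between 0 and 1, and equals 1/2 exactly where the two densities agree.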


SLIDE 5

We can define C(G) = max_D V(D, G) = V(D*_G, G).

1) The global minimum of C(G) is attained if and only if p_g = p_data.
2) Since at this minimum D*_G(x) = 1/2, the value of the minimum is −log 4.
3) For the generator, the minimum of log(1 − D(G(z))) is −log 2.

Remark: GANs are notoriously difficult to train. In particular, the discriminator learns much faster than the generator, and this leads to an instability in the training process.

Remark: There are many variants of the basic GAN framework. Our construction can be applied to those variants too.
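These properties can be checked numerically by writing C(G) = V(D*_G, G) as an integral and evaluating it on a grid. A sketch under an illustrative Gaussian assumption for the densities (not code from the talk):

```python
import numpy as np

def gaussian_pdf(x, mu, sigma=1.0):
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

def c_of_g(p_data, p_g, dx):
    """C(G) = ∫ [p_data log D* + p_g log(1 − D*)] dx, with D* = p_data/(p_data + p_g)."""
    d_star = p_data / (p_data + p_g)
    integrand = p_data * np.log(d_star) + p_g * np.log(1.0 - d_star)
    return np.sum(integrand) * dx   # Riemann sum on a fine grid

x = np.linspace(-12.0, 12.0, 24001)
dx = x[1] - x[0]
p_data = gaussian_pdf(x, 0.0)

c_min = c_of_g(p_data, p_data, dx)                # p_g = p_data: the global minimum
c_off = c_of_g(p_data, gaussian_pdf(x, 2.0), dx)  # mismatched generator: strictly larger
```

At the minimum the integrand reduces to p_data · log(1/4), so the integral is −log 4; any mismatch between p_g and p_data raises C(G) above that value.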


SLIDE 6

Enforcing constraints for interpolation

We have a prior distribution pz(z) on the input of the generator, and we would like to train the generator to produce samples from a function f(z), i.e., we want the data to satisfy x = f(z). In the original GAN implementation, for each z we produce an x that satisfies the constraint x = f(z). We feed the discriminator both the data x and the data x′ = G(z) that the generator produces.

Remark: This implementation is straightforward but does not exploit the known bond between z and x.

Idea: We can enforce the constraint by augmenting the discriminator input vector as (z, ε), where ε = x − f(z) for the true sample. Similarly, for a generator-created sample, we augment the discriminator input vector as (z, ε′), where ε′ = x − G(z).
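In code, the augmentation amounts to appending the constraint residual to the discriminator's input vector. A minimal numpy sketch; the constraint f(z) = sin(z) and the imperfect generator below are illustrative assumptions, not the talk's setup:

```python
import numpy as np

def f(z):
    """Known constraint the data satisfies: x = f(z). Illustrative choice."""
    return np.sin(z)

def augment_true(z, x):
    """True sample: residual eps = x - f(z), which is zero for exact data."""
    eps = x - f(z)
    return np.stack([z, eps], axis=-1)

def augment_generated(z, x, generator):
    """Generator-created sample: residual eps' = x - G(z)."""
    eps_prime = x - generator(z)
    return np.stack([z, eps_prime], axis=-1)

z = np.linspace(0.0, 2 * np.pi, 8)
x = f(z)                                   # exact data satisfying the constraint
true_in = augment_true(z, x)               # residual column is identically zero
gen_in = augment_generated(z, x, lambda t: np.sin(t) + 0.1)  # biased generator
```

The discriminator then sees a residual column that is exactly zero for true samples and nonzero whenever the generator violates the constraint, so the constraint feeds directly into the generator's gradients.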


SLIDE 7

Pros and cons

The modification is easy to implement with modern software, e.g., TensorFlow. We introduce a constraint on the data created by the generator. This constraint directly influences the parameters of the generator network (weights and biases, in the case of convolutional neural nets) through backpropagation. We respect the game-theoretic setup of GANs (under reasonable smoothness assumptions on the constraint). On the other hand, the modification can exacerbate the known instability issue of GANs.


SLIDE 8

Adding noise to promote stability

To avoid the instability, we can regulate the discriminator's ability to distinguish between generator-created and true data by adding noise to the constraint residual for the true data only. If the true data satisfy the constraint exactly, then we can decide how much noise to add based on Monte Carlo considerations. If the true data come from a numerical method (with a given order of accuracy) or a physical experiment (with known precision), then we can use this information to decide the magnitude of the noise.

The idea behind adding the noise is that the discriminator becomes more tolerant, accepting generator-created samples whose constraint residual is not exactly 0 but lies within a narrow interval around it.
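One way to set the noise magnitude can be sketched as follows: for data produced by a scheme of order p with stepsize δz, let the residual noise level scale like δz^p. The constant c and the Gaussian noise model are assumptions for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

def residual_noise_scale(dz, order, c=1.0):
    """Noise level matched to a p-th order scheme's accuracy: sigma ~ c * dz**order."""
    return c * dz ** order

def noisy_true_residual(eps, dz, order):
    """Add noise to the constraint residual of the TRUE data only, so the
    discriminator tolerates residuals in a narrow band around zero."""
    sigma = residual_noise_scale(dz, order)
    return eps + sigma * rng.standard_normal(eps.shape)

eps_true = np.zeros(1000)                       # exact data: residual is 0
eps_fed = noisy_true_residual(eps_true, dz=1e-2, order=2)   # sigma = 1e-4
```

Generator samples keep their raw residual, so the discriminator can no longer separate the two populations by checking for an exactly zero residual.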


SLIDE 9

Numerical example - interpolation

Figure: Two linearly coupled oscillators. (a) Evolution of the absolute value of the generator and discriminator game-theoretic errors, with and without the constraint enforced (linear-linear plot); (b) evolution of the relative error RE_m of the learned function, with and without the constraint enforced, for 10^4 samples (linear-log plot).

Remark: The game-theoretic optimum can be reached much faster than the target relative-error threshold (the very different solution landscapes call for an adaptive learning rate).


SLIDE 10

Enforcing constraints for extrapolation

Figure: Two linearly coupled oscillators. Comparison of the target function x1(z) = R − sin(z) (blue dots) and the GAN prediction x_pos^extra(z) (red crosses). (a) 10^4 samples of noiseless data with the constraints enforced during training and a projection step; (b) 10^4 samples of noisy data with the constraints enforced during training and a projection step.


SLIDE 11

The need for error-correction

Remark: The repeated application of the flow-map neural network model (the generative model) leads to error accumulation.

Remark: The predicted trajectory veers off into untrained parts of phase space. As a result, the generator predictions fail.

Idea: Introduce an error-correcting (restoring) force → train an interpolatory generator with noisy data.

Implementation: We center a cloud of points at each data point provided on the trajectory. Then, during training, at each step we force the GAN generator to map a point in this cloud to the correct (noiseless) point on the trajectory at the next step.

Remark: This is akin to temporal renormalization.
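The cloud construction can be sketched as follows; the trajectory, cloud size, and noise level below are illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(1)

def cloud_training_pairs(trajectory, n_cloud=16, sigma=0.05):
    """For each point x_n on the trajectory, build a cloud of noisy inputs
    that must all map to the noiseless NEXT point x_{n+1}."""
    inputs, targets = [], []
    for x_now, x_next in zip(trajectory[:-1], trajectory[1:]):
        cloud = x_now + sigma * rng.standard_normal((n_cloud,) + np.shape(x_now))
        inputs.append(cloud)
        targets.append(np.broadcast_to(x_next, cloud.shape).copy())
    return np.concatenate(inputs), np.concatenate(targets)

traj = np.sin(np.linspace(0.0, 1.0, 11))   # illustrative 1-D trajectory, 11 points
X, Y = cloud_training_pairs(traj)          # 10 transitions * 16 cloud points each
```

Because every perturbed input in a cloud is forced back onto the clean trajectory, the learned flow map acquires a restoring behavior for states slightly off the training trajectory.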


SLIDE 12

Generator dynamics = approximate dynamics + restoring force.

Incorporate the restoring force implicitly by training with noisy data, or incorporate it explicitly by training with noisy data and learning the functional form of the restoring force.

Remark: The functional form of the restoring force can come from (temporal) model-reduction considerations.

Remark: It can even be represented by a separate neural network.

Remark: The magnitude of the noise depends on the accuracy of the ground truth (order of accuracy, measurement errors).

Remark: We expect scaling laws for the parameters as a function of the stepsize (incomplete similarity).
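A toy sketch of the explicit decomposition; the approximate flow map (an Euler step of x′ = −x), the linear restoring force, and its gain k are all illustrative stand-ins, not the talk's model:

```python
def approximate_flow_map(x, dz):
    """Stand-in for the learned generator dynamics (illustrative: Euler step of x' = -x)."""
    return x + dz * (-x)

def restoring_force(x, x_ref, k=0.5):
    """Explicit restoring force pulling the state toward a reference trajectory.
    The linear form and gain k are modeling choices; this term could itself be
    a separate neural network."""
    return -k * (x - x_ref)

def generator_step(x, x_ref, dz):
    """Generator dynamics = approximate dynamics + restoring force."""
    return approximate_flow_map(x, dz) + dz * restoring_force(x, x_ref)

dz, n_steps = 0.01, 100
x_plain = x_corr = 1.1      # both trajectories start with the same initial error
ref = 1.0                   # reference trajectory (same scheme, exact start)
for _ in range(n_steps):
    x_plain = approximate_flow_map(x_plain, dz)
    x_corr = generator_step(x_corr, ref, dz)
    ref = approximate_flow_map(ref, dz)
```

Under repeated application, the corrected trajectory contracts toward the reference faster than the uncorrected one, which is the error-correcting behavior the restoring force is meant to provide.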


SLIDE 13

Figure: Lorenz system. Comparison of the ground truth x1(z), computed with the Euler scheme with stepsize δz = 10^−4 (blue dots), the GAN prediction with stepsize ∆z = 1.5 × 10^−2 (red crosses), and the Euler-scheme prediction with stepsize ∆z = 1.5 × 10^−2 (green triangles). (a) 2 × 10^4 samples of noisy data without enforced constraints during training; (b) 2 × 10^4 samples of noisy data with enforced constraints during training.


SLIDE 14

Current and future work

We have obtained results for this construction both for unsupervised learning (GANs) and for supervised learning (adding penalty terms to the objective function). We will explore similar constructions for reinforcement learning, where the role of noise is played by stochastic exploration of the action-policy space. Implementation in all three major modes of learning allows a comparison of the relative merits of the construction in each setting. It can also be combined with online data acquisition to develop models that adapt to a changing environment.

P. Stinis, T. Hagge, A.M. Tartakovsky and E. Yeung, Enforcing constraints for interpolation and extrapolation in Generative Adversarial Networks (2018), arXiv:1803.08182
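For the supervised-learning analogue, the constraint enters as a penalty term added to the objective. A sketch; the loss weight λ, the constraint function, and the data below are illustrative:

```python
import numpy as np

def constrained_loss(pred, target, z, f, lam=1.0):
    """Supervised analogue of the constrained GAN: data-misfit MSE plus a
    penalty on the residual of the known constraint x = f(z)."""
    mse = np.mean((pred - target) ** 2)
    penalty = np.mean((pred - f(z)) ** 2)   # constraint residual, squared
    return mse + lam * penalty

z = np.linspace(0.0, 1.0, 50)
f = lambda t: t ** 2                        # illustrative constraint function
target = f(z)

loss_exact = constrained_loss(f(z), target, z, f)       # predictions satisfy the constraint
loss_off = constrained_loss(f(z) + 0.1, target, z, f)   # constant offset is penalized twice
```

A prediction that violates the constraint pays both through the data misfit and through the penalty, so the penalty weight λ plays the role the augmented residual input plays in the GAN setting.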
