
SLIDE 1

Adversarial Learned Molecular Graph Inference and Generation

Sebastian Pölsterl and Christian Wachinger

Artificial Intelligence in Medical Imaging, Ludwig-Maximilians-Universität, Munich

European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, September 14–18, 2020

SLIDE 4

De Novo Chemical Design

Goal

Find a molecule with certain properties, e.g., an antiviral drug to inhibit SARS-CoV-2 replication.

Problem

  • 1. The space of molecules is extremely large – on the order of 10³³ drug-like molecules.¹
  • 2. Molecules are discrete in nature, which prevents the use of gradient-based optimization.

Solution

Use a deep generative model to project molecules into a continuous latent space and perform gradient-based optimization there.

  • ¹P. G. Polishchuk et al. (2013). “Estimation of the size of drug-like chemical space based on GDB-17 data”. In: Journal of Computer-Aided Molecular Design 27.8, pp. 675–679.

  • S. Pölsterl and C. Wachinger (AI-Med)

Adversarial Learned Molecular Graph Inference and Generation 2 of 18
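The latent-space optimization idea can be sketched with a toy stand-in (plain NumPy; the quadratic property_score below is a hypothetical surrogate with optimum z_star, not the paper's trained predictor):

```python
import numpy as np

# Sketch of gradient-based molecule optimization in a continuous latent
# space. Assumes a trained generative model whose latent codes z can be
# decoded into molecules; the differentiable property predictor here is
# a toy quadratic with its optimum at z_star.
z_star = np.array([1.0, -2.0])

def property_score(z):
    return -np.sum((z - z_star) ** 2)      # higher is better

def grad_score(z):
    return -2.0 * (z - z_star)             # analytic gradient of the toy score

z = np.zeros(2)                            # start at the prior mean
for _ in range(200):
    z = z + 0.05 * grad_score(z)           # plain gradient ascent on z

# z now lies close to z_star; decoding z would yield the optimized molecule.
print(np.round(z, 3))
```

In a real system the gradient comes from a neural property predictor, and the decoder turns the optimized z back into a molecular graph.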

SLIDE 5

Graph Variational Autoencoder

[Diagram: input graph G → encoder → latent space z, z ∼ prior distribution → decoder → output graph G̃]

Reconstruction loss L(G, G̃): requires solving an expensive graph isomorphism problem!

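Why the reconstruction loss is troublesome can be seen on a three-node toy graph (a NumPy sketch, independent of the paper's code): relabeling the nodes leaves the molecule unchanged, yet a naive elementwise loss between adjacency matrices is nonzero, so matching a reconstruction to its input requires aligning node orders, i.e., solving graph isomorphism.

```python
import numpy as np

# A path graph 0-1-2 and the same graph with nodes relabeled.
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]])

P = np.eye(3)[[2, 0, 1]]        # permutation matrix for the relabeling
A_perm = P @ A @ P.T            # isomorphic graph, different matrix

# An elementwise reconstruction loss sees a difference although both
# matrices encode the same molecule.
naive_loss = np.abs(A - A_perm).sum()
print(naive_loss)               # nonzero despite isomorphism
```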


SLIDE 8

Prior Work I

Inference (Encoder): various graph convolutional neural networks.

Generation (Decoder):

  • In a single step using an MLP (De Cao and Kipf, 2018; Ma et al., 2018; Simonovsky and Komodakis, 2018).
  • Sequentially using an RNN (Bradshaw et al., 2019; Jin et al., 2018; Li, Zhang, et al., 2018; Li, Vinyals, et al., 2018; Liu et al., 2018; Podda et al., 2020; Samanta et al., 2019; You et al., 2018).



SLIDE 10

Prior Work II

Generative Models for Molecular Graphs:

  • Likelihood-based (VAEs): compute the reconstruction loss by (i) traversing nodes in a fixed order, (ii) Monte Carlo sampling, or (iii) graph matching.
  • Adversarial: MolGAN is the only such model, but it cannot do inference (De Cao and Kipf, 2018).

Generative Models for Continuous Data:

  • Adversarial Learned Inference (ALI) and its extension ALICE learn an encoder/decoder pair without optimizing an explicit reconstruction loss (Dumoulin et al., 2017; Li, Liu, et al., 2017).

  • ALI & ALICE are only applicable to continuous-valued data, such as images.

SLIDE 15

Our Contributions

  • We propose Adversarial Learned Molecular Graph Inference and Generation (ALMGIG), which
  • 1. does not require solving an expensive graph isomorphism problem,
  • 2. performs inference over graphs by extending the Graph Isomorphism Network to multi-graphs (Xu et al., 2019),
  • 3. generates discrete data (atoms and bonds) via the Gumbel-softmax trick (Jang et al., 2017; Maddison et al., 2017),
  • 4. generates chemically valid molecules by enforcing connectivity constraints via penalty terms (Ma et al., 2018).

  • We show that current evaluation metrics are flawed, and propose a better metric to assess the distribution-learning capabilities of generative models.

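Contribution 3 relies on the Gumbel-softmax trick; a minimal sketch (NumPy, not the paper's implementation) of drawing one relaxed categorical sample, e.g. for an atom type, looks like this:

```python
import numpy as np

rng = np.random.default_rng(0)

def gumbel_softmax(logits, tau=1.0):
    """One relaxed categorical sample: argmax(logits + Gumbel noise) is an
    exact categorical draw; the temperature-tau softmax relaxes the argmax
    so that gradients can flow through the sample."""
    g = -np.log(-np.log(rng.uniform(size=logits.shape)))  # Gumbel(0, 1) noise
    y = (logits + g) / tau
    y = np.exp(y - y.max())                               # numerically stable softmax
    return y / y.sum()

atom_logits = np.log(np.array([0.1, 0.2, 0.7]))  # scores for 3 atom types
sample = gumbel_softmax(atom_logits, tau=0.1)    # low tau -> near one-hot
print(sample.round(3))
```

As tau → 0 the sample approaches a one-hot vector, so the generator can emit discrete atoms and bonds while remaining differentiable.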

SLIDE 23

Adversarial Learned Inference

Dumoulin et al. (2017)

[Diagram: the encoder gφ(G, ε) produces z̃ ∼ qφ(z | G) from data G ∼ q(G); the generator gθ(z, ε) produces G̃ ∼ qθ(G | z) from the prior z ∼ N(0, I); a joint discriminator Dψ distinguishes pairs (G, z̃) from pairs (G̃, z)]

  • Training: match the joint distributions over graphs and latent variables
  • 1. encoder joint distribution: qφ(G, z) = q(G) qφ(z | G)
  • 2. decoder joint distribution: pθ(G, z) = pz(z) qθ(G | z)
  • However, the reconstruction G̃′ remains unconstrained.

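The matching objective can be illustrated on scalar toy data (NumPy stand-ins for the encoder, generator, and data distribution; not the actual graph networks): the discriminator only ever sees joint pairs drawn from one of the two paths, so no per-sample reconstruction loss is needed.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy 1-D stand-ins: x plays the role of a graph G, z the latent code.
def encoder(x):                     # z~ ~ q_phi(z | x)
    return 0.5 * x + 0.1 * rng.normal(size=x.shape)

def generator(z):                   # x~ ~ q_theta(x | z)
    return 2.0 * z + 0.1 * rng.normal(size=z.shape)

x_real = rng.normal(loc=3.0, size=1000)   # data distribution q(x)
z_prior = rng.normal(size=1000)           # prior p_z(z)

# The two kinds of joint samples the ALI discriminator must tell apart:
encoder_pairs = np.stack([x_real, encoder(x_real)], axis=1)        # (x, z~)
generator_pairs = np.stack([generator(z_prior), z_prior], axis=1)  # (x~, z)

# Training drives these two joint distributions together; at the optimum
# the discriminator cannot distinguish them.
print(encoder_pairs.mean(axis=0), generator_pairs.mean(axis=0))
```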

SLIDE 26

Adversarial Learned Inference

ALICE (Li, Liu, et al., 2017)

[Diagram: the ALI architecture extended with a cycle discriminator Dη that compares the input graph G with its reconstruction G̃′ ∼ qθ(G | z̃)]

  • ALICE adds a cycle discriminator on pairs of graphs to enforce consistent reconstruction.
  • At the optimum, the encoder and decoder joint distributions match, and G̃′ = G.
  • However, in practice reaching the optimum is extremely hard.

SLIDE 27

Adversarial Learned Inference

Unary Discriminator

[Diagram: the ALICE architecture extended with a unary discriminator Dξ on graphs alone]

  • Unary discriminator: match q(G) and qθ(G | z)
  • Joint discriminator: match q(G) qφ(z | G) and pz(z) qθ(G | z)
  • Cycle discriminator: match q(G) and qθ(G | z̃)

  • Unary discriminator facilitates training when the joint distribution is difficult to learn.


SLIDE 31

Experiments

Data: molecules from the QM9 dataset (≤9 heavy atoms, 4 atom types, 3 bond types).

Competing Methods

  • CGVAE (Liu et al., 2018), NeVAE (Samanta et al., 2019): graph-based VAEs with RNN decoder and valence constraints.
  • GrammarVAE (Kusner et al., 2017): SMILES-based VAE.
  • MolGAN (De Cao and Kipf, 2018): graph-based WGAN without an encoder.
  • Random: chooses atoms and bonds randomly, but honors valence constraints.

SLIDE 36

Simple Metrics are Flawed

  • Validity: Percentage of valid molecules.
  • Uniqueness: Percentage of unique molecules.
  • Novelty: Percentage of unique molecules not in the data.
  • Metrics do not capture what models learned from the training data.

[Figure: a series of near-identical fluorine-dominated molecules ⇒ 100% Validity, 100% Uniqueness, 100% Novelty]

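For concreteness, the three metrics can be computed from canonical molecule strings (a pure-Python sketch with made-up SMILES; real pipelines canonicalize with a chemistry toolkit and check valence rules for validity):

```python
# None marks a generation that failed the validity check.
generated = ["CCO", "CCO", "CC=O", None, "C#N"]
training = {"CCO", "CCN"}                 # canonical training-set strings

valid = [s for s in generated if s is not None]
unique = set(valid)

validity = len(valid) / len(generated)            # fraction of valid molecules
uniqueness = len(unique) / len(valid)             # fraction of distinct valid ones
novelty = len(unique - training) / len(unique)    # distinct ones unseen in training

print(validity, uniqueness, novelty)
```

A generator that emits endless trivial variants of one scaffold maxes out all three numbers, which is exactly the flaw the slide illustrates.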

SLIDE 40

Advanced Metrics

What we are actually interested in: Can we generate chemically meaningful molecules with similar properties as in the training data?

[Plot: histogram of a chemical descriptor for the reference distribution P and the generated distribution Q]

  • Brown et al. (2019) compared the distributions of 10 chemical descriptors in terms of the KL divergence D_KL(P ‖ Q).
  • We propose using the Earth Mover’s Distance (EMD) instead:

                                     EMD   KL div
    Indiscernibility of identicals    ✓      ✓
    Symmetry                          ✓      ✗
    Triangle inequality               ✓      ✗
    Quantify spatial shift            ✓      ✗
    Non-overlapping supports          ✓      ✗

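The practical difference shows up for histograms with disjoint supports (using SciPy's `wasserstein_distance` as an EMD stand-in on a toy descriptor; not the paper's evaluation code):

```python
import numpy as np
from scipy.stats import wasserstein_distance

support = np.arange(6)                          # bins of a toy descriptor
p = np.array([0.5, 0.5, 0.0, 0.0, 0.0, 0.0])    # reference distribution P
q = np.array([0.0, 0.0, 0.0, 0.0, 0.5, 0.5])    # generated distribution Q

# EMD: total mass times distance moved -> finite and interpretable.
emd = wasserstein_distance(support, support, u_weights=p, v_weights=q)
print(emd)   # 4.0: all mass shifted four bins

# KL divergence: infinite whenever P puts mass where Q has none.
with np.errstate(divide="ignore", invalid="ignore"):
    kl = np.sum(np.where(p > 0, p * np.log(p / q), 0.0))
print(kl)    # inf
```

This is why EMD still ranks models sensibly when generated and reference descriptor histograms barely overlap, while KL divergence saturates.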

SLIDE 41

Comparison – Advanced Metrics

[Bar chart: distribution learning w.r.t. the test set, mean exp(−EMD), higher is better: ALMGIG 0.753, CGVAE 0.746, MolGAN 0.569, NeVAE 0.457, GrammarVAE 0.187, Random 0.350. Detail panels compare ALMGIG and CGVAE against the observed distribution: molecular weight (EMD = 0.166 vs. 1.022) and number of aliphatic rings (EMD = 0.111 vs. 0.649)]




SLIDE 47

Comparison – Adversarial Learning Scheme

[Bar chart: ablation of the adversarial learning scheme, mean exp(−EMD) w.r.t. the test set, higher is better: ALMGIG 0.753, ALICE 0.613, ALI 0.614, WGAN 0.573. Detail panels for ALICE against the observed distribution: molecular weight (EMD = 1.328) and number of aromatic rings (EMD = 0.813)]


SLIDE 52

Conclusion

  • 1. ALMGIG can be trained without computing a reconstruction loss, which would require solving an expensive graph isomorphism problem.
  • 2. ALMGIG represents the distribution over the space of molecules more accurately than previous methods.
  • 3. The common validation metrics validity, novelty, and uniqueness are insufficient to properly assess the performance of methods.
  • 4. Distributions of chemical descriptors provide detailed insight into what types of molecules a model can generate.
  • 5. Code available at https://github.com/ai-med/almgig

SLIDE 53

References I

Bradshaw, J., B. Paige, M. J. Kusner, M. Segler, and J. M. Hernández-Lobato (2019). “A Model to Search for Synthesizable Molecules”. In: Advances in Neural Information Processing Systems 32, pp. 7937–7949.

Brown, N., M. Fiscato, M. H. Segler, and A. C. Vaucher (2019). “GuacaMol: Benchmarking Models for de Novo Molecular Design”. In: Journal of Chemical Information and Modeling 59.3, pp. 1096–1108.

De Cao, N. and T. Kipf (2018). “MolGAN: An implicit generative model for small molecular graphs”.

Dumoulin, V., I. Belghazi, B. Poole, O. Mastropietro, A. Lamb, M. Arjovsky, and A. Courville (2017). “Adversarially learned inference”. In: 5th International Conference on Learning Representations.

Jang, E., S. Gu, and B. Poole (2017). “Categorical Reparameterization with Gumbel-Softmax”. In: 5th International Conference on Learning Representations.

Jin, W., R. Barzilay, and T. Jaakkola (2018). “Junction Tree Variational Autoencoder for Molecular Graph Generation”. In: 35th International Conference on Machine Learning, pp. 2323–2332.

Kusner, M. J., B. Paige, and J. M. Hernández-Lobato (2017). “Grammar Variational Autoencoder”. In: 34th International Conference on Machine Learning, pp. 1945–1954.


SLIDE 54

References II

Li, C., H. Liu, C. Chen, Y. Pu, L. Chen, R. Henao, and L. Carin (2017). “ALICE: Towards Understanding Adversarial Learning for Joint Distribution Matching”. In: Advances in Neural Information Processing Systems 30, pp. 5495–5503.

Li, Y., L. Zhang, and Z. Liu (2018). “Multi-objective de novo drug design with conditional graph generative model”. In: Journal of Cheminformatics 10, p. 33.

Li, Y., O. Vinyals, C. Dyer, R. Pascanu, and P. Battaglia (2018). “Learning Deep Generative Models of Graphs”.

Liu, Q., M. Allamanis, M. Brockschmidt, and A. Gaunt (2018). “Constrained Graph Variational Autoencoders for Molecule Design”. In: Advances in Neural Information Processing Systems 31, pp. 7806–7815.

Ma, T., J. Chen, and C. Xiao (2018). “Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders”. In: Advances in Neural Information Processing Systems 31, pp. 7113–7124.

Maddison, C. J., A. Mnih, and Y. W. Teh (2017). “The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables”. In: 5th International Conference on Learning Representations.


SLIDE 55

References III

Podda, M., D. Bacciu, and A. Micheli (2020). “A Deep Generative Model for Fragment-Based Molecule Generation”. In: Proc. of AISTATS.

Polishchuk, P. G., T. I. Madzhidov, and A. Varnek (2013). “Estimation of the size of drug-like chemical space based on GDB-17 data”. In: Journal of Computer-Aided Molecular Design 27.8, pp. 675–679.

Samanta, B., A. De, G. Jana, N. Ganguly, and M. Gomez-Rodriguez (2019). “NeVAE: A Deep Generative Model for Molecular Graphs”. In: 33rd AAAI Conference on Artificial Intelligence, pp. 1110–1117.

Simonovsky, M. and N. Komodakis (2018). “GraphVAE: Towards Generation of Small Graphs Using Variational Autoencoders”. In: ICANN, pp. 412–422.

Xu, K., W. Hu, J. Leskovec, and S. Jegelka (2019). “How Powerful are Graph Neural Networks?” In: 7th International Conference on Learning Representations.

You, J., B. Liu, R. Ying, V. Pande, and J. Leskovec (2018). “Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation”. In: Advances in Neural Information Processing Systems 31, pp. 6412–6422.