Neural Networks Designing New Drugs Mariya Popova, Olexandr Isayev, - - PowerPoint PPT Presentation

neural networks designing new drugs
SMART_READER_LITE
LIVE PREVIEW

Neural Networks Designing New Drugs Mariya Popova, Olexandr Isayev, - - PowerPoint PPT Presentation

Neural Networks Designing New Drugs Mariya Popova, Olexandr Isayev, Alexandr Tropsha Drug g Discovery Timeline ne 2 Conventio ional al Vir irtual tual Scre reening ing Pip ipeli line CHEMICAL AL CHEMIC MICAL PREDICTIVE PROPERTY/


slide-1
SLIDE 1

Neural Networks Designing New Drugs

Mariya Popova, Olexandr Isayev, Alexandr Tropsha

slide-2
SLIDE 2

Drug g Discovery Timeline ne

2

slide-3
SLIDE 3

Conventio ional al Vir irtual tual Scre reening ing Pip ipeli line

3

~106 – 109 molecules CHEMICAL AL STRUCTURES RES CHEMIC MICAL DESCRIPTO IPTORS PROPERTY/ ACTIVITY Actives PREDICTIVE MODELS Chemical database Inactives VIRTUAL L SCREEN EENING

slide-4
SLIDE 4

Why Do We Need Generat rativ ive Mode dels? ls?

4

  • Biggest database of

molecules has ~109 compounds

  • Estimates for the size of

chemical space – up to 1060

  • Searching for new drug

candidates in existing databases – observation bias

slide-5
SLIDE 5

Generati rative Models ls Ove vervie view

5 Sanchez-Lengeling, Benjamin, and Alán Aspuru-Guzik. "Inverse molecular design using machine learning: Generative models for matter engineering." Science 361.6400 (2018): 360-365.

slide-6
SLIDE 6

Our Appr proach

  • ach

6

  • Generative model

for SMILES 𝐻

  • Predictive model for

the desired property 𝑄

  • 𝐻 and 𝑄 combined

with RL in one pipeline to bias the property of generated molecules.

Popova et. al. "Deep reinforcement learning for de novo drug design." Science advances 4.7 (2018): eaap7885.

slide-7
SLIDE 7

SMIL ILES ES-ba base sed d Generati rative Mode del

7

  • SMILES (simplif

lified ied molec ecula lar-in input t line- entry y system) is a sequence of characters then encodes the molecular graph

  • One sequence = one molecule
  • Has alphabet

Use language model for producing novel SMILES strings 𝑞 𝑡𝑢 𝑡1 … 𝑡𝑢−1; 𝜄 = 𝑔(𝑡1 … 𝑡𝑢−1|𝜄)

slide-8
SLIDE 8

Generati rative Model: l: train aining g mode

8

Stack GRU GRU Stack GRU Stack GRU

[0.27 0.15 … 0.03] [0.63 0.14 … 0.23] [0.45 0.66 … 0.87] [0.33 0.13 … 0.01] [0.90 0.05 … 0.01] [0.03 0.50 … 0.03] [0.7 0.15 … 0.02] [0.07 0.13 … 0.77]

<START> <END> C O N C С O Softmax loss

  • Trained on 1.5 million of drug-like compounds from ChEMBL in a supervised manner
slide-9
SLIDE 9

Generati rative Model: l: in infe feren rence mode de

9

Embedding vectors Probabilities of the next character sampling sampling sampling sampling Model takes its own predictions as next input character: Growing SMILES

𝑞 𝑡𝑢 𝑡1 … 𝑡𝑢−1; 𝜄 = 𝑔(𝑡1 … 𝑡𝑢−1|𝜄)

slide-10
SLIDE 10

RL fo form rmulati ulation

  • n fo

for SMIL ILES ES generat ratio ion

  • Action – generate symbol 𝑡
  • Set of actions – SMILES alphabet 𝐵
  • State – generated prefix 𝑡1𝑡2 … 𝑡𝑢−1
  • Set of states – set of all possible strings in SMILES alphabet 𝐵

with lengths from 0 to T -- 𝔹 = {𝐵𝑢, 𝑢 = 0 … 𝑈}

  • Environment – set of states 𝔹, set of actions 𝐵 and transition

probabilities 𝑞 𝑡𝑢 = 𝑏 𝑡1 … 𝑡𝑢−1; 𝜄 , 𝑏 ∈ 𝐵

  • Reward function – 𝑆 𝑇𝑢
  • Objective – maximize the expected reward:

𝔽 𝑆 𝑇𝑢 𝜄 = σ𝑇∈𝔹 𝑞 𝑇 𝜄 𝑆 𝑇 → 𝑛𝑏𝑦𝜄

10

𝑞 𝑡𝑢 𝑡1 … 𝑡𝑢−1; 𝜄 = 𝑔(𝑡1 … 𝑡𝑢−1|𝜄)

slide-11
SLIDE 11

RL Pip ipeli line For Molec lecul ule Generati ration

  • n

11

  • Generative model

is a policy network

  • Predictive model is

a simulator of the real-world

  • Reward is assigned

based on the property prediction and researcher’s

  • bjective
slide-12
SLIDE 12

Re Resul ults: ts: opti timi mizi zing li lipophil ilicity ty

  • Lipophilicity is possibly the lost important physicochemical property of

a potential drug

  • It plays a role in solubility, absorption, membrane penetration, etc
  • Log P is quantitative measure of lipophilicity, is the ratio
  • f concentrations of a compound in a mixture of

two immiscible phases at equilibrium

  • Log P is a component of Lipinski’s Rule of 5 a rule of thumb to predict

drug-likeness

  • According to Lipinski’s rule must be in a range between 0 and 5 for

drug-like molecules

slide-13
SLIDE 13

Predi dictiv ive Model l fo for r lo log P

13

  • SMILES-based RNN
  • Dataset of 14k

compounds with logP measurements

  • 5 fold cross-

validation

  • RMSE = 0.57
  • 𝑆2 = 0.90
slide-14
SLIDE 14

Re Resul ults: ts: opti timi mizi zing li lipophil ilicity ty

14

Log P value Reward value

𝑆 𝑇 = ቊ11, 𝑗𝑔 𝑚𝑝𝑕𝑄 𝑡 ∈ [0.5; 4.5] 0, 𝑝𝑢ℎ𝑓𝑠𝑥𝑗𝑡𝑓

slide-15
SLIDE 15

Re Resul ults: ts: opti timi mizi zing li lipophil ilicity ty

15

Values of the reward function during training Reward value Training iteration

slide-16
SLIDE 16

Re Resul ults: ts: opti timi mizi zing li lipophil ilicity ty

16

Predicted log P values Distribution of unbiased and optimized log P values

  • Statistics are

calculated from 10000 randomly generated SMILES

  • 100% of optimized

SMILES were predicted to have log P within drug-like region

slide-17
SLIDE 17

Worked well for a relatively simple physical property What if a molecule with a high reward is a rear event? It could take very long until the model receives a high or non-zero reward

Lim imit itat atio ions

17

slide-18
SLIDE 18

Tricks ks

  • Flexible reward

– First give high reward for worse molecules, then gradually increase threshold

  • Fine-tuning on a dataset of “good” molecules in a supervised manner

– Fine-tune on generated molecules with high rewards – Fine-tune on experimental ground truth data – High exploitation, low exploration

  • Using experience replay for policy gradient optimization

– Remember generated molecules with high rewards and replay on them – Replay on experimental ground truth data

18

slide-19
SLIDE 19

Epidermal growth factor receptor (EGFR)

  • Associated with cancer and inflammatory disease
  • Has ~10k experimental measurements for molecules

More re results: ts: EGFR R

19

slide-20
SLIDE 20

More re results: ts: EGFR

  • Built a binary classification (active/inactive) predictive model for

EGFR (F-1 score 0.9)

  • Took pretrained on ChEMBL generative network
  • Generated 10k random molecules and predicted probability of class

“active”

20

slide-21
SLIDE 21

More re results: ts: EGFR

21

Probability of class “active”

slide-22
SLIDE 22

More re results: ts: EGFR

  • Flexible reward:

𝑆 𝑇 = ቊ10, 𝑗𝑔 𝑄 𝑇 > 𝑢ℎ𝑠𝑓𝑡ℎ𝑝𝑚𝑒 0, 𝑝𝑢ℎ𝑓𝑠𝑥𝑗𝑡𝑓

  • Initial threshold = 0.05
  • After every update we generate 10k compound
  • If 15% of them predicted to have property > threshold, we increase

threshold by 0.05

  • Fine-tuning on generated molecules with high rewards
  • Experience replay on experimental measurements and on generated

molecules with high rewards

22

slide-23
SLIDE 23

More re results: ts: EGFR

23

Probability of class “active”

Unbiased Maximized

slide-24
SLIDE 24

Experimental validation:

  • Selected several

commercially available and validated our results experimentally

  • Found 4 active compounds

More re results: ts: EGFR

24

slide-25
SLIDE 25

Futur ure work

  • Develop graph-based generative models:

– SMILES-based models generate some amount of invalid molecules

  • Develop lead optimization methods:

– Start from a given scaffold/structure – Impossible to do with SMILES

  • Develop models for predicting route for synthesis:

– To be able to perform custom synthesis

25

slide-26
SLIDE 26

Code Lin inks

26

RL for de novo drug design

https://github.com/isayev/ReLeaSE

slide-27
SLIDE 27

Ac Acknow

  • wledge

dgements nts

Univ iver ersit ity y of North Carolina

  • lina at Chapel

pel Hill: l: Olexandr Isayev Alexandr Tropsha

27