Fragment binding prediction using unsupervised learning of ligand - - PowerPoint PPT Presentation

fragment binding prediction using unsupervised learning
SMART_READER_LITE
LIVE PREVIEW

Fragment binding prediction using unsupervised learning of ligand - - PowerPoint PPT Presentation

Fragment binding prediction using unsupervised learning of ligand substructure binding sites Grace Tang Altman Lab July 20, 2013 Berlin, Germany 1 Structure Based Virtual Screening Virtual Databases End Goal Screening ZINC 1 21 million


slide-1
SLIDE 1

Fragment binding prediction using unsupervised learning of ligand substructure binding sites

Grace Tang Altman Lab July 20, 2013 Berlin, Germany

1

slide-2
SLIDE 2

GDB-133 970 million drug-like small molecules 21 million purchasable compounds ZINC1 PubChem2 48 million compounds Databases End Goal Virtual Screening

Structure Based Virtual Screening

2

  • 1. Irwin, J.J., et al., ZINC: A Free Tool to Discover

Chemistry for Biology. J Chem Inf Model, 2012.

  • 2. Bolton E., et al., PubChem: Integrated Platform of Small Molecules and

Biological Activities. Annual Reports in Computational Chemistry, 2008.

slide-3
SLIDE 3

Adding Prior Knowledge

970 million drug-like small molecules 21 million purchasable compounds 48 million compounds Databases End Goal Virtual Screening

3

contains fragment: X contains fragment: Y GDB-133 ZINC1 PubChem2

3. Blum, L.C. and J.L. Reymond, 970 million druglike small molecules for virtual screening in the chemical universe database GDB-13. J Am Chem Soc, 2009. 131(25): p. 8732-3.

slide-4
SLIDE 4

Method Overview

Query Protein Fragment Predictions Knowledge Base

4

slide-5
SLIDE 5

Derivation of Knowledge Base

Atom Type Atom Element Residue Name Residue Class Partial Charge Hydrophobicity Aromatic etc.

FEATURE Protein-Ligand Complexes1 Structural Information2

1. Berman, H.M., et al., The Protein Data Bank. Nucleic Acids Res, 2000. 28(1): p. 235-42. 2. Halperin, I., et al., The FEATURE framework for protein function

  • annotation. BMC Genomics, 2008. 9 Suppl 2: p. S2.

5

slide-6
SLIDE 6

Derivation of Knowledge Base

Knowledge Base Protein-Ligand Complexes1 Fragment Information2

6

1. Berman, H.M., et al., The Protein Data Bank. Nucleic Acids Res, 2000. 28(1): p. 235-42. 2. Rahman, S.A., et al., Small Molecule Subgraph Detector (SMSD)

  • toolkit. J Cheminform, 2009. 1(1): p. 12.
slide-7
SLIDE 7

Method

Fisher’s p-value Query Protein Hypergeometric p-value 5 Nearest Non-Homologous Neighbors

7

slide-8
SLIDE 8

Validation Ligands

ADE adenine n = 123 ADP adenosine-5’-diphosphate n = 2640 FAD flavin-adenine dinucleotide n = 2769 NAD nicotinamide-adenine dinucleotide n = 2309 VIB thiamin, vitamin B1 n = 19 TPP thiamine diphosphate n = 217 PLP pyridoxal-5’-phosphate n = 1227 TCL triclosan n = 88

ADE FAD NAD VIB PLP TCL TPP ADP

8

slide-9
SLIDE 9

Validation

PDB ID: 2X58

Protein – ADP Complex Predictions and Moieties

adenine ring 1 & 2 ribose phosphate 1 & 2

# Correct Predictions # Total Predictions Precision: # Predicted Moieties # Bound Moieties Recall:

0% 20% 40% 60% 80% 100% TCL PLP VIB TPP NAD FAD ADP ADE PRECISION RECALL

9

slide-10
SLIDE 10

Exotoxin A

Exotoxin A from Pseudomonas aeruginosa is an ADP-ribosyltransferase that inactivates eukaryotic ribosomal elongation factor 2, preventing protein synthesis and triggering cell necrosis.

Query Predicted Fragment

CID: 450318 (p-value 1 × 10-28)

Example Nearest Microenvironment Neighbors

PDB ID: 3GEY PDB ID: 3KI2 PDB ID: 3KI7 PDB ID: 3KCZ PDB ID: 3L3M PDB ID: 1WOK PDB ID: 3B78 10

slide-11
SLIDE 11

Exotoxin A

Exotoxin A from Pseudomonas aeruginosa is an ADP-ribosyltransferase that inactivates eukaryotic ribosomal elongation factor 2, preventing protein synthesis and triggering cell necrosis.

Ligands Bound by Nearest Microenvironment Neighbors Ki = 140 nM Query Predicted Fragment

CID: 450318 (p-value 1 × 10-28)

PDB ID: 3B78 11

slide-12
SLIDE 12

Exotoxin A

Exotoxin A from Pseudomonas aeruginosa is an ADP-ribosyltransferase that inactivates eukaryotic ribosomal elongation factor 2, preventing protein synthesis and triggering cell necrosis.

Structural validation of predicted fragment Ki = 140 nM Query Predicted Fragment

CID: 450318 (p-value 1 × 10-28)

PDB ID: 3B78 12 PDB ID: 1XK9

slide-13
SLIDE 13

Adding Fragment Knowledge

GDB-13 970 million drug-like small molecules 21 million purchasable compounds ZINC PubChem 48 million compounds Databases End Goal Virtual Screening contains fragment:

13

slide-14
SLIDE 14

Acknowledgements

  • Russ Altman
  • Altman Lab
  • Funding Sources
  • Travel funding to ISMB/ECCB 2013 was generously provided by ISCB

14

slide-15
SLIDE 15

Thank You!

gwtang@stanford.edu 3DSIG: Poster … 046 ISMB/ECCB: Poster … L081

15