exploring activity cliffs from a chemoinformatics
play

Exploring Activity Cliffs from a Chemoinformatics Perspective Jrgen - PowerPoint PPT Presentation

Exploring Activity Cliffs from a Chemoinformatics Perspective Jrgen Bajorath Life Science Informatics University of Bonn Activity Cliff Concept Activity cliff is generally defined as a pair of structurally similar active compounds with


  1. Exploring Activity Cliffs from a Chemoinformatics Perspective Jürgen Bajorath Life Science Informatics University of Bonn

  2. Activity Cliff Concept ¡ Activity cliff is generally defined as a pair of structurally similar active compounds with a large difference in potency 2390 nM 6 nM Analogs Paradigm: “small chemical modifications – large biological effects“ è high SAR information content

  3. Activity Cliffs in Medicinal Chemistry ¡ Utility in SAR analysis and compound optimization ¡ Which compound to make next? ¡ Typically focused on individual compound series ¡ Methodological simplicity and chemical intuition are key to practical utility in med. chem.

  4. Activity Cliffs in Chemoinformatics ¡ Much stronger emphasis on methodological aspects ¡ Departure from individual series toward global analysis

  5. Activity Cliffs in Chemoinformatics ¡ Molecular representation dependence ¡ Large-scale compound data mining ¡ Activity cliff networks ¡ Prediction of activity cliffs

  6. Activity Cliffs ¡ Activity cliff is generally defined as a pair of structurally similar active compounds with a large difference in potency 2390 nM 6 nM Analogs Definition requires consideration of: Similarity criterion Potency difference criterion

  7. Activity Cliff Definition ¡ Alternative similarity criteria Fingerprint Tanimoto similarity MACCS Tc 0.85, ECFP4 Tc 0.55 Substructure-based similarity Matched molecular pairs, scaffolds ¡ Potency difference criterion Usually at least 1 or 2 orders of magnitude (10- or 100-fold)

  8. 1. Molecular Representations ¡ Activity cliff distribution is strongly influenced by selected molecular representations and similarity criteria ¡ Qualifying pairs ( QP s) − QPs are compound pairs exceeding a given similiarity threshold ¡ Activity cliff frequency − percentage of QPs with a more than 100-fold difference in potency

  9. Molecular Representation Dependence ¡ QPs and activity cliff distribution for six different fingerprints 1,076,177 ¡ 128 activity classes from ChEMBL with 563,445 more than 100 512,026 467,592 468,145 447,224 414,224 compounds 130,223 ¡ 35,021 unique 8.99% 8.88% 6.78% 7.43% 5.87% 5.47% 5.47% compounds 3.36% Consensus GpiDAPH3 MACCS ECFP4 FCFP4 Union TGD TGT Stumpfe D, Hu Y, Dimova D & Bajorath J. J Med Chem, 57, 18 (2014)

  10. Activity Cliff-Forming Compounds ¡ Percentage of compounds that form at least one activity cliff 64.5% ¡ Union of cliff-forming compounds: 41% 41.4% 37.2% 35.3% 34.2% 36.1% More than 64% of all compounds form at 14.7% least one cliff Consensus GpiDAPH3 MACCS ECFP4 FCFP4 Union TGD TGT 128 activity c lasses (>100 cpds) Stumpfe D, Hu Y, Dimova D & Bajorath J. J Med Chem, 57, 18 (2014) from ChEMBL

  11. MMPs as Molecular Represetation ¡ A M atched M olecular P air (MMP) is formed by two structurally related compounds that − differ only by a small structural change at a single site − are related by the exchange of a substructure (termed chemical transformation) MMP

  12. Transformation Size Restriction ¡ Transformation size-restricted MMPs were introduced to limit transformations to small and chemically intuitive replacements Examples of largest permitted transformations

  13. Preferred Activity Cliff Definition ¡ Transformation size-restricted MMPs - substructure-based similarity assessment (med. chem. focus) ¡ At least 100-fold difference in potency ¡ Equilibrium constants (K i ) MMP 4.6 pK i 7.2 pK i Stumpfe D & Bajorath J. J Chem Inf Model 52, 2348 (2012)

  14. Activity Cliff-Forming Compounds 35021 35021 35021 35021 35021 35021 35021 35021 35021 35021 35021 ¡ MMPs and six fingerprint representations 64.5% 65.6% ¡ MMPs yield smallest 41.4% 41% percentage of 37.2% 35.3% 34.2% 36.1% cliff compounds 27.5% 14.7% 10.9% Consensus (FP only) ECFP4 FCFP4 Consensus GpiDAPH3 MACCS MMP TGT TGD (FP only) Union Union 128 activity classes (>100 cpds) Stumpfe D, Hu Y, Dimova D & Bajorath J. J Med Chem, 57, 18 (2014) from ChEMBL

  15. 2. Large-Scale Data Mining Proportion of bioactive compounds forming activity cliffs ? Percentage of all bioactive compounds involved in the formation of activity cliffs (ChEMBL survey): 31.7% (ECFP4/Tanimoto-based cliffs) 22.8% ( MMP-cliffs )

  16. Large-Scale Data Mining Currently available high-confidence activity cliffs ? (ChEMBL version 17) 20,080 MMP-cliffs detected for 293 targets involving 11,783 unique active compounds

  17. Target Distribution 60 50 40 % MMP-cliffs 30 20 10 0 0 200 400 600 800 1000 1200 1400 1600 120 100 8 0 % Cliff-forming 60 compounds 40 2 0 0 0 200 400 600 800 1000 1200 1400 1600 # Compounds 414 activity classes from ChEMBL Hu Y, Stumpfe D, Bajorath J. F1000Research 2, 199 (2013)

  18. Target Distribution 60 50 40 For data set with 30 >200 cpds, activity 20 cliffs and cliff 10 compounds are 0 0 200 400 600 800 1000 1200 1400 1600 fairly evenly 120 distributed among 100 many different 8 0 targets 60 40 2 0 0 0 200 400 600 800 1000 1200 1400 1600 # Compounds 414 activity classes from ChEMBL

  19. Ligand Efficiency (LE) for MMP-Cliffs ¡ Changes in LE accompanying activity LE = pKi / MW cliff formation 12 Percentage of cliff partners ¡ Difference in LE between 10 weakly and highly potent 8 cliff partners 6 4 ¡ LE increase detected for 2 99.1% of all activity cliffs; 0 4 9 14 19 24 29 34 39 average Δ LE = 6.27 Ligand efficiency Weakly potent cliff partner Highly potent cliff partner de la Vega de Leon A & Bajorath J. AAPS J 16, 335 (2014)

  20. Lipophilic Efficiency (LipE) ¡ Changes in LipE accompanying activity cliff LipE = pK i - cLogP formation 12 Percentage of cliff partners ¡ Difference in LipE between 10 weakly and highly potent 8 cliff partners 6 4 ¡ LipE increase detected for 2 96.7% of all activity cliffs; 0 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 average Δ LipE = 2.42 Lipophilic efficiency Weakly potent cliff partner Highly potent cliff partner

  21. 3. Activity Cliff Network Analysis

  22. Isolated vs. Coordinated Cliffs ¡ ‘Isolated‘ cliffs: cliff partners are only involved in a single activity cliff ¡ ‘Coordinated‘ cliffs: cliff partners are involved in multiple and overlapping activity cliffs Cliff type Isolated cliffs % Coordinated cliffs % MACCS 1.4 98.6 ECFP4 2.2 97.8 MMP-cliffs 3.5 96.5 128 activity classes (>100 cpds) from ChEMBL

  23. Isolated vs. Coordinated Cliffs ¡ MMP-cliff network for serotonin 1d receptor ligands 46 compounds (nodes) 69 MMP-cliffs (edges) 2 isolated cliffs 67 coordinated cliffs highly potent cliff partner weakly potent cliff partner both highly and weakly potent cliff partner

  24. Global MMP-Cliff Network ¡ ChEMBL 17 ¡ 14,044 nodes (compounds) ¡ 20,080 edges (MMP-cliffs) ¡ Many separate components ¡ 2072 clusters Stumpfe D et al. & Bajorath J. J Chem Inf Model 54, 451 (2014)

  25. Activity Cliff Cluster Size Distibution Cluster size # Cluster ¡ 769 isolated cliffs 1-5 1463 6-10 306 ¡ 1303 coordinated cliff cluster 10-15 114 15-20 65 21-30 56 ¡ 26 clusters with > 50 31-40 27 compounds 41-50 15 51-60 11 ¡ 420 clusters comprising six to 61-70 4 15 compounds 71-80 2 81-90 3 91-100 2 101-152 4

  26. Node Degree Distribution Node degree # Nodes ¡ Average node degree 2.9 1-4 11878 5-9 1552 ¡ The union of all clusters follows a 10-14 341 power law 15-20 155 P(k)~k - γ 21-30 85 31-40 17 41-50 9 with γ having a value of 2.5, 51-60 4 which is characteristic of scale-free 61-70 3 networks ¡ Many densely connected nodes: activity cliff hubs

  27. Network Modification ¡ Deletion of all hubs with a degree ≥ 5 (2166 nodes, i.e. 15.4%)

  28. Network Modification ¡ Deletion of all hubs with a degree ≥ 10 (614 nodes, i.e. 4.4%)

  29. Global MMP-Cliff Network ¡ 2072 clusters ¡ 769 isolated cliffs ¡ 19,311 coordinated cliffs in 1303 clusters ¡ 450 cluster topologies with 1 to 769 instances Stumpfe D et al. & Bajorath J. J Chem Inf Model 54, 451 (2014)

  30. Activity Cliff Cluster Topologies ¡ Topologies with ≥ 3 instances ¡ Identification of 3 recurrent main topologies Star n Chain n Rectangle

  31. Activity Cliff Cluster Topologies ¡ Topologies with ≥ 3 instances ¡ Cover 861 of 1303 clusters main topologies Star n Chain n Rectangle

  32. Main Topologies and Extensions Main Extensions topology of main topology Star Twin Star n n n Chain Modified Chain n n Modified Rectangle Rectangle n

  33. Main Topologies and Extensions Main Extensions Hybrid Irregular topology of main topology topologies topologies Star n n n Chain n n Rectangle n

  34. Star Topology Example ¡ Adenosine A3 receptor ligands 6.2 pKi 4.8 pKi 5.6 pKi 8.3 pK i 5.1 pK i 5.5 pK i

  35. Star Topology Example ¡ Adenosine A3 receptor ligands 6.2 pK i 6.7 pK i 9.1 pK i 6.7 pK i 6.9 pK i

  36. Rectangle Topology Example ¡ Adenosine A2b receptor ligands 8.0 pK i 6.0 pK i 5.4 pK i 8.3 pK i

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend