ABCD Daylight User Group Meeting Cambridge, 4.-5.11.04 Application - - PowerPoint PPT Presentation

abcd
SMART_READER_LITE
LIVE PREVIEW

ABCD Daylight User Group Meeting Cambridge, 4.-5.11.04 Application - - PowerPoint PPT Presentation

ABCD Daylight User Group Meeting Cambridge, 4.-5.11.04 Application of Daylight Fingerprints to Virtual Screening Uta Lessel Boehringer Ingelheim Pharma GmbH & Co. KG Department of Lead Discovery ABCD Ligand Based Virtual Screening


slide-1
SLIDE 1

ABCD

Daylight User Group Meeting Cambridge, 4.-5.11.04

Application of Daylight Fingerprints to Virtual Screening

Uta Lessel Boehringer Ingelheim Pharma GmbH & Co. KG Department of Lead Discovery

slide-2
SLIDE 2

ABCD

Ligand Based Virtual Screening Goal:

  • Selection of subsets with increased hit rates from

a data set Procedure:

  • Looking for compounds similar to known actives
  • Ranking of data sets with actives and inactives

according to decreasing similarities Evaluation:

  • E.g. determination of enrichment curves
slide-3
SLIDE 3

ABCD

Study Aim: Comparison of different methods for the search for similar compounds Methods analyzed:

  • Tanimoto coefficients on the basis of Daylight Fingerprints
  • Euklidean distances in a 5-dimensional BCUT property

space (R.S. Pearlman, K.M. Smith, Perspectives in Drug

Discovery and Design, 9/10/11, 339-353, 1998)

  • Feature Trees

(M. Rarey, J.S. Dixon, J. of Computer-Aided Molecular Design, 12, 471-490, 1998)

slide-4
SLIDE 4

ABCD

Data Set 75 5HT1A agonists 75 H2 antagonists 75 MAOA inhibitors 75 Thrombin inhibitors + ~ 15.000 compounds chosen randomly from MDDR data base Examples shown for the 5HT1A agonists

slide-5
SLIDE 5

ABCD

First Step

each in turn

Query 75

Known Actives

Similarity Search 3 Data set Ranked Data Set Enrichment Curve

slide-6
SLIDE 6

ABCD

Results from First Step

  • 1. Shapes of individual enrichment curves depend on the

query, shown for Daylight Fingerprints

slide-7
SLIDE 7

ABCD

Individual Enrichment Curves - Daylight Fingerprints

5HT - Daylight Fingerprints 20 40 60 80 100 120 20 40 60 80 100 120 % of data set screened % Hits found 5HT_21 5HT_53 RANDOM 5HT_01 5HT_31 5HT_64

slide-8
SLIDE 8

ABCD

Results from First Step

  • 1. Shapes of individual enrichment curves depend on

the query Valid for all three methods

  • 2. Shapes of individual enrichment curves depend on

the method used for similarity searches, shown for 5HT_57

slide-9
SLIDE 9

ABCD

Corresponding Results Achieved with Daylight Fingerprints, BCUTs, and FTs

5HT_57

20 40 60 80 100 120 20 40 60 80 100 120 Daylight BCUTs FTs Random

slide-10
SLIDE 10

ABCD

Results from First Step

  • 1. Shapes of individual enrichment curves depend on the

query Valid for all three methods

  • 2. Shapes of individual enrichment curves depend on the

method used for similarity searches, shown for 5HT_59

  • 3. Ranking of the 3 methods depends on the queries

Complementarity?

slide-11
SLIDE 11

ABCD

Consequences from First Step Global conclusions on this basis questionable! ⇒ Try to reduce variance and / or dependence on the queries ⇒ Analyze complementarity of the methods

slide-12
SLIDE 12

ABCD

Strategy to Reduce Variance Combination of Queries: 75 x random selection of 3 actives for each combination:

  • determine distances to all 3 actives for the whole data set
  • for each compound:

select the distance to the nearest of the 3 actives

  • rank all compounds according to those distances

perform this procedure for all 3 methods

slide-13
SLIDE 13

ABCD

Results for Combinations of 3 Queries

combinations Single queries SD Average # hits SD Average # hits method # comp. 3.0 11.1 2.2 5.5 Daylight 75 2.9 7.4 3.3 4.2 BCUTs 3.5 12.1 3.0 6.4 FTs 8.2 34.7 9.3 26.4 FTs 6.6 35.2 12.1 29.1 BCUTs 7.0 30.9 8.3 22.2 Daylight 1500

  • 1. Average

number of hits found increased

  • 2. Relative SD

decreased

  • 3. Trends

instead of global conclusions

slide-14
SLIDE 14

ABCD

Average Enrichment Curves for 75 Combinations of 3 Queries

5HT-1A

20 40 60 80 100 120 20 40 60 80 100 120 % data set screened % hits found Random Daylight BCUTs FTs

slide-15
SLIDE 15

ABCD

Average Enrichment Curves for 75 Combinations of 3 Queries - Detail

5HT-1A

10 20 30 40 50 60 2 4 6 8 10 12

% data set screened % hits found

Random Daylight BCUTs FTs

slide-16
SLIDE 16

ABCD

Average Number of Hits Found

75 150 300 500 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 11000 12000 13000 14000 15271

Heat Map

Random Daylight BCUTs FTs

# comp. screened

slide-17
SLIDE 17

ABCD

Nearest Neighbors (Actives) to 5HT_59

NH O O N NH O

O N NH O

Daylight

O O N H N

O O Cl N NH

O N H O O

N N N H

N O

Feature Trees 38 99 141 67 2 1 5 3 15 BCUTs

slide-18
SLIDE 18

ABCD

Overlap Daylight – Feature Trees Average # hits detected by screening x% of the data set

  • nly Feature Trees

both x = 0. 5 x = 5 x = 10

  • nly Daylight

15.1 hits found: 33.6 hits found: 43.4 hits found:

4 8.5 2.6

10.3 16.7 6.6 12.5 23.5 7.4

slide-19
SLIDE 19

ABCD

Overlap Daylight - BCUTs Average # hits detected by screening x% of the data set

  • nly BCUTs

both x = 0. 5 x = 5 x = 10

  • nly Daylight

2.9 4.5 6.6 11.8 12.4 10.9 16.1 19.1 11.8

14 hits found: 35.1 hits found: 47 hits found: Combination of methods

  • but how?
slide-20
SLIDE 20

ABCD

Characteristics of Methods BCUTs:

  • Allow scaffold hopping
  • Higher percentages of the data set have to be screened

to make full use of the method‘s potential Daylight Fingerprints:

  • Especially useful for the detection of actives from the

same structural class

  • Extremely high enrichments among the very nearest

neighbors

  • High hit rates among nearest neighbors within a

Tanimoto threshold

slide-21
SLIDE 21

ABCD

Similarity Search with Daylight Fingerprints Using a Tanimoto Threshold - Procedure

A 0.95 B 0.83 C 0.79 D 0.72 E 0.69 F 0.68 …

  • 1. Number of combined queries

with any nearest neighbors within Tanimoto threshold

  • 2. Average hit rate of subsets

from queries with any nearest neighbors within Tanimoto threshold

  • 3. Sum of hits and sum of

non-hits within all subsets from all queries

Combined query:

Act1 Act2 Act3 Rank data set using Daylight Fingerprints

slide-22
SLIDE 22

ABCD

Similarity Search with Daylight Fingerprints Using a Tanimoto Threshold - Results

602 549 55.6 % 75 0.6 60 387 88.0 % 75 0.7 8 233 94.1 % 73 0.8

# non- hits # hits Average hit rate # Queries with NNs Tanimoto Threshold

slide-23
SLIDE 23

ABCD

Procedure

Daylight NN > 0.7 Combined query:

3 4 5 6 1 2 Act1 Act2 Act3 A 7 B 11 C 13 D E F … 8 9 10 12

Similarity search using BCUTs

Ranked data set:

slide-24
SLIDE 24

ABCD

Average Number of Hits Found 7.1 2.4 0.4

Random

39.6 35.2 30.9 1500 21.9 19.0 19.9 500 9.9 7.4 11.1 75

Daylight + BCUTs BCUTs Daylight # comp. screened

  • 1. Combination better

than BCUTs for screening 75 compounds

  • 2. Combination better

than both methods for all other cases

  • 3. Single methods as

well as combination clearly superior to random selection

slide-25
SLIDE 25

ABCD

Conclusions

  • Reasonable enrichments of actives can be achieved using each of

the three methods to measure similarity

  • Results of the three methods are complementary to each other
  • Daylight Fingerprints show
  • extremely high enrichments among the very nearest neighbors

(actives from the same structural class)

  • High hit rates among nearest neighbors within a Tanimoto

threshold (e.g. 0.8 / 0.7)

  • BCUT distances allow scaffold hopping, but higher percentages of

the data set have to be screened to make full use of the method‘s potential

  • Feature Trees allow scaffold hopping, but they are also useful for the

detection of actives from the same structural class

  • Improvement of results by combining methods
slide-26
SLIDE 26

ABCD

Acknowledgements Michael Bieler Bernd Wellenzohn Herbert Köppen

slide-27
SLIDE 27

ABCD

BACKUP

slide-28
SLIDE 28

ABCD

Descriptors

Generally any kind of descriptors can be used! Diverse Solutions provides BCUT values: diagonal elements contain atomic properties:

  • Gasteiger charges
  • H-donor and H-acceptor abilities
  • polarizabilities
  • ff-diagonal elements reflect connectivity

information: 2D, 3D, topological BCUTs

atom no. : 1 2 3 4 1 2 3 4

for each matrix different BCUT values:

  • highest and lowest eigen values
  • set of scaling factors
slide-29
SLIDE 29

Clustering of Compounds from Different Activity Classes GPCR ligands Kinase inhibitors Protease inhibitors BCUT values useful for similarity searches / virtual screening?

slide-30
SLIDE 30

ABCD

Feature Trees Instead of a linear representation of a molecule, the molecule is described by a tree structure representing its major chemical building blocks and the way they are connected. Characteristics:

  • conformation independent (2.5 D)
  • fragment based
  • can handle local similarity