Relational Datasets for Drug Discovery Christian Partl, Alexander - - PowerPoint PPT Presentation

relational datasets for drug discovery
SMART_READER_LITE
LIVE PREVIEW

Relational Datasets for Drug Discovery Christian Partl, Alexander - - PowerPoint PPT Presentation

ConTour: Data-Driven Exploration of Multi- Relational Datasets for Drug Discovery Christian Partl, Alexander Lex, Marc Streit, Hendrik Strobelt, Anne- MaiWassermann, Hanspeter Pfister and Dieter Schmalstieg Domain Problem = biological receptor


slide-1
SLIDE 1

ConTour: Data-Driven Exploration of Multi- Relational Datasets for Drug Discovery

Christian Partl, Alexander Lex, Marc Streit, Hendrik Strobelt, Anne- MaiWassermann, Hanspeter Pfister and Dieter Schmalstieg

slide-2
SLIDE 2

= biological receptor = biological target = chemical compound = potential target = direct target = inhibits target

Result (Phenotype) Interaction Compound

UNDERSTANDING DRUG DISCOVERY

Scenario 1: Targeted interaction, understood mechanism, desired outcome

Domain Problem

slide-3
SLIDE 3

= biological receptor = biological target = chemical compound = potential target = direct target = inhibits target

Result (Phenotype) Interaction Compound Scenario 2: Indirect interaction, understood mechanism, desired outcome

UNDERSTANDING DRUG DISCOVERY

Domain Problem

slide-4
SLIDE 4

= biological receptor = biological target = chemical compound = potential target = direct target = inhibits target

Result (Phenotype) Interaction Compound Scenario 3: Complex interactions, mechanism poorly understood, multiple outcomes

UNDERSTANDING DRUG DISCOVERY

Domain Problem

slide-5
SLIDE 5

Drug Discovery Main Goals

  • Identify a drug’s mechanism of action
  • Identify the biological process a drug

modulates

  • Identify new drugs for specific therapeutic

indications

Domain Problem

slide-6
SLIDE 6

ConTour

History View Filter View Relationship View Pathway View Compound View

slide-7
SLIDE 7

Data Abstraction

derived* derived

* Derived using a scheme propose by the Prous Integrity database

slide-8
SLIDE 8

Data Abstraction

“The drug discovery domain problem can be generalized to the problem

  • f analysing multi-relational datasets […] Consequently, we argue

that our approach is applicable to many other problems.”

slide-9
SLIDE 9

Data Abstraction

“The multi-relational data exploration problem can be interpreted as a graph exploration problem where each item of each dataset represents a node and the relationships between the items are the edges”

slide-10
SLIDE 10
  • T1: Identify Related Items

Item selection and highlighting

Task Analysis

Clicking, not hovering, on an item also moves all related items in columns to the top

slide-11
SLIDE 11
  • T1: Identify Related Items

Selection-based filters

Task Analysis

Filter choices when multiple items are selected

slide-12
SLIDE 12
  • T1: Identify Related Items

Nesting

Task Analysis

Simple Nesting

slide-13
SLIDE 13
  • T1: Identify Related Items

Nesting

Task Analysis

Recursive Nesting

slide-14
SLIDE 14
  • T2: Identify Items that Share a

Relationships with a Set of Items

Nesting

Task Analysis

Simple Nesting Recursive Nesting

slide-15
SLIDE 15
  • T3: Analyse Network Enrichment

Task Analysis

Enrichment Score Judging how specific two items are when compared to a third

Where: I = clusters K = compounds J = Pathways S(i,j) = pair score

compounds clusters pathways

*I assume they take care of divide by 0?

slide-16
SLIDE 16
  • T4: Rank Items

Task Analysis

Enrichment Score Sort by enrichment score

compounds clusters pathways

Sorting by interest Sort alpha-numerically

slide-17
SLIDE 17
  • T4: Rank Items

Task Analysis

Enrichment Score Sort by enrichment score

compounds clusters pathways

Sorting by interest Sort alpha-numerically

slide-18
SLIDE 18
  • T5: Filter Items

Task Analysis

Depends on tasks 1 and 2 Navigation Local Filter : filter within a specific column Global Filter: remove items that are not connected to the source column

slide-19
SLIDE 19
  • T5: Filter Items

Task Analysis

Depends on tasks 1 and 2 Simple Nesting Recursive Nesting Selection-based filters Nesting Navigation

slide-20
SLIDE 20
  • T6: View items in detail

Pathway View

Task Analysis

Compound Clusters Individual Compounds

Marks

slide-21
SLIDE 21
  • T6: View items in detail

Task Analysis

Total # of compounds that interact with pathway Total # of compounds that interact with pathway

Pathway View

Channels Size Saturation Hue Compounds binding None Many One

slide-22
SLIDE 22
  • T6: View items in detail

Task Analysis

Pathway View

Linked Views Highlighting Hover

slide-23
SLIDE 23
  • T6: View items in detail

Task Analysis

Compound View

Hue : Elements

slide-24
SLIDE 24

Implementation Details

slide-25
SLIDE 25

source code: https://github.com/Caleydo/

slide-26
SLIDE 26

ConTour

Algorithm Design

slide-27
SLIDE 27

ConTour

History View Filter View Relationship View Pathway View Compound View

slide-28
SLIDE 28

Relationship View

  • There are facets within the relationship view
  • Combination of tabular data and plots
slide-29
SLIDE 29

Relationship View

slide-30
SLIDE 30

Relationship View

slide-31
SLIDE 31

Relationship View

slide-32
SLIDE 32

Relationship View

Approximately 100 numerical values shown here

slide-33
SLIDE 33

Relationship View

Approximately 100 numerical values shown here

Compound:Gene binding Activating Inhibiting Binding

slide-34
SLIDE 34

Relationship View

T1 T2 T3 T4

slide-35
SLIDE 35

T5 T5

slide-36
SLIDE 36

Conclusions

slide-37
SLIDE 37

System ConTour

What : Data

Multi-relational databases; node-link graph; clusters (derived)

Why : Tasks

Discovery; drill down; highlight relationships

How: Ranking & sorting How: Filtering How: Selection & Highlighting Scale:

Dozens of columns; upper limit on HD display appears to about 20. Thousands of data items. Up to 8 simultaneous views for compounds; only 1 for pathways Linked highlighting across facets; automatic sorting Drag and drop (nesting); user control (navigation)

How: Encode How: Facet

Side-by-side linked views, containing tabular data, bar plots, glyphs Simple marks with manipulation of hue and saturation (pathway view) Enrichment score; highlight; user control (navigation)

How: Multiple Views

Relationship view; pathway view; compound view; history and filters

slide-38
SLIDE 38

Case Studies

Concluding Thoughts

  • Seems like a very good tool for use on

structured datasets

  • When there are indirect (inferred) relationships,

it would be good to highlight this with some uncertainty

  • What about incomplete relationships?
slide-39
SLIDE 39

Backup

slide-40
SLIDE 40
slide-41
SLIDE 41