Large Scale Ranking and Repositioning of Drugs with respect to - PowerPoint PPT Presentation

ISBRA 2012 – Dallas 21-23 May Large Scale Ranking and Repositioning of Drugs with respect to DrugBank Therapeutic Categories Matteo Re and Giorgio Valentini Dept. of Computer Science University of Milan - Italy Re and Valentini

Outline Drug repositioning Large scale ranking of drugs w.r.t. DrugBank therapeutic categories Ψ netPro : a general framework to construct pharmacological networks Kernelized score functions for drug ranking in pharmacological networks Experiments with 1253 FDA approved drugs Conclusions and developments Re and Valentini

Drug repositioning Small scale ( Kotelnikova et al. 2010, Li et al 2010 ) Large scale ( Iorio et al 2010, Gottlieb et al 2011 ) Computational tasks related to drug discovery: Clustering-based approaches ( Noeske et al 2005, Iorio et al 2010 ) Prediction of drug-target interactions ( Keiser et al 2009, Yamanishi et al 2010 ) Prediction of drug-disease association ( Gottlieb et al 2011, Chiang and Butte, 2009 ) ... 3 Re and Valentini

A novel prediction task: Large scale ranking of drugs w.r.t. DrugBank therapeutic categories Why DrugBank therapeutic categories?  Why not diseases? “At present, there is not a comprehensive and systematic representation of known drugs indications that would enable a fine- scale delineation of types of drug-disease relationships” ( Dudley et al 2011 )  Manually curated using medical literature 4 Re and Valentini

Drug ranking problem Having : Nodes → drugs  A network G=<V,E> connecting a large set of drugs: Edges → similarities A subset of drugs belonging to a given therapeutic category C  V C ⊂ V Rank drugs w.r.t. to a given therapeutic category C v ∈ V 5 Re and Valentini

Drug repositioning in homogeneous pharmacological networks 1. Construction and integration of homogeneous pharmacological networks 2. Network-based algorithms to rank drugs 6 Re and Valentini

How to construct meaningful pharmacological networks? A direct solution: a pairwise chemical structure similarity network N StructSim Can we construct other more general pharmacological networks? 7 Re and Valentini

Ψ NetPro: Pharmacological Space Integration Based on Networks Projections Drugs Targets Bipartite network One-mode pharm. network (e.g. drug-target) 8 Re and Valentini

Integration of pharmacological spaces Max integration (union) Min integration (intersection) Average Weighted average ... Per edge weighted average 9 Re and Valentini

Per edge weighted average A set of n pharmacological networks: G d =〈 V d ,E d 〉 , 1 ≤ d ≤ n with weights of edges d  v v ,v j  ∈ E d w ij The integrated pharmacological network: G= 〈 V,E 〉 ,V=U d V d ,E ⊆ U d E d has weights: D  i,j  = { d ∣ v i ∈ V d ∧ v j ∈ V d } w ij = 1 /∣ D  i,j  ∣ ∑ d , d ∈ D  i,j  w ij High coverage and no penalization for drugs with a limited number of data sources 10 Re and Valentini

Kernelized score functions: an algorithmic scheme for ranking drugs Any kernel. E.g.: - Linear kernel - Gaussian kernel - Graph kernels Drug-drug 1 ∣ V C ∣ ∑ network S AV  v,V C  = K  v,x  Average score : x ∈ V C S kNN  v,V C  = ∑ kNN score : K  v,x  x ∈ kNN  v  NN score : S NN  v,V C  =max x ∈ V C K  v,x  11 Re and Valentini

An example of graph kernel: the Random Walk kernel  One-step random walk kernel (Smola and Kondor, 2003): K=  a − 1  I+D − 1 / 2 WD − 1 / 2 Normalized W : weighted adjacency matrix of the graph Laplacian of the graph K : Gram matrix with elements k ij =K  v i ,v j  I : identity matrix D : diagonal matrix with d ii = ∑ w ij j  q-step random walk kernel: K q − step =K q q: number of steps By setting q >1 we can explore also “indirect neighbours” between drugs 12 Re and Valentini

A picture of the ranking method 13 Re and Valentini

Experiments 1253 FDA approved drugs 51 DrugBank therapeutic classes 3 pharmacological networks: - N structSim : pairwise chemical similarity ( Tanimoto coefficients) - N drugTarget : projection from drug-target interactions (from DrugBank 3.0 ) - N drugChem : projection from chemical interactions (from STITCH 2.0 ) Binarization and Graph Laplacian normalization 14 Re and Valentini

Progressive integration through “per edge” weighted average High coverage Low coverage 100% ........................................................ 50% N structSim N drugTarget N drugChem N structSim → W 1 (1253 nodes, 13010 edges) N structSim + N drugTarget → W 2 (1253, 43827) N structSim + N drugTarget + N drugChem → W 3 (1253, 96711) 15 Re and Valentini

A view of the integrated pharmacological network with Cytoscape 16 Re and Valentini

Results: AUC Kernelized score functions with random walk kernels compared with Random Walk (RW) and Random Walk with Restart (RWR) algorithms: 5-fold CV AUC results averaged across 51 DrugBank therapeutic classes: W 1 → W 2 → W 3 : AUC increments are statistically significant (Wilcoxon rank sum test, α =0.01) RW fails S AV and S kNN significantly better than the other methods (Wilcoxon rank sum test, α =0.01) 17 Re and Valentini

Results: precision at fixed recall S kNN : precision a fixed recall levels. 18 Re and Valentini

Time complexity 5-fold CV repeated 10 times for 51 therapeutical categories  No model learning is required (transductive method)  Score computation complexity : O(|V| |V C |) Approximately linear when |V C | << |V| 19 Re and Valentini

Preliminary analysis of top ranked “false positives” • “Anti HIV agents” : first top ranked FP is Darunavir (annotated in DrugBank as “ HIV Protease Inhibitor ”) • “GABA modulators” : Adinazolam and other 4 top ranked “false positives” are benzodiazepines, known to modulate the effect of GABA ( Hanson et al , 2008) 20 Re and Valentini

Conclusions ΨΝ etPro : a general framework for the construction and integration of pharmacological spaces based on networks projections Kernelized score functions : an algorithmic scheme for ranking drugs in pharmacological networks Cross-validated results show that our proposed methods are able to recover DrugBank therapeutic categories and to potentially reuse existing drugs for novel therapeutic indications 21 Re and Valentini

Developments and research perspectives 1. Integration of projected one-mode pharmacological networks from different two-mode networks: e.g. annotated side-effects ( SIDER ), curated pathway DB ( Reactome ), gene expression signature repositories ( Connectivity Map ) 2. Novel algorithms from the proposed algorithmic scheme: - novel distance measures and score functions - design of novel kernels well suited to the topology of the drug-drug networks 3. Low complexity of the algorithm: applicability to thousands of investigational compounds (not only FDA approved drugs) 4. Experimenting with different variants of network projections and integration 5. Systematic analysis of top ranked “false positive” drugs extended to all the therapeutic categories, or using other taxonomies (supported by text mining and text disambiguation techniques?) 22 Re and Valentini

Thank you for your attention! Giorgio Valentini Matteo Re 23 Re and Valentini

Large Scale Ranking and Repositioning of Drugs with respect to - PowerPoint PPT Presentation

ISBRA 2012 Dallas 21-23 May Large Scale Ranking and Repositioning of Drugs with respect to DrugBank Therapeutic Categories Matteo Re and Giorgio Valentini Dept. of Computer Science University of Milan - Italy Re and Valentini Outline

FROM HINDSIGHT TO FORESIGHT REPOSITIONING INTERNAL AUDIT TO DELIVER HIGHER VALUE

A large-scale International IPv6 Network A large-scale International IPv6 Network www.6net.org

Easy and Hard Outline Constraint Ranking in OT The Constraint Ranking problem Making fast

Tutorial: TF-Ranking for sparse features Tutorial: TF-Ranking for sparse features This tutorial

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large

Surgical repositioning , replantation and transplantation of teeth Dr. Rafik Al Kowafi BDS,

Online Submodular Set Cover, Ranking, and Repeated Active Learning Online Ranking: At each round,

Ranking candidate genes from Ranking candidate genes from perturbation experiments Niko

TVM for Ads Ranking @ Facebook Hao Lu, Ansha Yu, Yinghai Lu, Andrew Tulloch Ads Ranking at

Statistical Ranking Problem Tong Zhang Statistics Department, Rutgers University Ranking

Large-Scale Machine Learning at Twitter 2 Large-Scale Machine Learning at Twitter Jimmy Lin and

WILTSHIRE Drugs and Alcohol Strategy 2019-2022 Reducing harm caused by drugs and alcohol, to

INFRASTRUCTURE 2110414 Large Scale Computing Systems Natawut Nupairoj, Ph.D. Outline 2

The Nervous System and Drugs How drugs affect the nervous system Adapted from Sylvia Maders in

PROTEIN DRUGS PEPTIDE AND PROTEIN DRUGS In this lecture, the general differences in the kinetic

Ranking of manipulated images in a large set using Error Level Analysis Daan Wagenaar &

International Public Sector Accounting Standards Board Agenda Item 2A Key Characteristics of the

Annual General Meeting 3 July 2012 PROGRAMME 1.30pm AGM Business 2.00pm Opening Presentation

Mathivannan Jaganathan, School of Business Management A game-based classroom response system

How Has Forking Changed in the Last 20 Years? A Study of Hard Forks on GitHub Shurui Zhou, Bogdan

ZK with Rubiks Cubes and Non-Abelian Groups Emmanuel Volte - Valrie Nachef - Jacques Patarin

DUNE T HREE H ORN D ESIGN S TUDY Brendon Bullard P URPOSE Benchmarking G4LBNE/V2 and V3

MICROKERNELS KISHAN VARMA VAISHALI CHANANA AGENDA MONOLITHIC KERNELS VS MICROKERNELS

LANDSCAPE BUY TO LETS , ACCEPTABLE TO BE SOLD TO VULTURES SEEN AS AN INVESTMENT

Large Scale Ranking and Repositioning of Drugs with respect to - PowerPoint PPT Presentation

ISBRA 2012 Dallas 21-23 May Large Scale Ranking and Repositioning of Drugs with respect to DrugBank Therapeutic Categories Matteo Re and Giorgio Valentini Dept. of Computer Science University of Milan - Italy Re and Valentini Outline

FROM HINDSIGHT TO FORESIGHT REPOSITIONING INTERNAL AUDIT TO DELIVER HIGHER VALUE

A large-scale International IPv6 Network A large-scale International IPv6 Network www.6net.org

Easy and Hard Outline Constraint Ranking in OT The Constraint Ranking problem Making fast

Tutorial: TF-Ranking for sparse features Tutorial: TF-Ranking for sparse features This tutorial

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large

Surgical repositioning , replantation and transplantation of teeth Dr. Rafik Al Kowafi BDS,

Online Submodular Set Cover, Ranking, and Repeated Active Learning Online Ranking: At each round,

Ranking candidate genes from Ranking candidate genes from perturbation experiments Niko

TVM for Ads Ranking @ Facebook Hao Lu, Ansha Yu, Yinghai Lu, Andrew Tulloch Ads Ranking at

Statistical Ranking Problem Tong Zhang Statistics Department, Rutgers University Ranking

Large-Scale Machine Learning at Twitter 2 Large-Scale Machine Learning at Twitter Jimmy Lin and

WILTSHIRE Drugs and Alcohol Strategy 2019-2022 Reducing harm caused by drugs and alcohol, to

INFRASTRUCTURE 2110414 Large Scale Computing Systems Natawut Nupairoj, Ph.D. Outline 2

The Nervous System and Drugs How drugs affect the nervous system Adapted from Sylvia Maders in

PROTEIN DRUGS PEPTIDE AND PROTEIN DRUGS In this lecture, the general differences in the kinetic

Ranking of manipulated images in a large set using Error Level Analysis Daan Wagenaar &amp;

International Public Sector Accounting Standards Board Agenda Item 2A Key Characteristics of the

Annual General Meeting 3 July 2012 PROGRAMME 1.30pm AGM Business 2.00pm Opening Presentation

Mathivannan Jaganathan, School of Business Management A game-based classroom response system

How Has Forking Changed in the Last 20 Years? A Study of Hard Forks on GitHub Shurui Zhou, Bogdan

ZK with Rubiks Cubes and Non-Abelian Groups Emmanuel Volte - Valrie Nachef - Jacques Patarin

DUNE T HREE H ORN D ESIGN S TUDY Brendon Bullard P URPOSE Benchmarking G4LBNE/V2 and V3

MICROKERNELS KISHAN VARMA VAISHALI CHANANA AGENDA MONOLITHIC KERNELS VS MICROKERNELS

LANDSCAPE BUY TO LETS , ACCEPTABLE TO BE SOLD TO VULTURES SEEN AS AN INVESTMENT

Ranking of manipulated images in a large set using Error Level Analysis Daan Wagenaar &