Mining co-expression networks Nathalie Villa-Vialaneix - PowerPoint PPT Presentation

Overview on co-expression network analysis Case study 1 Case study 2 References Mining co-expression networks Nathalie Villa-Vialaneix http://www.nathalievilla.org INRA, Unité MIA-T, INRA, Toulouse (France) School for advanced sciences of Luchon Network analysis and applications NV 2 | Mining co-expression networks 1/32

Overview on co-expression network analysis Case study 1 Case study 2 References Outline Overview on co-expression network analysis 1 Case study 1: gene network analysis in relations with meat 2 quality Case study 2: gene network analysis in LCD experiment 3 NV 2 | Mining co-expression networks 2/32

Overview on co-expression network analysis Case study 1 Case study 2 References Outline Overview on co-expression network analysis 1 Case study 1: gene network analysis in relations with meat 2 quality Case study 2: gene network analysis in LCD experiment 3 NV 2 | Mining co-expression networks 3/32

Overview on co-expression network analysis Case study 1 Case study 2 References Transcriptomic data DNA transcripted into mRNA to produce proteins NV 2 | Mining co-expression networks 4/32

Overview on co-expression network analysis Case study 1 Case study 2 References Transcriptomic data transcriptomic data: measure of the quantity of mRNA corresponding to a given gene in given cells (blood, muscle...) of a living organism DNA transcripted into mRNA to produce proteins NV 2 | Mining co-expression networks 4/32

Overview on co-expression network analysis Case study 1 Case study 2 References Systems biology Some genes’ expressions activate or repress other genes’ expressions ⇒ understanding the whole cascade helps to comprehend the global functioning of living organisms 1 1 Picture taken from: Abdollahi A et al. , PNAS 2007, 104 :12890-12895. c � 2007 by National Academy of Sciences NV 2 | Mining co-expression networks 5/32

Overview on co-expression network analysis Case study 1 Case study 2 References Standard issues in network analysis Inference Giving expression data, how to build a graph whose edges represent the direct links between genes? Example: co-expression networks built from microarray/RNAseq data (nodes = genes; edges = significant “direct links” between expressions of two genes) NV 2 | Mining co-expression networks 6/32

Overview on co-expression network analysis Case study 1 Case study 2 References Standard issues in network analysis Inference Giving expression data, how to build a graph whose edges represent the direct links between genes? Graph mining (examples) Network visualization: nodes are not a priori given a position. 1 Positions aiming at representing connected Random positions nodes closer NV 2 | Mining co-expression networks 6/32

Overview on co-expression network analysis Case study 1 Case study 2 References Standard issues in network analysis Inference Giving expression data, how to build a graph whose edges represent the direct links between genes? Graph mining (examples) Network visualization: nodes are not a priori given a position. 1 Important node extraction (high degree, high centrality...) 2 NV 2 | Mining co-expression networks 6/32

Overview on co-expression network analysis Case study 1 Case study 2 References Standard issues in network analysis Inference Giving expression data, how to build a graph whose edges represent the direct links between genes? Graph mining (examples) Network visualization: nodes are not a priori given a position. 1 Important node extraction (high degree, high centrality...) 2 3 Network clustering: identify “communities” NV 2 | Mining co-expression networks 6/32

Overview on co-expression network analysis Case study 1 Case study 2 References Network inference Data: large scale gene expression data       . . . . . .          individuals     X j  X =   . . . . .        n ≃ 30 / 50 i     . . . . . . � �� variables (genes expression) , p ≃ 10 3 / 4 What we want to obtain: a graph/network with nodes: genes (a selected sublist of interest 2 ; usually, DE genes); edges: “strong relations” between gene expressions. 2 See [Verzelen, 2012] for conditions on respective n / p suited for inference. NV 2 | Mining co-expression networks 7/32

Overview on co-expression network analysis Case study 1 Case study 2 References Advantages of this network model over raw data: focuses on the strongest direct relationships: 1 irrelevant or indirect relations are removed (more robust) and the data are easier to visualize and understand (track transcription relations). NV 2 | Mining co-expression networks 8/32

Overview on co-expression network analysis Case study 1 Case study 2 References Advantages of this network model over raw data: focuses on the strongest direct relationships: 1 irrelevant or indirect relations are removed (more robust) and the data are easier to visualize and understand (track transcription relations). Expression data are analyzed all together and not by pairs (systems model). NV 2 | Mining co-expression networks 8/32

Overview on co-expression network analysis Case study 1 Case study 2 References Advantages of this network model over raw data: focuses on the strongest direct relationships: 1 irrelevant or indirect relations are removed (more robust) and the data are easier to visualize and understand (track transcription relations). Expression data are analyzed all together and not by pairs (systems model). over bibliographic network: can handle interactions with yet 2 unknown (not annotated) genes and deal with data collected in a particular condition. NV 2 | Mining co-expression networks 8/32

Overview on co-expression network analysis Case study 1 Case study 2 References Using correlations : relevance network [Butte and Kohane, 1999, Butte and Kohane, 2000] First (naive) approach: calculate correlations between expressions for all pairs of genes, threshold the smallest ones and build the network. Thresholding Graph Correlations NV 2 | Mining co-expression networks 9/32

Overview on co-expression network analysis Case study 1 Case study 2 References Using partial correlations x y z strong indirect correlation NV 2 | Mining co-expression networks 10/32

Overview on co-expression network analysis Case study 1 Case study 2 References Using partial correlations x y z strong indirect correlation set.seed(2807); x <- rnorm(100) y <- 2*x+1+rnorm(100,0,0.1); cor(x,y) [1] 0.998826 z <- 2*x+1+rnorm(100,0,0.1); cor(x,z) [1] 0.998751 cor(y,z) [1] 0.9971105 NV 2 | Mining co-expression networks 10/32

Overview on co-expression network analysis Case study 1 Case study 2 References Using partial correlations x y z strong indirect correlation set.seed(2807); x <- rnorm(100) y <- 2*x+1+rnorm(100,0,0.1); cor(x,y) [1] 0.998826 z <- 2*x+1+rnorm(100,0,0.1); cor(x,z) [1] 0.998751 cor(y,z) [1] 0.9971105 ♯ Partial correlation cor(lm(x ∼ z)$residuals,lm(y ∼ z)$residuals) [1] 0.7801174 cor(lm(x ∼ y)$residuals,lm(z ∼ y)$residuals) [1] 0.7639094 cor(lm(y ∼ x)$residuals,lm(z ∼ x)$residuals) [1] -0.1933699 NV 2 | Mining co-expression networks 10/32

Overview on co-expression network analysis Case study 1 Case study 2 References Partial correlation and GGM Gaussian Graphical Model framework: ( X i ) i = 1 ,..., n are i.i.d. Gaussian random variables N ( 0 , Σ) (gene expression); then � � j ←→ j ′ (genes j and j ′ are linked) ⇔ C or X j , X j ′ | ( X k ) k � j , j ′ � 0 NV 2 | Mining co-expression networks 11/32

Overview on co-expression network analysis Case study 1 Case study 2 References Partial correlation and GGM Gaussian Graphical Model framework: ( X i ) i = 1 ,..., n are i.i.d. Gaussian random variables N ( 0 , Σ) (gene expression); then � � j ←→ j ′ (genes j and j ′ are linked) ⇔ C or X j , X j ′ | ( X k ) k � j , j ′ � 0 If (concentration matrix) S = Σ − 1 , � � S jj ′ X j , X j ′ | ( X k ) k � j , j ′ = − � C or S jj S j ′ j ′ ⇒ Estimate Σ − 1 to unravel the graph structure NV 2 | Mining co-expression networks 11/32

Overview on co-expression network analysis Case study 1 Case study 2 References Partial correlation and GGM Gaussian Graphical Model framework: ( X i ) i = 1 ,..., n are i.i.d. Gaussian random variables N ( 0 , Σ) (gene expression); then � � j ←→ j ′ (genes j and j ′ are linked) ⇔ C or X j , X j ′ | ( X k ) k � j , j ′ � 0 If (concentration matrix) S = Σ − 1 , � � S jj ′ X j , X j ′ | ( X k ) k � j , j ′ = − � C or S jj S j ′ j ′ ⇒ Estimate Σ − 1 to unravel the graph structure Σ n ) − 1 is a poor Problem: Σ : p -dimensional matrix and n ≪ p ⇒ ( � estimate of S )! NV 2 | Mining co-expression networks 11/32

Mining co-expression networks Nathalie Villa-Vialaneix - PowerPoint PPT Presentation

Overview on co-expression network analysis Case study 1 Case study 2 References Mining co-expression networks Nathalie Villa-Vialaneix http://www.nathalievilla.org INRA, Unit MIA-T, INRA, Toulouse (France) School for advanced sciences of

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

Gene Expression Data Introduction to gene expression data Expression data storage concept An

The Expression Problem and Lenses Lambdajam 2016 Tony Morris The Expression Problem A new name

Web Mining Web Mining Web mining is the use of data mining techniques to automatically

Differential expression analysis John Blischak Instructor DataCamp Differential Expression

Confluent Orthogonal Drawing for Syntax Diagrams S-expression ( S-expression

Lec 03. Regular expression, Pumping lemma Eunjung Kim F ORMAL DEFINITION OF R EGULAR EXPRESSION

Cement, Aggregates, Mining Presentation Cement, Aggregates and Mining Cement, Aggregates and

Frequent Pattern Mining Frequent Sequence Mining Frequent Tree Mining Christian Borgelt

Data Mining 2020 Frequent Pattern Mining (2) Ad Feelders Universiteit Utrecht October 2, 2020

Introduction What is data mining? to Data Mining: On what kind of data? Data Mining

Web MINING Web MINING Overview Overview Dr Ahmed Rafea Rafea Dr Ahmed 1 Web Mining Outline

Web Mining Andreas Andersson Gustav Strmberg Sandra Stendahl Introduction Web mining o

Week 5 Video 1 Relationship Mining Correlation Mining Relationship Mining Discover

Week 5 Video 2 Relationship Mining Causal Mining Causal Data Mining These slides developed in

Data Mining 2018 Frequent Pattern Mining (2) Ad Feelders Universiteit Utrecht October 10, 2018

Requirements Elicitation Notes by mainly Jo Anne Atlee, with modifications by Daniel Berry dberry

Systems@Google Vamsi Thummala Slides by Prof. Cox DeFiler FAQ Multiple writes to a dFile?

Rascal The Metaprogramming Language Summer School on Software Technologies and Software

Learning to Rank with Learning to Rank with Partially-Labeled Data Partially-Labeled Data Kevin

Personalized Image Aesthetics Assessment Leida Li School of Artificial Intelligence Xidian

Recovering continuous conformations and reconstructing the energy landscape of a molecular machine

Web Mining and Recommender Systems T ext Mining Learning Goals Introduce the topic of text

Outline: This week 1. Subnetwork querying. More colorcoding. Treewidth graphs. 2.

Sambuz

Useful Links

Newsletter

Mail Us