Nick Hamilton Institute for Molecular Bioscience Essential Graph - PowerPoint PPT Presentation

Nick Hamilton Institute for Molecular Bioscience Essential Graph Theory for Biologists Image: Matt Moores, The Visible Cell

Outline • Core definitions • Which are the most important bits? Which are the most important bits? • What happens when I break it? Robustness • What are the functional modules? Wh h f i l d l ? • Are there functional modules? • Getting around in a graph • Graph algorithms Graph algorithms • Trees & hierarchical structure • Small world and scale free graphs S ll ld d l f h • Software

Core Definitions A graph is a collection of nodes or vertices and a set of edges that connect pairs of nodes. t i f d Edges may be undirected or directed or have loops A graph might have multiple disconnected components 3 components

A simple example p p Nodes : people in this room Edges : “are friends” Nodes : people in this room Nodes : people in this room Edges : “likes”

Which graph bit is the most important? g p p For an undirected graph, the degree of a node is the number of edges connected to a node Degree 6 Degree 0 If the graph is directed, define in ‐ degree and out ‐ degree defined similarly similarly I In ‐ degree 2 d 2 Out ‐ degree 4

Which graph bit is the most important? A hub node is a node of “high” degree, relatively The inevitable example, the p53 protein interaction network Image: Dartnell et al, FEBS Letters 579, 2005 P53: crucial for cell cycle and apoptosis

Importance: What happens if I break it? p pp Node Deletion . Take the graph and delete a node and all its edges. Node separation set : a subset of nodes whose deletion causes Node separation set : a subset of nodes whose deletion causes the number of components in the graph to increase Mutations reducing p53 activity are present in over 50% of human tumours! (Haupt et al. 2003)

Importance: What happens if I break it? p pp Edge Deletion . Delete an edge (but not the nodes it joins) Cut set : as for node separation set, but deleting edges Network Robustness : how hard is it to break the network? Delete a random node or edge: it is still connected?

What are the (functional) modules? ( ) But what about: Components Mathematicians Biologists Clique A subset of nodes each pair joined by an edge Clique . A subset of nodes, each pair joined by an edge A maximal clique is contain in no larger clique

What at the (functional) modules? ( ) e ‐ Near Clique. A subset of nodes such that a fraction of e pairs of nodes have an edge between them 10/15 10/15 – near clique near clique 3 ‐ clique q Co ‐ Clique . A subset of nodes, no two joined by an edge Green nodes are a co ‐ clique

Are there modules? ‐ Clustering Coefficient g How do we tell if a node u is in a cluster? C = 8/21 C u 8/21 u C u = 0 u Why? ‐ Lots of triangles on the node ‐ i.e. mutual connection i e mutual connection For a node u of degree k , where there are e edges between neighbours of u, define the cluster coefficient C u as: C u = e / [k(k ‐ 1)/2] / [ ( )/ ] u # triangles on u Maximum possible # triangles on u For a graph, then define the average cluster coefficient

Getting around in a Graph Path . A “walk” through the graph with no repeated edges Path . A walk through the graph with no repeated edges a a-c-d c d b Cycle . A path that begins and ends at the same node Cycle . A path that begins and ends at the same node a a-b-c-a c d b Connected . There is a path between any two nodes

For instance, Metabolic Pathways http://www.genome.jp/kegg/pathway/map/map00260.html

Path Example: Shotgun sequence reconstruction Original Sequence b e Fragments Fragments d g c a a f f Construct overlap graph nodes : sequence fragments d f t edges : the tail of one fragment overlaps the head of another e b d g a c f f Warning: the above ignore all the awful details: sequencing errors, repeats, …

Hamiltonian (no relation) Paths Original Sequence b e Fragments d d g c a f Hamiltonian Path : Visits every node exactly once e b d g a c f

Edge Weights But there might be multiple Hamiltonian paths Which is “best”? Which is best ? 4 3 6 6 5 or ? 3 5 3 6 6 3 3 3 3 Total 15 Total 11 U Use edge weights : amount of overlap between fragments d i ht t f l b t f t M More overlap means a shorter combined sequence : better l h t bi d b tt In fact this is just the “famous” travelling salesman problem f h h “f ”

Trees and Hierarchical Structure A tree is an undirected connected acyclic graph A directed tree is a directed graph that would be tree if the directions were ignored directions were ignored Noam Chomsky, Syntactic Structures Species Tree with LGT events

Small World Networks Stanley Milgram in 1967 “showed” social networks have “ six degrees of separation ” and other shocking experiments Variations : Six degrees of Kevin Bacon, Erdös Number, Six degrees of Eric Clapton. Erdös ‐ Bacon ‐ Sabbath Number. g p Defining characteristics of small world networks Defining characteristics of small world networks ‐ Most nodes are not directly connected to each other ‐ Can get from between most pair of nodes in few steps C t f b t t i f d i f t [For N nodes, average pair distance proportional to Log(N)] Watts & Strogatz (Nature, 1998): constructed networks with small average shortest path & high clustering coefficient

Properties and Examples of Small World Networks p p Think “airports”, “connecting flights” • Lots of hubs • Often have cliques and near cliques q q • Said to be robust to perturbation (though hubs are vulnerable) For example (but beware, cf Lima ‐ Mendez & van Helden 2009) • Transcriptional networks Transcriptional networks • Metabolic networks • Protein interaction networks • Neural connections • You name it, it is a small world!

Scale Free Networks • Barabasi & Albert (Science, 1999) • Have power law distribution of degrees: P(k) ~ k ‐α ee k s with degre on of nodes Proportio Actors Power grid Web pages • Can be constructed by preferential attachment • They are “ ultra ‐ small worlds ”: Log(Log(N)) steps (Cohen & Havlin, 2003)

Software for Graph Exploration & Visualisation Pajek: graph algorithms Tulip: 2D and 3D interactive See: and visualisation visualisation of graphs http://www google com/ http://www.google.com/ Top/Science/Math/ Combinatorics/Software/ Graph_Drawing/ For a selection of tools Matlab (MatlabBGL): Cytoscape: viz. interaction GraphViz: sophisticated Graph algorithms & metrics networks/pathways graph layout images nicked from the respective websites

Further Reading • Mark Buchanan, Small World: Uncovering Nature’s Hidden Networks • Albert & Barabasi, Emergence of scaling in random networks, Science 286 (286):509 ‐ 512 , 1999 • Watts, & Stogatz, Collective dynamics of small world , g , y networks, Nature 393 :440 ‐ 444, 1998 • Lima ‐ Mendez & van Helden. The powerful law of the power l law and other myths in network biology. Mol. Biosys. d th th i t k bi l M l Bi 5 (12):1482 ‐ 9, 2009

Summary • Node Degree : Which are the most important bits? • Node & Edge Cuts : What happens when I break it? Robustness • Cliques & Clusters : What are the functional modules? Cliques & Clusters : What are the functional modules? • Cluster Coefficient : Are there functional modules? • Paths & Edge Weights : Getting around in a graph h d h d h • Graph algorithms : Are usually hard • Trees : Are ubiquitous • Small world and scale free graphs : Are popular Small world and scale free graphs : Are popular • Software : There is some

Nick Hamilton Institute for Molecular Bioscience The End The End Image: Matt Moores, The Visible Cell

Nick Hamilton Institute for Molecular Bioscience Essential Graph - PowerPoint PPT Presentation

Nick Hamilton Institute for Molecular Bioscience Essential Graph Theory for Biologists Image: Matt Moores, The Visible Cell Outline Core definitions Which are the most important bits? Which are the most important bits? What happens when

ETH facilities @ Bioscience, Wageningen BU Bioscience, Wageningen Plant Research, WUR Ric de Vos,

4. Molecular dynamics Understanding Molecular Simulation Molecular Simulations Molecular

7.1 Denis Corr, Ph.D. Denis Corr, Ph.D. Chair Clean Air Hamilton www.cleanair.hamilton.ca

HAMILTON BUSINESS DISTRICT HDC DOWNTOWN HAMILTON IMPROVEMENTS HAMILTON DEVELOPMENT CORPORATION

Hamilton cycles in the random geometric graph Nick Wormald University of Waterloo Hamilton

Denis Corr, Ph.D. Chair Clean Air Hamilton www.cleanair.hamilton.ca First the good news!

Molecular vibrations Ask Hjorth Larsen Center for Atomic-scale Materials Design 2008 Molecular

Basics of Molecular biology Molecular biology is the study of biology at molecular level.

3. Monte Carlo Simulations Understanding Molecular Simulation Molecular Simulations Molecular

Molecular Simulation Introduction Understanding Molecular Simulation Introduction Why to use

Introduction to PLINK Scott Hazelhurst Sydney Brenner Institute for Molecular Bioscience and

Town of Hamilton, Massachusetts Hamilton Technical Assistance Panel, June 22, 2015 About

Town of Hamilton, Massachusetts Hamilton Technical Assistance Panel, June 22, 2015 About ULI

Bioscience Technology Career Technical Education Preparing Students For the Workforce

2 nd Workshop Argentina-Japan 2 nd Workshop Argentina-Japan Bioscience and Biotechnology for

Building on UK Excellence: Bioscience for the bioeconomy Lee Beniston Head of Innovation,

Oversight Committee Meeting Streets Project Status Updates * * All financial values based

visualizing security boundaries in docker swarm overlay networks Mode for managing a cluster

2019 - 20 Financial Aid High School Presentation New Jersey Higher Education Student Assistance

ERSAT - EAV ERTMS on SATELLITE Enabling Application Validation Alessandro Neri 1 , Gianluigi

How to create a good poster Nitzan Rimon, Yael Elbaz & Maya Schuldiner Department of

Requirements PREPARED AND PRESENTED BY: DR. KELLEY PENNELL, DNP, APRN, ACNS-BC Objectives Gain

Workforce Policy Op/ons Colorado Commission on Affordable Health

Requirements of the Affordable Care Act Office o e of Medi edical As Assi sist stance

Nick Hamilton Institute for Molecular Bioscience Essential Graph - PowerPoint PPT Presentation

Nick Hamilton Institute for Molecular Bioscience Essential Graph Theory for Biologists Image: Matt Moores, The Visible Cell Outline Core definitions Which are the most important bits? Which are the most important bits? What happens when

ETH facilities @ Bioscience, Wageningen BU Bioscience, Wageningen Plant Research, WUR Ric de Vos,

4. Molecular dynamics Understanding Molecular Simulation Molecular Simulations Molecular

7.1 Denis Corr, Ph.D. Denis Corr, Ph.D. Chair Clean Air Hamilton www.cleanair.hamilton.ca

HAMILTON BUSINESS DISTRICT HDC DOWNTOWN HAMILTON IMPROVEMENTS HAMILTON DEVELOPMENT CORPORATION

Hamilton cycles in the random geometric graph Nick Wormald University of Waterloo Hamilton

Denis Corr, Ph.D. Chair Clean Air Hamilton www.cleanair.hamilton.ca First the good news!

Molecular vibrations Ask Hjorth Larsen Center for Atomic-scale Materials Design 2008 Molecular

Basics of Molecular biology Molecular biology is the study of biology at molecular level.

3. Monte Carlo Simulations Understanding Molecular Simulation Molecular Simulations Molecular

Molecular Simulation Introduction Understanding Molecular Simulation Introduction Why to use

Introduction to PLINK Scott Hazelhurst Sydney Brenner Institute for Molecular Bioscience and

Town of Hamilton, Massachusetts Hamilton Technical Assistance Panel, June 22, 2015 About

Town of Hamilton, Massachusetts Hamilton Technical Assistance Panel, June 22, 2015 About ULI

Bioscience Technology Career Technical Education Preparing Students For the Workforce

2 nd Workshop Argentina-Japan 2 nd Workshop Argentina-Japan Bioscience and Biotechnology for

Building on UK Excellence: Bioscience for the bioeconomy Lee Beniston Head of Innovation,

Oversight Committee Meeting Streets Project Status Updates * * All financial values based

visualizing security boundaries in docker swarm overlay networks Mode for managing a cluster

2019 - 20 Financial Aid High School Presentation New Jersey Higher Education Student Assistance

ERSAT - EAV ERTMS on SATELLITE Enabling Application Validation Alessandro Neri 1 , Gianluigi

How to create a good poster Nitzan Rimon, Yael Elbaz &amp; Maya Schuldiner Department of

Requirements PREPARED AND PRESENTED BY: DR. KELLEY PENNELL, DNP, APRN, ACNS-BC Objectives Gain

Workforce Policy Op/ons Colorado Commission on Affordable Health

Requirements of the Affordable Care Act Office o e of Medi edical As Assi sist stance

How to create a good poster Nitzan Rimon, Yael Elbaz & Maya Schuldiner Department of