Introduction to Computational Graph Analytics Lecture 1 CSCI - PowerPoint PPT Presentation

Introduction to Computational Graph Analytics Lecture 1 CSCI 4974/6971 29 August 2016 1 / 6

Graph, networks, and characteristics of real-world data Slides from Marta Arias & R. Ferrer-i-Cancho, Intro to Complex and Social Networks 2 / 6

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks So, let’s start! Today, we’ll see: 1. Examples of real networks 2. What do real networks look like? ◮ real networks exhibit small diameter ◮ .. and so does the Erd¨ os-R´ enyi or random model ◮ real networks have high clustering coefficient ◮ .. and so does the Watts-Strogatz model ◮ real networks’ degree distribution follows a power-law ◮ .. and so does the Barabasi-Albert or preferential attachment model Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks Examples of real networks ◮ Social networks ◮ Information networks ◮ Technological networks ◮ Biological networks Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks Social networks Links denote social “interactions” ◮ friendship, collaborations, e-mail, etc. Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks Information networks Nodes store information, links associate information ◮ citation networks, the web, p2p networks, etc. Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks Technological networks Man-built for the distribution of a commodity ◮ telephone networks, power grids, transportation networks, etc. Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks Biological networks Represent biological systems ◮ protein-protein interaction networks, gene regulation networks, metabolic pathways, etc. Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks Representing networks ◮ Network ≡ Graph ◮ Networks are just collections of “points” joined by “lines” points lines vertices edges, arcs math nodes links computer science sites bonds physics actors ties, relations sociology Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks Types of networks From [Newman, 2003] (a) unweighted, undirected (b) discrete vertex and edge types, undirected (c) varying vertex and edge weights, undirected (d) directed Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks Small-world phenomenon ◮ A friend of a friend is also frequently a friend ◮ Only 6 hops separate any two people in the world Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks Measuring the small-world phenomenon, I ◮ Let d ij be the shortest-path distance between nodes i and j ◮ To check whether “any two nodes are within 6 hops”, we use: ◮ The diameter (longest shortest-path distance) as d = max i , j d ij ◮ The average shortest-path length as 2 � l = d ij n ( n + 1) i > j ◮ The harmonic mean shortest-path length as 2 l − 1 = � d − 1 ij n ( n + 1) i > j Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks From [Newman, 2003] Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks Degree distribution Histogram of nr of nodes having a particular degree f k = fraction of nodes of degree k Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

Presentation and course logistics Examples of real networks Intro to Network Analysis Measuring and modeling networks Scale-free networks The degree distribution of most real-world networks follows a power-law distribution f k = ck − α ◮ “heavy-tail” distribution, implies existence of hubs ◮ hubs are nodes with very high degree Marta Arias & R. Ferrer-i-Cancho Intro to Complex and Social Networks

How to Analyze Networks Slides from Johannes Putzke, Social Network Analysis: Basic Concepts, Methods & Theory 3 / 6

Different Levels of Analysis  Actor-Level  Dyad-Level  Triad-Level  Subset-level (cliques / subgraphs)  Group (i.e. global) level Folie: 35

Example: Centrality Measures  Who is the most prominent?  Who knows the most actors? (Degree Centrality)  Who has the shortest distance to the other actors?  Who controls knowledge flows?  ... Folie: 18

Closeness Centrality  Who knows the most actors?  Who has the shortest distance to the other actors? (Closesness Centrality)  Who controls knowledge flows?  ... Folie: 40

Betweenness Centrality  Who knows the most actors? 1 2  Who has the shortest distance to 4 5 3 7 the other actors? 6  Who controls knowledge flows? 8 9 (Betweenness Centrality)  ... 10 12 11 14 13 17 15 16 18 19 Folie: 44

Reachability, Distances and Diameter  Reachability  If there is a path between nodes n i and n j  Geodesic  Shortest path between two nodes  (Geodesic) Distance d(i,j)  Length of Geodesic (also called „degrees of separation“) Folie: 27

Diameter of a Graph and Average Geodesic Distance  Diameter 1 2  Largest geodesic distance 4 5 3 7 between any pair of nodes 6  Average Geodesic Distance 8 9  How fast can information 10 get transmitted? 12 11 14 13 17 15 16 18 19 Folie: 50

Density  Proportion of ties in a graph High density (44%) Low density (14%) Folie: 51

Connectivity of Graphs Folie: 58

Connected Graphs, Components, Cutpoints and Bridges  Connectedness  A graph is connected if there is a path between every pair of nodes  Components  Connected subgraphs in a graph  Connected graph has 1 component  Two disconnected graphs are one social network!!! Folie: 59

Connected Graphs, Components, Cutpoints and Bridges  Connectivity of pairs of nodes and graphs n1 n2 n3 n4  Weakly connected  Joined by semipath n1 n2 n3 n4  Unilaterally connected n1 n2 n3 n4  Path from n j to n j or from n j to n j  Strongly connected  Path from n j to n j and from n j to n j n2 n3  Path may contain different nodes n1 n4  Recursively Connected n5 n6  Nodes are strongly connected and both paths use the same nodes and arcs in reverse order n1 n2 n3 n4 Folie: 60

Connected Graphs, Components, Cutpoints and Bridges  Cutpoints 1 2  number of components in the graph that contain 4 5 3 node n j is fewer than 7 number of components in 6 subgraphs that results from 8 9 deleting n j from the graph  Cutsets (of size k) 10  k -node cut 12 11  Bridges / line cuts 14  Number of 13 components…that contain 17 15 line l k 16 18 19 Folie: 61

Node- and Line Connectivity  How vulnerable is a graph to removal of nodes or lines? Point connectivity / Line connectivity / Edge connectivity Node connectivity  Minimum number λ for  Minimum number of k for which for which graph has a which the graph has a k - λ -line cut node cut  For any value <k the graph is k-node-connected Folie: 62

How to Analyze Networks (cont.) Slides from Jon Crowcroft, Introduction to Network Theory 4 / 6

Subgraph Subgraph Vertex and edge sets are subsets of those of G Vertex and edge sets are subsets of those of G    a a supergraph supergraph of a graph G is a graph that contains G as a of a graph G is a graph that contains G as a  subgraph. . subgraph

Isomorphism Isomorphism Bijection, i.e., a one-to-one mapping: Bijection , i.e., a one-to-one mapping:   f : V(G) -> V(H) f : V(G) -> V(H) u and v from G are adjacent if and only if f(u) and f(v) are u and v from G are adjacent if and only if f(u) and f(v) are adjacent in H. adjacent in H. If an isomorphism can be constructed between two graphs, then If an isomorphism can be constructed between two graphs, then   we say those graphs are isomorphic we say those graphs are isomorphic . .

Introduction to Computational Graph Analytics Lecture 1 CSCI - PowerPoint PPT Presentation

Introduction to Computational Graph Analytics Lecture 1 CSCI 4974/6971 29 August 2016 1 / 6 Graph, networks, and characteristics of real-world data Slides from Marta Arias & R. Ferrer-i-Cancho, Intro to Complex and Social Networks 2 / 6

Analytics and Data Summit 2020 Analytics and Data Summit 2020 Analytics and Data Summit 2020

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

WITH RAPIDS Joe Eaton, Ph.D. Technical Lead for Graph Analytics AGENDA Introduction - Why

Undergraduate Business Analytics Minor Spreadsheet Analytics BANA-2081 Business Analytics

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

Massively Parallel Graph Analytics Supercomputing for large-scale graph analytics George M. Slota

Graph Data Processing M. Tamer Ozsu 1 / 75 Outline Introduction RDF Graph Querying

Introduction to Talent Analytics and Interim View 01 Overview Erich OSaben Talent Analytics

Architecture 3.0 Landscape Analytics Jrgen Dllner Hasso-Plattner-Institut Jrgen

Google Analytics Overview Whats Google Analytics? The Google Analytics

Google Analytics A beginners guide What is Google Analytics? Google Analytics is not magic.

Document Name Solar Analytics - Rooftop PV energy analytics PREPARED BY: Your Name, Your Title

Graph Indexing: Tree + Delta Delta >= Graph >= Graph Graph Indexing: Tree + Peixian Zhao,

Graph Mining Marco Serafini COMPSCI 532 Lecture 11 Classes of Graph Systems Graph

Graph Analytics using Vertica Relational Database Alekh Jindal - Samuel Madden, Mal

Graph Analytics on Massively Parallel Processing Databases Frank McQuillan Feb 2017 MPP

CONSENT: Scalable self-correction of long reads with multiple sequence alignment Pierre Morisse 1

Bio-communication and Natural Genome Editing A new concept for the emergence of biological

Semester projects The Plan Principles of Complex Systems Suggestions for CSYS/MATH 300, Spring,

Reverse engineering minimal wiring diagrams Elena Dimitrova School of Mathematical and

Disclosure Cervical Spinal Disorders Morio Matsumoto Morio Matsumoto received honorarium for

Outline 1 Introduction 2 Bayesian Networks 3 Neuroscience 4 Industry 5 Sport 6

Disclosures Complications in Cervical Deformity Surgery Stryker Spine: royalties Fellowship

CRLite: When Industry & Academia Collide Thyla van der Merwe Real World Crypto 9 January

Introduction to Computational Graph Analytics Lecture 1 CSCI - PowerPoint PPT Presentation

Introduction to Computational Graph Analytics Lecture 1 CSCI 4974/6971 29 August 2016 1 / 6 Graph, networks, and characteristics of real-world data Slides from Marta Arias & R. Ferrer-i-Cancho, Intro to Complex and Social Networks 2 / 6

Analytics and Data Summit 2020 Analytics and Data Summit 2020 Analytics and Data Summit 2020

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

WITH RAPIDS Joe Eaton, Ph.D. Technical Lead for Graph Analytics AGENDA Introduction - Why

Undergraduate Business Analytics Minor Spreadsheet Analytics BANA-2081 Business Analytics

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

Massively Parallel Graph Analytics Supercomputing for large-scale graph analytics George M. Slota

Graph Data Processing M. Tamer Ozsu 1 / 75 Outline Introduction RDF Graph Querying

Introduction to Talent Analytics and Interim View 01 Overview Erich OSaben Talent Analytics

Architecture 3.0 Landscape Analytics Jrgen Dllner Hasso-Plattner-Institut Jrgen

Google Analytics Overview Whats Google Analytics? The Google Analytics

Google Analytics A beginners guide What is Google Analytics? Google Analytics is not magic.

Document Name Solar Analytics - Rooftop PV energy analytics PREPARED BY: Your Name, Your Title

Graph Indexing: Tree + Delta Delta &gt;= Graph &gt;= Graph Graph Indexing: Tree + Peixian Zhao,

Graph Mining Marco Serafini COMPSCI 532 Lecture 11 Classes of Graph Systems Graph

Graph Analytics using Vertica Relational Database Alekh Jindal - Samuel Madden, Mal

Graph Analytics on Massively Parallel Processing Databases Frank McQuillan Feb 2017 MPP

CONSENT: Scalable self-correction of long reads with multiple sequence alignment Pierre Morisse 1

Bio-communication and Natural Genome Editing A new concept for the emergence of biological

Semester projects The Plan Principles of Complex Systems Suggestions for CSYS/MATH 300, Spring,

Reverse engineering minimal wiring diagrams Elena Dimitrova School of Mathematical and

Disclosure Cervical Spinal Disorders Morio Matsumoto Morio Matsumoto received honorarium for

Outline 1 Introduction 2 Bayesian Networks 3 Neuroscience 4 Industry 5 Sport 6

Disclosures Complications in Cervical Deformity Surgery Stryker Spine: royalties Fellowship

CRLite: When Industry &amp; Academia Collide Thyla van der Merwe Real World Crypto 9 January

Graph Indexing: Tree + Delta Delta >= Graph >= Graph Graph Indexing: Tree + Peixian Zhao,

CRLite: When Industry & Academia Collide Thyla van der Merwe Real World Crypto 9 January