
compsci 514: algorithms for data science

Cameron Musco, University of Massachusetts Amherst. Fall 2019. Lecture 16.


summary

Last Class:

  • Spectral clustering and embeddings
  • Started application to stochastic block model.

This Class:

  • Finish up stochastic block model.
  • Efficient algorithms for SVD/eigendecomposition.
  • Iterative methods: power method, Krylov subspace methods.

1


stochastic block model

Goal: Argue the effectiveness of spectral clustering in a natural, if oversimplified, generative model.

Stochastic Block Model (Planted Partition Model): Let Gn(p, q) be a distribution over graphs on n nodes, split equally into two groups B and C, each with n/2 nodes.

  • Any two nodes in the same group are connected with probability p (including self-loops).

  • Any two nodes in different groups are connected with prob. q < p.
  • Connections are independent.

2
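As a concrete illustration (a minimal NumPy sketch of my own, not from the lecture), the sample_sbm helper below draws a graph from Gn(p, q) with the first n/2 nodes as group B and the rest as group C; the symmetric 0/1 adjacency matrix A is the object the following slides analyze.

```python
import numpy as np

def sample_sbm(n, p, q, seed=0):
    """Sample an adjacency matrix from G_n(p, q): nodes 0..n/2-1 form group B,
    nodes n/2..n-1 form group C. Same-group pairs are connected with probability p
    (self-loops included), cross-group pairs with probability q, all independently."""
    rng = np.random.default_rng(seed)
    labels = np.array([1] * (n // 2) + [-1] * (n - n // 2))   # chi_{B,C} as +/-1 labels
    same_group = np.equal.outer(labels, labels)               # True where i, j are in the same group
    probs = np.where(same_group, p, q)                        # entrywise edge probabilities
    coins = rng.random((n, n)) < probs                        # independent coin flips
    A = np.triu(coins).astype(float)                          # keep upper triangle incl. diagonal
    A = A + np.triu(A, 1).T                                   # symmetrize
    return A, labels

A, labels = sample_sbm(n=1000, p=0.1, q=0.02)
print(A.shape, int(A.sum() // 2))  # size and a rough edge count
```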


expected adjacency spectrum

Let G be a stochastic block model graph drawn from Gn(p, q) and let A ∈ Rn×n be its adjacency matrix, so (E[A])i,j = p for i, j in the same group and (E[A])i,j = q otherwise. What is the rank of E[A] and how can you see this quickly? How many nonzero eigenvalues does E[A] have?

Gn(p, q): stochastic block model distribution. B, C: groups with n/2 nodes each. Connections are independent with probability p between nodes in the same group, and probability q between nodes not in the same group.

3


expected adjacency spectrum

v1 = 1⃗ (the all-ones vector) with eigenvalue λ1 = (p+q)n/2.

v2 = χB,C with eigenvalue λ2 = (p−q)n/2.

  • χB,C(i) = 1 if i ∈ B and χB,C(i) = −1 for i ∈ C.

If we compute v2 then we recover the communities B and C!

4
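A quick numerical sanity check of these claims (my own sketch, not part of the slides): build E[A] directly for a small n and confirm that its only nonzero eigenvalues are (p+q)n/2 and (p−q)n/2, with eigenvectors in the directions of the all-ones vector and χB,C (up to sign).

```python
import numpy as np

n, p, q = 8, 0.8, 0.2
labels = np.array([1] * (n // 2) + [-1] * (n // 2))      # chi_{B,C}
EA = np.where(np.equal.outer(labels, labels), p, q)      # (E[A])_ij = p or q

eigvals, eigvecs = np.linalg.eigh(EA)                    # ascending eigenvalues
print(np.round(eigvals, 6))                              # n-2 zeros, then (p-q)n/2 = 2.4 and (p+q)n/2 = 4.0

v1 = eigvecs[:, -1]   # top eigenvector: constant, i.e. the all-ones direction (up to sign)
v2 = eigvecs[:, -2]   # second eigenvector: proportional to chi_{B,C} (up to sign)
print(np.round(v1, 3))
print(np.round(v2, 3), " vs ", labels / np.sqrt(n))
```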


expected laplacian spectrum

Letting G be a stochastic block model graph drawn from Gn(p, q), A ∈ Rn×n be its adjacency matrix and L be its Laplacian, what are the eigenvectors and eigenvalues of E[L]?

5


expected laplacian spectrum

Upshot: The second smallest eigenvector of E[L] is χB,C – the indicator vector for the cut between the communities.

  • If the random graph G (equivalently A and L) were exactly equal to its expectation, partitioning using this eigenvector would exactly recover the two communities B and C.

How do we show that a matrix (e.g., A) is close to its expectation? Matrix concentration inequalities.

  • Analogous to scalar concentration inequalities like Markov's, Chebyshev's, Bernstein's.
  • Random matrix theory is a very recent and cutting-edge subfield of mathematics that is being actively applied in computer science, statistics, and ML.

7
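To see the upshot concretely, here is a small sketch of my own under the same Gn(p, q) setup: build E[L] = E[D] − E[A] and check that the eigenvector of its second smallest eigenvalue is proportional to χB,C (only up to a sign flip, which does not affect the partition).

```python
import numpy as np

n, p, q = 8, 0.8, 0.2
labels = np.array([1] * (n // 2) + [-1] * (n // 2))      # chi_{B,C}
EA = np.where(np.equal.outer(labels, labels), p, q)      # expected adjacency matrix
ED = np.diag(EA.sum(axis=1))                             # expected degree matrix
EL = ED - EA                                             # expected Laplacian

eigvals, eigvecs = np.linalg.eigh(EL)                    # eigenvalues in ascending order
print(np.round(eigvals, 6))                              # 0, then qn = 1.6, then (p+q)n/2 = 4.0 repeated
v2 = eigvecs[:, 1]                                       # second smallest eigenvector
print(np.round(v2, 3), " vs ", labels / np.sqrt(n))      # equal up to sign
```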


matrix concentration

Matrix Concentration Inequality: If p ≥ O(log⁴ n / n), then with high probability ∥A − E[A]∥2 ≤ O(√pn), where ∥ · ∥2 is the matrix spectral norm (operator norm). For any X ∈ Rn×d, ∥X∥2 = max_{z ∈ Rd: ∥z∥2 = 1} ∥Xz∥2.

Exercise: Show that ∥X∥2 is equal to the largest singular value of X. For symmetric X (like A − E[A]), show that it is equal to the magnitude of the largest magnitude eigenvalue.

For the stochastic block model application, we want to show that the second eigenvectors of A and E[A] are close. How does this relate to their difference in spectral norm?

8
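The inequality needs a proof, but it is easy to eyeball numerically. The self-contained sketch below (mine, not from the course) samples SBM graphs of increasing size and compares ∥A − E[A]∥2, computed as the largest-magnitude eigenvalue of the symmetric deviation (exactly the exercise above), against √pn.

```python
import numpy as np

rng = np.random.default_rng(0)

def deviation_vs_bound(n, p, q):
    labels = np.array([1] * (n // 2) + [-1] * (n // 2))
    EA = np.where(np.equal.outer(labels, labels), p, q)          # expected adjacency
    A = np.triu((rng.random((n, n)) < EA).astype(float))
    A = A + np.triu(A, 1).T                                      # sampled symmetric adjacency
    dev = np.abs(np.linalg.eigvalsh(A - EA)).max()               # spectral norm of a symmetric matrix
    return dev, np.sqrt(p * n)

for n in [200, 800, 3200]:
    dev, bound = deviation_vs_bound(n, p=0.1, q=0.05)
    print(f"n = {n:5d}   ||A - E[A]||_2 = {dev:7.2f}   sqrt(pn) = {bound:7.2f}")
```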


eigenvector perturbation

Davis-Kahan Eigenvector Perturbation Theorem: Suppose A, Ā ∈ Rd×d are symmetric with ∥A − Ā∥2 ≤ ϵ and eigenvectors v1, v2, . . . , vd and v̄1, v̄2, . . . , v̄d. Letting θ(vi, v̄i) denote the angle between vi and v̄i, for all i:

sin[θ(vi, v̄i)] ≤ ϵ / min_{j≠i} |λi − λj|

where λ1, . . . , λd are the eigenvalues of Ā. The errors get large if there are eigenvalues with similar magnitudes.

9
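A hedged numerical illustration of the theorem (my own sketch with made-up matrices): perturb a symmetric matrix with well-separated eigenvalues and compare the measured sin θ(vi, v̄i) against ϵ divided by the eigenvalue gap.

```python
import numpy as np

rng = np.random.default_rng(1)
d = 6
Abar = np.diag([10.0, 6.0, 3.0, 1.0, 0.5, 0.1])      # symmetric, well-separated eigenvalues
E = rng.standard_normal((d, d)) * 0.05
E = (E + E.T) / 2                                     # symmetric perturbation
A = Abar + E
eps = np.abs(np.linalg.eigvalsh(E)).max()             # ||A - Abar||_2

wbar, Vbar = np.linalg.eigh(Abar)                     # eigenvalues/vectors, ascending order
w, V = np.linalg.eigh(A)
for i in range(d):
    cos = abs(Vbar[:, i] @ V[:, i])                   # |cos theta|, insensitive to sign flips
    sin = np.sqrt(max(0.0, 1.0 - cos**2))
    gap = np.abs(np.delete(wbar, i) - wbar[i]).min()  # min_{j != i} |lambda_i - lambda_j|
    print(f"i = {i}   sin(theta) = {sin:.4f}   eps / gap = {eps / gap:.4f}")
```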


application to stochastic block model

Claim 1 (Matrix Concentration): For p ≥ O(log⁴ n / n), ∥A − E[A]∥2 ≤ O(√pn).

Claim 2 (Davis-Kahan): For p ≥ O(log⁴ n / n),

sin θ(v2, v̄2) ≤ O(√pn) / min_{j≠2} |λ2 − λj| ≤ O(√pn) / ((p − q)n/2) = O(√p / ((p − q)√n)).

Recall: E[A] has eigenvalues λ1 = (p+q)n/2, λ2 = (p−q)n/2, and λi = 0 for i ≥ 3. So min_{j≠2} |λ2 − λj| = min(qn, (p − q)n/2). Typically, (p − q)n/2 will be the minimum of these two gaps.

A: adjacency matrix of a random stochastic block model graph. p: connection probability within clusters. q < p: connection probability between clusters. n: number of nodes. v2, v̄2: second eigenvectors of A and E[A] respectively.

11


application to stochastic block model

So Far: sin θ(v2, v̄2) ≤ O(√p / ((p − q)√n)). What does this give us?

  • Can show that this implies ∥v2 − v̄2∥2² ≤ O(p / ((p − q)²n)) (exercise).
  • v̄2 is (1/√n)·χB,C: the community indicator vector.
  • Every i where v2(i), v̄2(i) differ in sign contributes ≥ 1/n to ∥v2 − v̄2∥2².
  • So they differ in sign in at most O(p / (p − q)²) positions.

A: adjacency matrix of a random stochastic block model graph. p: connection probability within clusters. q < p: connection probability between clusters. n: number of nodes. v2, v̄2: second eigenvectors of A and E[A] respectively.

12


application to stochastic block model

Upshot: If G is a stochastic block model graph with adjacency matrix A, and we compute its second largest eigenvector v2 and assign nodes to communities according to the sign pattern of this vector, we will correctly assign all but O(p / (p − q)²) nodes.

  • Why does the error increase as q gets close to p?
  • Even when p − q = O(1/√n), assign all but an O(n) fraction of nodes correctly, e.g., assign 99% of nodes correctly.

13
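The whole pipeline fits in a few lines. The sketch below is my own illustration (not the course's reference code): sample from Gn(p, q), take the second largest eigenvector of A, assign communities by sign, and count mistakes up to the global sign flip that the analysis ignores.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p, q = 2000, 0.10, 0.04

labels = np.array([1] * (n // 2) + [-1] * (n // 2))              # ground-truth communities
probs = np.where(np.equal.outer(labels, labels), p, q)
A = np.triu((rng.random((n, n)) < probs).astype(float))
A = A + np.triu(A, 1).T                                          # sampled symmetric adjacency

eigvals, eigvecs = np.linalg.eigh(A)                             # ascending eigenvalues
v2 = eigvecs[:, -2]                                              # second largest eigenvector
guess = np.where(v2 >= 0, 1, -1)                                 # assign by sign pattern
errors = min(np.sum(guess != labels), np.sum(-guess != labels))  # best over the two sign flips
print(f"misclassified {errors} of {n} nodes ({100 * errors / n:.2f}%)")
```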


Questions on spectral partitioning?

14


efficient eigendecomposition and svd

We have talked about the eigendecomposition and SVD as ways to compress data, to embed entities like words and documents, to compress/cluster non-linearly separable data. How efficient are these techniques? Can they be run on massive datasets?

15


computing the svd

To compute the SVD of A ∈ Rn×d, A = UΣVᵀ, first compute V. Then compute UΣ = AV.

  • Compute AᵀA – O(nd²) runtime.
  • Find eigendecomposition AᵀA = VΛVᵀ – O(d³) runtime.
  • Compute L = AV. Set σi = ∥Li∥2 and Ui = Li/∥Li∥2 – O(nd²) runtime.

Total runtime: O(nd² + d³) = O(nd²) (assume w.l.o.g. n ≥ d).

  • If we have n = 10 million images with 200 × 200 × 3 = 120,000 pixel values each, the runtime is 1.5 × 10¹⁷ operations!
  • The world's fastest supercomputers compute at ≈ 100 petaFLOPS = 10¹⁷ FLOPS (floating point operations per second).
  • This is an easy task for them – but no one else.

16
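These three steps translate directly into NumPy. The sketch below is my own and assumes A has full column rank so that no σi is zero; it is meant to exhibit the O(nd² + d³) recipe, not to be a numerically robust SVD.

```python
import numpy as np

def svd_via_gram(A):
    """SVD of A (n x d, n >= d) via the eigendecomposition of A^T A."""
    lam, V = np.linalg.eigh(A.T @ A)        # form A^T A in O(nd^2), decompose in O(d^3); lam = sigma^2
    order = np.argsort(lam)[::-1]           # put eigenvalues (hence singular values) in descending order
    V = V[:, order]
    L = A @ V                               # O(nd^2); column i equals sigma_i * u_i
    sigma = np.linalg.norm(L, axis=0)       # sigma_i = ||L_i||_2
    U = L / sigma                           # u_i = L_i / ||L_i||_2 (assumes sigma_i > 0)
    return U, sigma, V

rng = np.random.default_rng(0)
A = rng.standard_normal((50, 8))
U, sigma, V = svd_via_gram(A)
print(np.allclose((U * sigma) @ V.T, A))                        # A = U Sigma V^T
print(np.allclose(sigma, np.linalg.svd(A, compute_uv=False)))   # matches NumPy's singular values
```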


faster algorithms

To speed up SVD computation we will take advantage of the fact that we typically only care about computing the top (or bottom) k singular vectors for k ≪ d.

  • Suffices to compute Vk ∈ Rd×k and then compute UkΣk = AVk.
  • Use an iterative algorithm to compute an approximation to the top k singular vectors Vk.
  • Runtime will be roughly O(ndk) instead of O(nd²).

Won’t cover: randomized methods, which can be much faster in some cases.

17
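For instance, once some iterative routine has produced the (approximate) top-k right singular vectors Vk, the remaining factors cost only a single O(ndk) multiplication. In this sketch of mine, Vk is simply taken from a full SVD so the example stays self-contained; an iterative method would supply it instead.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, k = 500, 60, 5
A = rng.standard_normal((n, d))

# Stand-in for the iterative step: the top-k right singular vectors (d x k).
Vk = np.linalg.svd(A, full_matrices=False)[2][:k].T

L = A @ Vk                                  # O(ndk); column i equals sigma_i * u_i
sigma_k = np.linalg.norm(L, axis=0)         # top-k singular values
Uk = L / sigma_k                            # top-k left singular vectors

A_k = (Uk * sigma_k) @ Vk.T                 # rank-k approximation U_k Sigma_k V_k^T
print(np.linalg.norm(A - A_k) / np.linalg.norm(A))   # relative error of the rank-k approximation
```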


sparse vs. direct

In numerical linear algebra, two main types of methods:

Direct Methods: Gaussian elimination, QR decomposition, Cholesky decomposition, etc.

  • Directly manipulate the entries of the input matrix A. Typically run in O(n³) time for an n × n matrix.

Sparse (Iterative) Methods: Conjugate gradient, Gauss-Seidel, Krylov subspace methods, Lanczos, gradient descent.

  • Generally only access A via a sequence of matrix-vector multiplications Ax1, Ax2, . . . , Axt.
  • Runtime is (# iterations t) × (matrix-vector multiplication time) = O(nnz(A) · t) = O(ndt), where nnz(A) is the number of nonzero entries in A.
  • Not just for sparse matrices!

18


sparse vs. direct

Matlab: svd and eig vs. svds and eigs.

SciPy (Python): scipy.linalg.svd vs. scipy.sparse.linalg.svds.

19
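A small usage sketch of the SciPy routines named here (assuming NumPy and SciPy are installed): the dense routine factors the whole matrix, while svds only touches A through matrix-vector products and therefore also accepts sparse matrices. The two should agree on the top singular values up to solver tolerance.

```python
import numpy as np
from scipy.linalg import svd
from scipy.sparse import random as sparse_random
from scipy.sparse.linalg import svds

A = sparse_random(2000, 500, density=0.01, random_state=0, format="csr")

# Direct method: dense SVD of the full matrix, roughly O(nd^2).
U, s, Vt = svd(A.toarray(), full_matrices=False)

# Iterative method: top-k singular triplets using only matrix-vector products.
Uk, sk, Vtk = svds(A, k=5)      # note: svds returns singular values in ascending order

print(np.round(np.sort(sk)[::-1], 4))
print(np.round(s[:5], 4))
```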


power method

Power Method: The most fundamental iterative method for approximate SVD. Applies to computing k = 1 singular vectors.

Goal: Given A ∈ Rn×d with SVD A = UΣVᵀ, find z ≈ v1.

  • Choose z(0) randomly. E.g. z(0)(i) ∼ N(0, 1).
  • For i = 1, . . . , t:
    • z(i) = Aᵀ · (Az(i−1))  (Runtime: 2 · nd)
    • ni = ∥z(i)∥2  (Runtime: d)
    • z(i) = z(i)/ni  (Runtime: d)
  • Return z(t).

Total Runtime: O(ndt)

20
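A line-by-line NumPy rendering of this pseudocode (my sketch, not the course's reference implementation). The returned vector is only determined up to sign, and a small gap between σ1 and σ2 slows convergence, as the theorem at the end of the lecture quantifies.

```python
import numpy as np

def power_method(A, t=100, seed=0):
    """Approximate the top right singular vector v1 of A with t iterations."""
    rng = np.random.default_rng(seed)
    n, d = A.shape
    z = rng.standard_normal(d)             # z(0) with i.i.d. N(0, 1) entries
    for _ in range(t):
        z = A.T @ (A @ z)                  # z(i) = A^T (A z(i-1)), about 2*nd operations
        z = z / np.linalg.norm(z)          # normalize, O(d)
    return z

rng = np.random.default_rng(1)
A = rng.standard_normal((300, 40))
z = power_method(A, t=200)
v1 = np.linalg.svd(A, full_matrices=False)[2][0]             # true top right singular vector
print(min(np.linalg.norm(z - v1), np.linalg.norm(z + v1)))   # small error, up to sign
```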


power method intuition

Write z(0) in the right singular vector basis: z(0) = c1·v1 + c2·v2 + . . . + cd·vd.

Update step: z(i) = Aᵀ · (Az(i−1)) = VΣ²Vᵀz(i−1) (then normalize).

Claim: z(1) = (1/n1)·[c1·σ1²·v1 + c2·σ2²·v2 + . . . + cd·σd²·vd]

21


power method intuition

Claim: z(t) = (1/(n1 · n2 · · · nt))·[c1·σ1^(2t)·v1 + c2·σ2^(2t)·v2 + . . . + cd·σd^(2t)·vd]

After t iterations, you have ‘powered up’ the singular values, making the component in the direction of v1 much larger relative to the other components.

22
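The claim is easy to verify numerically (my own sketch): run the update without normalization (normalization only rescales the vector), expand z(t) in the right singular vector basis, and compare the coefficients with ci·σi^(2t).

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((100, 5))
U, sigma, Vt = np.linalg.svd(A, full_matrices=False)
V = Vt.T

z0 = rng.standard_normal(5)
c = V.T @ z0                           # coefficients c_i of z(0) in the v_i basis

t = 3
z = z0.copy()
for _ in range(t):
    z = A.T @ (A @ z)                  # unnormalized power method update

coeffs = V.T @ z                       # coefficients of z(t) in the v_i basis
predicted = c * sigma ** (2 * t)       # claim: c_i * sigma_i^(2t)
print(np.allclose(coeffs, predicted))  # should print True
```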


power method convergence

Theorem (Basic Power Method Convergence): Let γ = (σ1 − σ2)/σ1 be a parameter capturing the “gap” between the first and second largest singular values. If the power method is initialized with a random Gaussian vector, then with high probability, after t = O(log(d/ϵ)/γ) steps, ∥v1 − z(t)∥2 ≤ ϵ.

Total runtime: O(nnz(A) · log(d/ϵ)/γ) = O(nd · log(d/ϵ)/γ).

Next Time: Will analyze this method formally.

23