(Nearly) Efficient Algorithms
for the
Graph Matching Problem
Tselil Schramm
(Harvard/MIT)
with Boaz Barak, Chi-Ning Chou, Zhixian Lei & Yueqi Sheng (Harvard)
(Nearly) Efficient Algorithms for the Graph Matching Problem - - PowerPoint PPT Presentation
(Nearly) Efficient Algorithms for the Graph Matching Problem Tselil Schramm (Harvard/MIT) with Boaz Barak, Chi-Ning Chou, Zhixian Lei & Yueqi Sheng (Harvard) graph matching problem (approximate graph isomorphism) input: two graphs on
Tselil Schramm
(Harvard/MIT)
with Boaz Barak, Chi-Ning Chou, Zhixian Lei & Yueqi Sheng (Harvard)
π»0 π»1
goal: find permutation of vertices that maximizes # shared edges
maπ¦
π
π΅π»0, π π΅π»1
input: two graphs on π vertices
π»0 π»1 maπ¦
π
π΅π»0, π π΅π»1 # matched = 4
goal: find permutation of vertices that maximizes # shared edges input: two graphs on π vertices
π»0 π»1 maπ¦
π
π΅π»0, π π΅π»1
goal: find permutation of vertices that maximizes # shared edges input: two graphs on π vertices
π»0 π»1 maπ¦
π
π΅π»0, π π΅π»1 # matched = 5
goal: find permutation of vertices that maximizes # shared edges input: two graphs on π vertices
NP-hard: reduction from quadratic assignment problem (non-simple graphs).
[Lawlerβ63]
also: reduction from sparse random 3-SAT to approximate version
[OβDonnell-Wright-Wu-Zhouβ14]
[Conte-Foggia-Sansone-Ventoβ04]
maπ¦
π
π΅π»0, π π΅π»1 β ππΏ2 β π 2
sample π» βΌ π»(π, π) subsample edges w/prob πΏ π» π»0 π»1 π»0 random permutation π
structured model
[e.g. Pedarsani-Grossglauserβ11, Lyzinski-Fishkind-Priebeβ14, Korula-Lattanziβ14]
βrobust average-case graph isomorphismβ
β ππΏ2 β π 2 βnullβ model
sample π» βΌ π»(π, π) subsample edges w/prob πΏ π» π»0 π»1 π»0
π structured model
βrobust average-case graph isomorphismβ
maπ¦
π
π΅π»0, π π΅π»1 β ππΏ 2 β π 2
β ππΏ2 β π 2 βnullβ model
sample π»0, π»1 βΌ π»(π, ππΏ) π»0 π»1
sample π» βΌ π»(π, π) subsample edges w/prob πΏ π» π»0 π»1 π»0
π structured model
βrobust average-case graph isomorphismβ
Iff ππΏ2 >
log π π , with high probability π is the unique maximizing permutation.
Theorem
π»(π, π)
π» π»0 π»1 π»0
πΏ πΏ π for which π, πΏ can we recover π?
[Cullina-Kivayashβ16&17]
e.g. matching local neighborhoods average-case graph isomorphism algorithms fail. match radius-π neighborhoods?
π»0 π»1
πΏ πΏ
e.g. matching local neighborhoods average-case graph isomorphism algorithms fail. match radius-π neighborhoods?
π»0 π»1
average-case graph isomorphism algorithms fail. e.g. spectral algorithm unique entries in top eigenvector give isomorphism? π€max π€max
π»0 π»1
average-case graph isomorphism algorithms fail. e.g. spectral algorithm unique entries in top eigenvector give isomorphism? π€max π€max
πΏ πΏ
+ +
πΏ πΏ
perturb eigenvectors by β πΏ
π»0 π»1
πΘπ known π π(π)
π»0 π»1
π π£ π€
match vertices with similar adjacency into π
π»0 π»1
π π π£ = π€
match vertices with similar adjacency into π
need 2 ΰ·¨
π ππ time to guess a seed.
iff seed β₯ Ξ©(ππ), the seeded algorithm approximately recovers π. [Yartseva-Grossglauserβ13]
π»0 π»1
For any π > 0, if π β
ππ(1) π
,
π
1 153
π
βͺ
π
2 3
π , π1βπ π
and πΏ = Ξ©(1),* there is a ππ(log π) time algorithm that recovers π on π β π(π) of the vertices w/prob β₯ 0.99. Theorem
π»(π, π)
π» π»0 π»1 π»0
πΏ πΏ π
ππ(1) π1/153 π2/3 π1βπ π log π average degrees: *we allow πΏ = Ξ©
1 loglog π
π1/3 π1/2 π3/5 structured π»(π, π)
For any π > 0, if π β
ππ(1) π
,
π
1 153
π
βͺ
π
2 3
π , π1βπ π
and πΏ = Ξ©(1),* there is a ππ(log π) time algorithm that recovers π on π β π(π) of the vertices w/prob β₯ 0.99.
Theorem
*we allow πΏ β₯
1 logπ(1) π
π»(π, ππΏ) π»0 π»1
If π, πΏ are as above then there is a poly(π) time distinguishing algorithm for the structured vs null distributions. Theorem
hypothesis testing structured null π»(π, π)
π» π»0 π»1 π»0
πΏ πΏ π
π»(π, π)
π» π»0 π»1 π»0
πΏ πΏ π
π»(π, ππΏ) π»0 π»1 structured null Given π»0, π»1 sampled equally likely from structured or null, decide w/prob 1 β π(1) from which. π»1 π»0
brute force: is there a π with β₯ ππΏ2π2 matched edges?
π»(π, π)
π» π»0 π»1 π»0
πΏ πΏ π
π»(π, ππΏ) π»0 π»1 structured null π»1 π»0
π»(π, π)
π» π»0 π»1 π»0
πΏ πΏ π
π»(π, ππΏ) π»0 π»1 structured null π»1 π»0
triangle counts in π»0, π»1 are independent π½ πππ
π3(π»0, π»1) β ππΏπ 6
π»(π, π)
π» π»0 π»1 π»0
πΏ πΏ π
structured π»1 π»0
triangle counts in π»0, π»1 are correlated
π»(π, ππΏ) π»0 π»1 null
π½ πππ
πΏ3(π»0, π»1) β ππΏπ 6 + πΏ2ππ 3
π»(π, π)
π» π»0 π»1 π»0
πΏ πΏ π
structured π»(π, ππΏ) π»0 π»1 null
π½ πππ
πΏ3(π»0, π»1) β ππΏπ 6
π½ πππ
πΏ3(π»0, π»1) β ππΏπ 6 + πΏ2ππ 3
Variance?
Optimistically, in null case,
π πππ
πΏ3 π»0, π»1 1/2 β ππΏπ 3
structured null
Suppose we had π βindependent trialsβ:
structured null
1/2 β 1
if π > 1/πΏ6, πππ π is a good test πππ π π»0, π»1 = 1 π ΰ·
π=1 π
πππ
πΏ3 (π)(π»0, π»1) π»(π, π)
π» π»0 π»1 π»0
πΏ πΏ
π π»(π, ππΏ) π»0 π»1
near-independent subgraphs
Suppose we had π βindependentβ subgraphs: πππ π π»0, π»1 = 1 π ΰ·
π=1 π
πππ
πΌπ(π»0, π»1)
π½ #πΌ π» = 5! Θππ£π’ πΌ Θ β π 5 β π7
π = πβ5/7
How many labeled copies of πΌ in π»?
π»(π, π)
β π5π7 = Ξ(1)
π = πβ5/7
How many labeled copies of πΌ in π»?
π»(π, π)
π½ #πΌ π» = 5! Θππ£π’ πΌ Θ β π 5 β π7 How many labeled copies of πΏ4 in π»? π½ #πΏ4 π» = 4! Θππ£π’ πΏ4 Θ β π 4 β π6 β π5π7 = Ξ(1) β π4π6 = Ξ(πβ2/7)
π»(π, π)
For a constant-sized subgraph πΌ, Lemma
2
πΎβπΌ π½[#πΎ π» ]
subgraph of πΌ with fewest expected appearances
For a constant-sized subgraph πΌ, Lemma
2
πΎβπΌ π½[#πΎ π» ]
πΌ is strictly balanced if all its strict subgraphs have edge density <
πΉ πΌ π πΌ .
if π½ #πΌ π» β π π πΌ π πΉ πΌ = Ξ(1), then π½ #πΎ π» = π 1 for any πΎ β πΌ. = π 1 β π½ #πΌ π»
If πΌ1, β¦ , πΌπ are non-isomorphic strictly balanced graphs with π½ #πΌπ π» = Ξ 1 ,
βπ β [π], βπ β π β π ,
their counts concentrate their counts are asymptotically independent
For π€ =
1 poly πΏ , design a βtest setβ
set ππ€ ππΏ π β 1 π»(π, π)
π» π»0 π»1 π»0
π»(π, ππΏ) π»0 π»1
For π€ =
1 poly πΏ , design a βtest setβ
πππ π π»0, π»1 = 1 π ΰ·
π=1 π
πππ
πΌπ(π»0, π»1)
compute
π½ πππ π π»0, π»1 = π2π€(πΏπ)2π+ππ€ πΏ2π π set ππ€ ππΏ π β 1 π2π€(πΏπ)2π π πππ π π»0, π»1 = 1 π ππ€ πΏπ π < ππ€ πΏ2π π structured null null β₯ π < π null structured TODO: variance in structured case. π»(π, π)
π» π»0 π»1 π»0
π»(π, ππΏ) π»0 π»1
For π€ =
1 poly πΏ , design a βtest setβ
set ππ€ ππΏ π β 1 ππ(1) π1/153 π2/3 π1βπ π log π average degree: π1/3 π1/2 π3/5 remember? π»(π, π)
For π€ =
1 poly πΏ , design a βtest setβ
set ππ€ ππΏ π β 1
For π€ =
1 poly πΏ , design a βtest setβ
set ππ€ ππΏ π β 1
claim: connected π-regular graphs are strictly balanced. proof: in any strict subgraph, average degree < π.
For π€ =
1 poly πΏ , design a βtest setβ
set ππ€ ππΏ π β 1
claim: connected π-regular graphs are strictly balanced. proof: in any strict subgraph, average degree < π.
π2/3 π log π average degree: π1/3 π1/2 π3/5 π»(π, π)
For π€ =
1 poly πΏ , design a βtest setβ
set ππ€ ππΏ π β 1
what if we want 2 β
π π€ = π β π + 1 + 1 β π β π? π-regular random graph on π€ vertices + random matching
strict balance? expansion.
π2/3 π1βπ π log π average degree: π1/3 π1/2 π3/5 π»(π, π)
For π€ =
1 poly πΏ , design a βtest setβ
set ππ€ ππΏ π β 1
what if we want 2 β
π π€ = π β π + 1 + 1 β π β π? π-regular random graph on π€ vertices + random matching
strict balance? expansion.
2-regular graphs donβt expand.
For π€ =
1 poly πΏ , design a βtest setβ
set ππ€ ππΏ π β 1
what if we want 2 β
π π€ = π β 3 + 1 β π β 2? 3-regular random
graph on ππ€ vertices subdivide edges into π and π + 1 - length paths π£ π€ π£ π€
strict balance? expansion.
ππ(1) π1/153 π2/3 π1βπ π log π average degree: π1/3 π1/2 π3/5 π»(π, π)
For π€ =
1 poly πΏ , design a βtest setβ
set ππ€ ππΏ π β 1
Conjecture: our construction achieves all
π π€ π1βπ π log π average degree: π1/3 π»(π, π) π-reg +matching subdivide
+ more conditions (for recovery)
distinguishing: subgraphs on
1 poly πΏ = π(1) vertices, each appearing π(1) times
distinguishing: counting subgraphs ambiguity in matching; how to conclude π π£ = π€?
choose test set πΌ1, β¦ , πΌπ so that πΏ2π πππ€ β« πΏπ 2ππ2π€, if we see πΌπ in both graphs, it is most likely because of correlation. choose large test set πΌ1, β¦ , πΌπ with π€ = π(log π) vertices Ξ©(π) vertices participate in subgraphs from our test set.
identify rare subgraphs appearing in both graphs, and match vertices.
π»1 π»0 expected number of that survive subsampling expected number of unrelated pairs of
identify rare subgraphs appearing in both graphs, and match vertices.
π»1 π»0
Claim: there is at most one copy of each in π» with high probability
π»(π, π)
π» π»0 π»1 π»0
Claim: Ξ©(π) vertices in π»0 β© π»1, appear in a surviving subsampled with high probability
proofs: second moment method
emerging intuition/conjectures: SoS β‘πππ low-degree polynomials the sum-of-squares (SoS) semidefinite program is at most as powerful as βlow-degreeβ statistics for average-case problems. known to hold for: planted clique [Barak-Hopkins-Kelner-Kothari-Moitra-Potechinβ16] CSP refutation [Grigorievβ01, Schoenebeckβ08, Kothari-Mori-OβDonnell-Witmerβ17] tensor PCA [Hopkins-Kothari-Potechin-Raghavendra-S-Steurerβ17] also known: SoS is at most as powerful as βlow-degreeβ spectral algorithms for average-case problems [Hopkins-Kothari-Potechin-Raghavendra-S-Steurerβ17]
π
does the natural SoS relaxation recover π?
cares about can ask similar questions about other low-degree functions, e.g. non-backtracking random walk matrix.
SoS relaxation
SoS? or, many variations on our theme are possible.
log π π , π 1 ?
π log π π1/3