On Solving Linear Systems in Sublinear Time Alexandr Andoni, - - PowerPoint PPT Presentation

โ–ถ
on solving linear systems in sublinear time
SMART_READER_LITE
LIVE PREVIEW

On Solving Linear Systems in Sublinear Time Alexandr Andoni, - - PowerPoint PPT Presentation

On Solving Linear Systems in Sublinear Time Alexandr Andoni, Columbia University Robert Krauthgamer, Weizmann Institute Yosef Pogrow, Weizmann Institute Google WOLA 2019 Solving Linear Systems Input: and


slide-1
SLIDE 1

On Solving Linear Systems in Sublinear Time

Alexandr Andoni, Columbia University Robert Krauthgamer, Weizmann Institute Yosef Pogrow, Weizmann Institute ๏ƒ  Google WOLA 2019

slide-2
SLIDE 2

Solving Linear Systems

๏ฎ

Input: ๐ต โˆˆ โ„๐‘œร—๐‘œ and ๐‘ โˆˆ โ„๐‘œ

๏ฎ

Output: vector ๐‘ฆ that solves ๐ต๐‘ฆ = ๐‘

๏ฎ

Many algorithms, different variants:

๏ฑ

Matrix ๐ต is sparse, Laplacian, PSD etc.

๏ฑ

Bounded precision (solution ๐‘ฆ is approximate) vs. exact arithmetic

๏ฎ

Significant progress: Linear system in Laplacian matrix ๐‘€๐ป can be solved approximately in near-linear time เทจ ๐‘ƒ(nnz ๐‘€๐ป โ‹… log

1 ๐œ—) [Spielman-

Tengโ€™04, โ€ฆ, Cohen-Kyng-Miller-Pachocky-Peng-Rao-Xuโ€™14]

On Solving Linear Systems in Sublinear Time

Our focus: Sublinear running time

2

slide-3
SLIDE 3

Sublinear-Time Solver

๏ฎ

Input: ๐ต โˆˆ โ„๐‘œร—๐‘œ, ๐‘ โˆˆ โ„๐‘œ (also ๐œ— > 0) and ๐‘— โˆˆ [๐‘œ]

๏ฎ

Output: approximate coordinate เทœ ๐‘ฆ๐‘— from (any) solution ๐‘ฆโˆ— to ๐ต๐‘ฆ = ๐‘

๏ฑ

Accuracy bound เทœ ๐‘ฆ โˆ’ ๐‘ฆโˆ— โˆž โ‰ค ๐œ— ๐‘ฆโˆ— โˆž

๏ฎ

Formal requirement: There is a solution ๐‘ฆโˆ— to the system, such that โˆ€๐‘— โˆˆ ๐‘œ , Pr เทœ ๐‘ฆ๐‘— โˆ’ ๐‘ฆ๐‘—

โˆ— โ‰ค ๐œ— ๐‘ฆโˆ— โˆž โ‰ฅ 3 4

๏ฎ

Follows framework of Local Computation Algorithms (LCA), previously used for graph problems [Rubinfeld-Tamir-Vardi-Xieโ€™10]

On Solving Linear Systems in Sublinear Time 3

slide-4
SLIDE 4

Motivation

๏ฎ

Fast quantum algorithms for solving linear systems and for machine learning problems [Harrow-Hassidim-Lloydโ€™09, โ€ฆ]

๏ฑ

Can we match their performance classically?

๏ฑ

Recent success story: quantum ๏ƒ  classical algorithm [Tangโ€™18]

๏ฎ

New direction in sublinear-time algorithms

๏ฑ

โ€œLocalโ€ computation in numerical problems

๏ฑ

Compare computational models (representation, preprocessing), accuracy guarantees, input families (e.g., Laplacian vs. PSD)

๏ฑ

Known quantum algorithms have modeling requirements (e.g., quantum encoding of ๐‘)

On Solving Linear Systems in Sublinear Time 4

slide-5
SLIDE 5

Algorithm for Laplacians

๏ฎ

Informally: Can solve Laplacian systems of bounded-degree expander in polylog(n) time

๏ฑ

Key limitations: sparsity and condition number

๏ฎ

Notation:

๏ฑ

๐‘€๐ป = ๐ธ โˆ’ ๐ต is the Laplacian matrix of graph ๐ป

๏ฑ

๐‘€๐ป

+ is its Moore-Penrose pseudo-inverse ๏ฎ

Theorem 1: Suppose the input is a ๐‘’-regular ๐‘œ-vertex graph ๐ป, together with its condition number ๐œ† > 0, ๐‘ โˆˆ โ„๐‘œ, ๐‘ฃ โˆˆ ๐‘œ and ๐œ— > 0. Our algorithm computes เทœ ๐‘ฆ๐‘ฃ โˆˆ โ„ such that for ๐‘ฆโˆ— = ๐‘€๐ป

+๐‘,

โˆ€๐‘ฃ โˆˆ ๐‘œ , Pr เทœ ๐‘ฆ๐‘ฃ โˆ’ ๐‘ฆ๐‘ฃ

โˆ— โ‰ค ๐œ— ๐‘ฆโˆ— โˆž โ‰ฅ 3 4,

and runs in time เทจ ๐‘ƒ(๐‘’๐œ—โˆ’2๐‘ก3) for ๐‘ก = เทจ ๐‘ƒ(๐œ† log ๐‘œ).

On Solving Linear Systems in Sublinear Time

More inputs? Faster?

5

slide-6
SLIDE 6

Some Extensions

๏ฎ

Can replace ๐‘œ with ๐‘ 0

๏ฑ

Example: Effective resistance can be approximate (in expanders) in constant running time! ๐‘†eff(๐‘ฃ, ๐‘ค) = ๐‘“๐‘ฃ โˆ’ ๐‘“๐‘ค ๐‘ˆ๐‘€๐ป

+(๐‘“๐‘ฃ โˆ’ ๐‘“๐‘ค) ๏ฎ

Improved running time if

๏ฑ

Graph ๐ป is preprocessed

๏ฑ

One can sample a neighbor in ๐ป, or

๏ฎ

Extends to Symmetric Diagonally Dominant (SDD) matrix ๐‘‡

๏ฑ

๐œ† is condition number of ๐ธโˆ’1/2๐‘‡๐ธโˆ’1/2

On Solving Linear Systems in Sublinear Time 6

slide-7
SLIDE 7

Lower Bound for PSD Systems

๏ฎ

Informally: Solving โ€œsimilarโ€ PSD systems requires polynomial time

๏ฑ

Similar = bounded condition number and sparsity

๏ฑ

Even if the matrix can be preprocessed

๏ฎ

Theorem 2: For certain invertible PSD matrices ๐‘‡, with bounded sparsity ๐‘’ and condition number ๐œ†, every randomized algorithm must query ๐‘œฮฉ(1/๐‘’2) coordinates of the input ๐‘.

๏ฎ

Here, the output is เทœ ๐‘ฆ๐‘ฃ โˆˆ โ„ for a fixed ๐‘ฃ โˆˆ ๐‘œ , required to satisfy โˆ€๐‘ฃ โˆˆ ๐‘œ , Pr เทœ ๐‘ฆ๐‘ฃ โˆ’ ๐‘ฆ๐‘ฃ

โˆ— โ‰ค 1 5 ๐‘ฆโˆ— โˆž โ‰ฅ 3 4,

for ๐‘ฆโˆ— = ๐‘‡โˆ’1๐‘.

๏ฎ

In particular, ๐‘‡ may be preprocessed

On Solving Linear Systems in Sublinear Time 7

slide-8
SLIDE 8

Dependence on Condition Number

๏ฎ

Informally: Quadratic dependence on ๐œ† is necessary

๏ฑ

Our algorithmic bound เทฉ O(๐œ†3) is near-optimal, esp. when matrix ๐‘‡ can be preprocessed

๏ฎ

Theorem 3: For certain graphs ๐ป of maximum degree 4 and any condition number ๐œ† > 0, every randomized algorithm (for ๐‘€๐ป) with accuracy ๐œ— =

1 log ๐‘œ must probe เทฉ

ฮฉ(๐œ†2) coordinates of the input ๐‘.

๏ฎ

Again, the output is เทœ ๐‘ฆ๐‘ฃ โˆˆ โ„ for a fixed ๐‘ฃ โˆˆ ๐‘œ , required to satisfy โˆ€๐‘ฃ โˆˆ ๐‘œ , Pr เทœ ๐‘ฆ๐‘ฃ โˆ’ ๐‘ฆ๐‘ฃ

โˆ— โ‰ค 1 log ๐‘œ ๐‘ฆโˆ— โˆž โ‰ฅ 3 4,

for ๐‘ฆโˆ— = ๐‘€๐ป

+๐‘.

๏ฎ

In particular, ๐ป may be preprocessed

On Solving Linear Systems in Sublinear Time 8

slide-9
SLIDE 9

Algorithmic Techniques

๏ฎ

Famous Monte-Carlo method of von Neumann and Ulam: Write matrix inverse by power series โˆ€ ๐‘Œ < 1, ๐ฝ โˆ’ ๐‘Œ โˆ’1 = ฯƒ๐‘ขโ‰ฅ0 ๐‘Œ๐‘ข then estimate it by random walks (in ๐‘Œ) with unbiased expectation

๏ฎ

Inverting a Laplacian ๐‘€๐ป = ๐‘’๐ฝ โˆ’ ๐ต corresponds to summing walks in ๐ป

๏ฑ

For us: view ๐‘“๐‘ฃ

๐‘ˆ ฯƒ๐‘ขโ‰ฅ0 ๐ต๐‘ข๐‘ as sum over all walks, estimate it by sampling

(random walks)

๏ฎ

Need to control: number of walks and their length

๏ฑ

Large powers ๐‘ข > ๐‘ขโˆ— contribute relatively little (by condition number)

๏ฑ

Estimate truncated series (๐‘ข โ‰ค ๐‘ขโˆ—) by short random walks (by Chebyshevโ€™s inequality)

On Solving Linear Systems in Sublinear Time 9

slide-10
SLIDE 10

Related Work โ€“ All Algorithmic

๏ฎ

Similar techniques were used before in related contexts but under different assumptions, models and analyses:

๏ฑ

Probabilistic log-space algorithms for approximating ๐‘€๐ป

+ [Doron-Le Gall-

Ta-Shmaโ€™17]

๏ฎ

Asks for entire matrix, uses many long random walks (independent of ๐œ†)

๏ฑ

Local solver for Laplacian systems with boundary conditions [Chung- Simpsonโ€™15]

๏ฎ

Solver relies on a different power series and random walks

๏ฑ

Local solver for PSD systems [Shyamkumar-Banerjee-Lofgrenโ€™16]

๏ฎ

Polynomial time nnz ๐‘‡ 2/3 under assumptions like bounded matrix norm and random ๐‘ฃ โˆˆ ๐‘œ

๏ฑ

Local solver for Pagerank [Bressan-Peserico-Prettoโ€™18, Borgs-Brautbar- Chayes-Tengโ€™14]

๏ฎ

Polynomial time O(๐‘œ2/3) and O( nd 1/2) for certain matrices (non-symmetric but by definition are diagonally-dominant)

On Solving Linear Systems in Sublinear Time 10

slide-11
SLIDE 11

Lower Bound Techniques

๏ฎ

PSD lower bound: Take Laplacian of 2๐‘’-regular expander but with:

๏ฑ

high girth,

๏ฑ

edges signed ยฑ1 at random, and

๏ฑ

๐‘ƒ( ๐‘’) on the diagonal (PSD but not Laplacian)

๏ฎ

The graph looks like a tree locally

๏ฑ

Up to radius ฮ˜ log ๐‘œ around ๐‘ฃ

๏ฎ

Set ๐‘๐‘ฅ = ยฑ1 for ๐‘ฅ at distance ๐‘ , and 0 otherwise

๏ฑ

Signs have small bias ๐œ€ โ‰ˆ ๐‘’โˆ’๐‘ /2

๏ฑ

Recovering it requires reading ฮฉ(๐œ€โˆ’2) entries

๏ฎ

Using inversion formula, ๐‘ฆ๐‘ฃ โ‰ˆ average of ๐‘๐‘ฅโ€˜s

๏ฎ

Condition number lower bound: Take two 3-regular expanders connected by a matching of size ๐‘œ/๐œ†

๏ฑ

Let ๐‘๐‘ฅ = ยฑ1 with slight bias inside each expander

On Solving Linear Systems in Sublinear Time

๐‘  ๐‘๐‘ฅ = ยฑ1

๐‘๐‘ฅ = 0 ๐‘๐‘ฅ = 0

๐‘ฃ

11

slide-12
SLIDE 12

Further Questions

๏ฎ

Accuracy guarantees

๏ฑ

Different norms?

๏ฑ

Condition number of ๐‘‡ instead of ๐ธโˆ’1/2๐‘‡๐ธโˆ’1/2?

๏ฎ

Other representations (input/output models)?

๏ฑ

Access the input ๐‘ via random sampling?

๏ฑ

Sample from the output ๐‘ฆ?

๏ฎ

Other numerical problems?

On Solving Linear Systems in Sublinear Time

Thank You!

12