Pseudospectra of structured random matrices Oberwolfach, 2019/12/13 - PowerPoint PPT Presentation

Pseudospectra of structured random matrices Oberwolfach, 2019/12/13 Nicholas Cook, Stanford University Partly based on joint work with Alice Guionnet and Jonathan Husson

Outline 1. Geometric approach to RMT: successes and limitations 2. Spectral anti-concentration for structured Hermitian random matrices 3. Pseudospectra of i-non-id matrices 4. Pseudospectra and convergence to Brown measure for quadratic polynomials in Ginibre matrices (linearization � pseudospectra for patterned block random matrices). 1

Geometric approach to RMT Family of techniques originating from the local theory of Banach spaces (Grothendieck, Dvoretzky, Lindenstrauss, Milman, Schechtman . . . ). Modern reference: Vershynin’s text High dimensional probability . Can often get quantitative bounds at finite N that are within a constant factor of the asymptotic truth, with arguments that are more flexible. √ E.g. can show � X � op = O ( N ) w.h.p. for X an iid matrix with sub-Gaussian √ entries with a simple net argument and concentration. Compare ∼ 2 N by the trace method. √ ∗ (For X GinOE can even get the right constant E � X � op ≤ 2 N using Slepian’s inequality!) Net arguments and anti-concentration have been key for controlling the invertibility / condition number / pseudospectrum of random matrices. 2

Spectral anti-concentration Consider H an N × N Hermitian random matrix with { H ij } i ≤ j independent, centered, sub-Gaussian with variances σ 2 ij ∈ [0 , 1]. Denote by Σ = ( σ ij ) N i , j =1 the standard deviation profile. How many eigenvalues can lie in an interval I ⊂ R ? Under what conditions on Σ can we show ∀I ⊂ R , |I| ≥ N − 1+ ε µ N H ( I ) � |I| 1 √ with high probability (w.h.p.)? Local semicircle law (Erd˝ os–Schlein–Yau ’08) Suppose σ ij ≡ 1. With high probability, for any interval I ⊂ R with |I| ≥ N − 1+ ε , | µ N H ( I ) − µ sc ( I ) | = o ( |I| ) . 1 √ Extended to non-constant variance by Ajanki, Erd˝ os & Kr¨ uger through careful analysis of associated vector Dyson equations. 3

Spectral anti-concentration With spt(Σ) = { ( i , j ) ∈ [ N ] 2 : σ ij ≥ σ 0 } (some fixed cutoff σ 0 > 0) say Σ is • δ - broadly connected if ∀ I , J ⊂ [ N ] with | I | + | J | ≥ N , | spt(Σ) ∩ ( I × J ) | ≥ δ | I || J | (Rudelson–Zeitouni ’13); • δ - robustly irreducible if ∀ J ⊂ [ N ], | spt(Σ) ∩ ( J × J c ) | ≥ δ | J || J c | . Robust irreducibility permits µ N H to have an atom at zero. 1 √ Theorem (C. ’17, unpublished) 1. Fix δ > 0 and suppose Σ is δ -broadly connected. Then w.h.p., for any I ⊂ R with |I| ≥ C log N we have µ N H ( I ) � δ |I| . 1 N √ 2. Fix δ, κ > 0 and suppose Σ is δ -robustly irreducible. Then w.h.p., for any I ⊂ R \ ( − κ, κ ) with |I| ≥ C log N we have µ N H ( I ) � δ,κ |I| . 1 N √ Related result of C.–Hachem–Najim–Renfrew ’16 for deterministic equivalents. Can reach intervals of length N − 1 √ log N using Bourgain–Tzafriri’s restricted invertibility theorem as in independent work of Nguyen for case σ ij ≡ 1. Same strategy can be applied to e.g. H 1 H 2 + H 2 H 1 (local law by Anderson ’15). 4 Cf. Banna–Mai ’18 on H¨ older-regularity for distribution of NC-polynomials.

Pseudospectrum (Already came up in talks of Capitaine, Fyodorov, Zeitouni and Vogel.) For A ∈ M N ( C ), λ ∈ Λ( A ) is a qualitative statement. More useful: For ε > 0 the ε -pseudospectrum is the set Λ ε ( A ) = Λ( A ) ∪ { z ∈ C : � ( A − z ) − 1 � op ≥ 1 /ε } = { z ∈ C : ∃ E with � E � op ≤ ε and z ∈ Λ( A + E ) } . For A normal ( A ∗ A = AA ∗ ), Λ ε ( A ) = Λ( A ) + ε D . (We always have Λ ε ( A ) ⊇ Λ( A ) + ε D .) In particular, the spectrum of normal operators is stable: the spectrum is in a sense a 1-Lipschitz function of the matrix. This can be extremely untrue for non-normal matrices! 5

The standard example: Left shift operator on C N   0 1 0 0 · · · 0 0 1 0 · · ·   ∗ T N = − → Haar unitary element u ∈ ( A , τ ) .   · · ·     0 1 · · · 0 0 · · · Λ ε ( T N ) → D for ε = e − o ( N ) , ESDs ≡ δ 0 , Brown measure = Unif( ∂ D ). Eigenvalues of T N + N − 10 X N , with X N GinOE. (Figure by Phil Wood.) 6

Pseudospectra of random matrices Pseudospectrum of a random non-normal matrix is not so large. For iid matrix X with sub-Gaussian entries, � 1 � 1 � − 1 � � �� N ε + e − cN P z ∈ Λ ε √ X = P √ X − z op ≥ 1 /ε � � � � N N for any fixed z ∈ C ( ≈ Rudelson–Vershynin ’07). 1 X )) � N ε + e − cN . In particular E Leb(Λ ε ( √ Improves to N 2 ε 2 for complex entries with independent real and imaginary parts [Luh ’17] or real matrices with dist( z , R ) � 1 [Ge ’17]. Compare deterministic bound Leb(Λ ε ( A )) ≤ π N ε 2 for normal matrices. Pseudospectrum related to eigenvalue condition numbers (talk of Fyodorov): Leb(Λ ε ( M ) ∩ Ω) ∼ πε 2 � κ j ( M ) 2 as ε → 0 . j : λ j ∈ Ω 7

Pseudospectra of structured random matrices Applications to complex dynamical systems motivate understanding spectra and pseudospectra of sparse random matrices with non-iid entries (recall talk of David Renfrew). Theorem (C. ’16) Let X have independent, centered entries of arbitrary variances σ 2 ij ∈ [0 , 1], 4 + ε moments. For any z � = 0, � 1 � − 1 � � op ≤ N C ( | z | ,ε ) with probability 1 − O ( N − c ( ε ) ) . √ X − z � � � � N ∗ C ( | z | , ε ) = twr(exp(1 / | z | O (1) )) . . . Please improve! ∗ Conjecture: same holds with z replaced by any M with s min ( M ) � 1. ∗ Assuming entries of bounded density, can improve probability bound to 1 − O ( N − K ) for arbitrary K > 0. Main difficulty is to allow σ ij = 0. This is a key ingredient for proof of the inhomogeneous circular law [C.–Hachem–Najim–Renfrew ’16]. (Easier argument suffices for local law of [Alt–Erd˝ os–Kr¨ uger ’16] since they 8 assume σ ij � 1 and bounded density. Cf. survey of Bordenave & Chafai ’11.)

Pseudospectrum for quadratic polynomials in Ginibre matrices Now let X denote a (complex) N × N Ginibre matrix having iid entries X ij ∼ N C (0 , 1 / N ). Theorem (C.–Guionnet–Husson ’19) Let m ≥ 1 and let p be a quadratic polynomial in non-commutative variables x 1 , . . . , x m . Let N ≥ 2 and X 1 , . . . , X m be iid N × N Ginibre matrices. Set P = p ( X 1 , . . . , X m ). For any z ∈ C and any ε > 0, � � � ( P − z ) − 1 � op ≥ 1 ≤ N C ε c + e − cN P { z ∈ Λ ε ( P ) } = P ε for constants C , c > 0 depending only on p . 9

Motivation: convergence of ESDs • Proofs of limits for the ESDs µ X := 1 � N j =1 δ λ j ( X ) of non-normal random N matrices X = X N hinge upon control of the pseudospectrum. In particular, the problem of the pseudospectrum is the reason non-Hermitian RMT has lagged behind the theory for Wigner matrices. ∗ A key idea: Hermitization • Hermitian polynomials – some highlights: • Haagerup–Thorbjørnsen ’05: No outliers (recent alternative proof by Collins–Guionnet–Parraud). Extensions by many authors. • Anderson ’15: local law for the anti-commutator H 1 H 2 + H 2 H 1 of independent Wigner matrices. • Erd˝ os–Kr¨ uger–Nemish ’18: local law for polynomials satisfying a technical condition (includes homogeneous quadratic polynomials and symmetrized monomials in iid matrices X 1 X 2 · · · X m X ∗ m · · · X 2 X 1 ). • Products of independent iid matrices: limiting ESDs (G¨ otze–Tikhomirov and O’Rourke–Soshnikov ’10). No outliers and local law (Nemish ’16, ’17). ∗ A key idea: Linearization 10

Hermitization • One can encode the ESD of a non-normal M ∈ M N ( C ) in a family of ESDs of � Hermitian matrices | M − z | = ( M − z ) ∗ ( M − z ) as follows: � ∞ N µ M = 1 δ λ j ( M ) = 1 � 2 π ∆ z log( s ) µ | M − z | ( ds ) . N 0 j =1 • So it seems we can recover limit of µ X N if we know the limits of ESDs of the family of Hermitian matrices {| X N − z |} z ∈ C . • But not quite! Possible escape of mass to zero: Pseudospectrum • Bai ’97 controlled the pseudospectrum of iid matrices (under some technical assumptions) and obtained the Circular Law. Assumptions relaxed in works of G¨ otze–Tikhomirov ’07, Pan–Zhou ’07, Tao–Vu ’07, ’08. 11

Brown measure • Free probability gives tools to calculate limiting ESDs for polynomials in independent random matrices, at least if they’re normal (e.g. XY + YX for X , Y iid Wigner). • For a normal element a of a non-commutative probability space ( A , τ ), the spectral theorem provides us with a spectral measure µ a determined by the ∗ -moments τ ( a k ( a ∗ ) l ). • For general (non-normal) elements a , can define the Brown measure : � ∞ ν a := 1 2 π ∆ z log( s ) µ | a − z | ( ds ) 0 which is determined by the ∗ -moments ( | a − z | is self-adjoint). • If A N converge in ∗ -moments to a , it doesn’t follow that µ A N converge weakly to ν a (Brown measure isn’t continuous in this topology). • Question: If A N are non-normal random matrices, do the ESDs converge to the Brown measure? (Answer is yes for single iid matrix X N .) 12

Pseudospectra of structured random matrices Oberwolfach, 2019/12/13 - PowerPoint PPT Presentation

Pseudospectra of structured random matrices Oberwolfach, 2019/12/13 Nicholas Cook, Stanford University Partly based on joint work with Alice Guionnet and Jonathan Husson Outline 1. Geometric approach to RMT: successes and limitations 2.

PSEUDOSPECTRA o Application of eigenvalue o Pseudospectra definition D E F I N I T I O N S A N D

Results for different matrices and comparisons Dense Matrices Rectangular Matrices

MATHEMATICS 1 CONTENTS Matrices Special matrices Operations with matrices Matrix

Trogir Summer School Pseudospectra Exercises 1. Getting to know EigTool. (a) Download EigTool (

A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

JUST THE MATHS SLIDES NUMBER 9.10 MATRICES 10 (Symmetric matrices & quadratic forms)

Random Vectors, Random Matrices, and Matrix Expected Value James H. Steiger Department of

Random Numbers RANDOM VS PSEUDO RANDOM Truly Random numbers From Wolfram: A random number

Operator limits of random matrices I. Stochastic Airy Brian Rider Temple University Brian Rider

Spectral rigidity for addition of random matrices at the regular edge Zhigang Bao HKUST Random

Gaussian Free Field in (self-adjoint) random matrices and random surfaces Alexei Borodin Corners

The separation of two matrices and its application in eigenvalue perturbation theory Michael

Scaling Log-Structured KV-Stores featuring Monkey and Dostoevsky SIGMOD17 / SIGMOD18 Niv Dayan

Machine Learning Fall 2017 Structured Prediction (structured perceptron, HMM, structured SVM)

Matrices with Application to Page Rank Markov Matrices Pagerank Anil Maheshwari

Cache-Oblivious Algorithms 1 Cache-Oblivious Model 2 The Unknown Machine Algorithm Algorithm

Congruences for Fishburn numbers modulo prime powers Partitions, q -series, and modular forms AMS

Improving Word Alignment With Bridge Languages Shankar Kumar Joint Work with Franz Och and

Optimization and Dynamical Systems: Variational, Hamiltonian, and Symplectic Perspectives

Some Recent Advances in Nonnegative Matrix Factorization and their Applications to Hyperspectral

A smoothing majorization method for l 2 2 - l p p matrix minimization Liwei Zhang Dalian

Transfer Matrix Method G. Eric Moorhouse, University of Wyoming Reference: Transfer Matrix

Von Dyck symmetries and lepton mixing Daniel Hernndez In colab. with A. Yu. Smirnov; 1204.0445

Sambuz

Useful Links

Newsletter

Mail Us

Pseudospectra of structured random matrices Oberwolfach, 2019/12/13 - PowerPoint PPT Presentation

Pseudospectra of structured random matrices Oberwolfach, 2019/12/13 Nicholas Cook, Stanford University Partly based on joint work with Alice Guionnet and Jonathan Husson Outline 1. Geometric approach to RMT: successes and limitations 2.

PSEUDOSPECTRA o Application of eigenvalue o Pseudospectra definition D E F I N I T I O N S A N D

Results for different matrices and comparisons Dense Matrices Rectangular Matrices

MATHEMATICS 1 CONTENTS Matrices Special matrices Operations with matrices Matrix

Trogir Summer School Pseudospectra Exercises 1. Getting to know EigTool. (a) Download EigTool (

A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

JUST THE MATHS SLIDES NUMBER 9.10 MATRICES 10 (Symmetric matrices &amp; quadratic forms)

Random Vectors, Random Matrices, and Matrix Expected Value James H. Steiger Department of

Random Numbers RANDOM VS PSEUDO RANDOM Truly Random numbers From Wolfram: A random number

Operator limits of random matrices I. Stochastic Airy Brian Rider Temple University Brian Rider

Spectral rigidity for addition of random matrices at the regular edge Zhigang Bao HKUST Random

Gaussian Free Field in (self-adjoint) random matrices and random surfaces Alexei Borodin Corners

The separation of two matrices and its application in eigenvalue perturbation theory Michael

Scaling Log-Structured KV-Stores featuring Monkey and Dostoevsky SIGMOD17 / SIGMOD18 Niv Dayan

Machine Learning Fall 2017 Structured Prediction (structured perceptron, HMM, structured SVM)

Matrices with Application to Page Rank Markov Matrices Pagerank Anil Maheshwari

Cache-Oblivious Algorithms 1 Cache-Oblivious Model 2 The Unknown Machine Algorithm Algorithm

Congruences for Fishburn numbers modulo prime powers Partitions, q -series, and modular forms AMS

Improving Word Alignment With Bridge Languages Shankar Kumar Joint Work with Franz Och and

Optimization and Dynamical Systems: Variational, Hamiltonian, and Symplectic Perspectives

Some Recent Advances in Nonnegative Matrix Factorization and their Applications to Hyperspectral

A smoothing majorization method for l 2 2 - l p p matrix minimization Liwei Zhang Dalian

Transfer Matrix Method G. Eric Moorhouse, University of Wyoming Reference: Transfer Matrix

Von Dyck symmetries and lepton mixing Daniel Hernndez In colab. with A. Yu. Smirnov; 1204.0445

Sambuz

Useful Links

Newsletter

Mail Us

JUST THE MATHS SLIDES NUMBER 9.10 MATRICES 10 (Symmetric matrices & quadratic forms)