SLIDE 1

Estimation in the Presence of Group Actions

Alex Wein MIT Mathematics

SLIDE 2

Joint work with:

Amelia Perry 1991 – 2018

SLIDE 3

Joint work with:

Afonso Bandeira, Jonathan Weed, Ben Blum-Smith, Ankur Moitra

SLIDE 4

Group actions

G – compact group, e.g.

◮ S_n (permutations of {1, 2, . . . , n})
◮ Z/n (cyclic / integers mod n)
◮ any finite group
◮ SO(2) (2D rotations)
◮ SO(3) (3D rotations)

Group action G ↷ V: map G × V → V, write g · x
Axioms: 1 · x = x and g · (h · x) = (gh) · x

◮ S_n ↷ R^n (permute coordinates)
◮ Z/n ↷ R^n (permute coordinates cyclically)
◮ SO(2) ↷ R^2 (rotate vector)
◮ SO(3) ↷ R^3 (rotate vector)
◮ SO(3) ↷ R^n (rotate some object...)

SLIDE 5

Motivation: cryo-electron microscopy (cryo-EM)

Image credit: [Singer, Shkolnisky ’11]

◮ Biological imaging method: determine structure of molecule
◮ 2017 Nobel Prize in Chemistry
◮ Given many noisy 2D images of a 3D molecule, taken from different unknown angles
◮ Goal is to reconstruct the 3D structure of the molecule
◮ Group action SO(3) ↷ R^n

SLIDE 6

Other examples

Other problems involving random group actions:

◮ Image registration

Image credit: [Bandeira, PhD thesis ’15]

Group: SO(2) (2D rotations)

◮ Multi-reference alignment

Image credit: Jonathan Weed

Group: Z/p (cyclic shifts)

◮ Applications: computer vision, radar, structural biology, robotics, geology, paleontology, ...

◮ Methods used in practice often lack provable guarantees...

SLIDE 7

Orbit recovery problem

Let G be a compact group acting linearly on a finite-dimensional real vector space V = R^p.

◮ Linear: homomorphism ρ : G → GL(V), where GL(V) = {invertible p × p matrices}

◮ Action: g · x = ρ(g)x for g ∈ G, x ∈ V

◮ Equivalently: G is (identified with) a subgroup of the matrix group GL(V)

SLIDE 8

Orbit recovery problem

Let G be a compact group acting linearly on a finite-dimensional real vector space V = R^p.
Unknown signal x ∈ V (e.g. the molecule)
For i = 1, . . . , n observe y_i = g_i · x + ε_i where. . .

◮ g_i ∼ Haar(G) (“uniform distribution” on G)

◮ ε_i ∼ N(0, σ^2 I_p) (noise)

Goal: Recover some x̃ in the orbit {g · x : g ∈ G} of x
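To make the model concrete, here is a minimal numpy sketch of this observation process, specialized to the cyclic-shift (MRA) case of the next slides; the helper name and parameter values are illustrative placeholders, not from the talk.

```python
import numpy as np

def sample_mra_observations(x, n, sigma, seed=None):
    """Draw n observations y_i = g_i . x + eps_i for the cyclic-shift group Z/p:
    g_i is a uniformly random shift (Haar measure on Z/p), eps_i ~ N(0, sigma^2 I_p)."""
    rng = np.random.default_rng(seed)
    p = len(x)
    shifts = rng.integers(0, p, size=n)               # g_i ~ Haar(Z/p)
    noise = sigma * rng.standard_normal((n, p))       # eps_i ~ N(0, sigma^2 I)
    return np.array([np.roll(x, s) for s in shifts]) + noise

# toy instance: p = 10, high noise
x = np.random.default_rng(0).standard_normal(10)
ys = sample_mra_observations(x, n=100_000, sigma=2.0, seed=1)
```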

SLIDE 9

Special case: multi-reference alignment (MRA)

G = Z/p acts on R^p via cyclic shifts
For i = 1, . . . , n observe y_i = g_i · x + ε_i with ε_i ∼ N(0, σ^2 I)

Image credit: Jonathan Weed

SLIDE 10

Special case: multi-reference alignment (MRA)

G = Z/p acts on R^p via cyclic shifts
For i = 1, . . . , n observe y_i = g_i · x + ε_i with ε_i ∼ N(0, σ^2 I)

How to solve this? Maximum likelihood?

◮ Optimal rate but computationally intractable [1]

Synchronization? (learn the group elements / align the samples) [2]

◮ Can’t learn the group elements if noise is too large

Iterative method? (EM, belief propagation)

◮ Not sure how to analyze...

[1] Bandeira, Rigollet, Weed, Optimal rates of estimation for multi-reference alignment, 2017
[2] Singer, Angular synchronization by eigenvectors and semidefinite programming, 2011

SLIDE 11

Method of invariants

Idea: measure features of the signal x that are shift-invariant [1,2]

Degree-1: ∑_i x_i (mean)

Degree-2: ∑_i x_i^2, x_1 x_2 + x_2 x_3 + · · · + x_p x_1, . . . (autocorrelation)

Degree-3: x_1 x_2 x_4 + x_2 x_3 x_5 + · · · (triple correlation)

Invariant features are easy to estimate from the samples (see the sketch below)

[1] Bandeira, Rigollet, Weed, Optimal rates of estimation for multi-reference alignment, 2017
[2] Perry, Weed, Bandeira, Rigollet, Singer, The sample complexity of multi-reference alignment, 2017
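A rough numpy sketch of how such shift-invariant features can be estimated by averaging over samples, reusing the hypothetical sample_mra_observations helper sketched earlier; the particular feature slices chosen here are illustrative.

```python
import numpy as np

def empirical_invariants(ys, sigma):
    """Estimate shift-invariant features of x by averaging over samples y_i = g_i.x + eps_i.

    Averaging over samples also averages over the random shifts, so each feature of y
    converges to the corresponding invariant of x (only the lag-0 autocorrelation picks
    up a noise bias, which is subtracted here)."""
    n, p = ys.shape
    mean = ys.sum(axis=1).mean()                                   # degree 1: sum_i x_i
    autocorr = np.array([(ys * np.roll(ys, -k, axis=1)).sum(axis=1).mean()
                         for k in range(p)])                       # degree 2, all lags
    autocorr[0] -= p * sigma**2                                    # de-bias sum_i x_i^2
    triple = (ys * np.roll(ys, -1, axis=1)
                 * np.roll(ys, -3, axis=1)).sum(axis=1).mean()     # degree 3, lags (1, 3)
    return mean, autocorr, triple
```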

SLIDE 12

Sample complexity

Theorem [1]:
(Upper bound) With noise level σ, can estimate degree-d invariants using n = O(σ^{2d}) samples.
(Lower bound) If x^{(1)}, x^{(2)} agree on all invariants of degree ≤ d − 1, then Ω(σ^{2d}) samples are required to distinguish them.

◮ Method of invariants is optimal

Question: What degree d∗ of invariants do we need to learn before we can recover x (up to orbit)?

◮ Optimal sample complexity is n = Θ(σ^{2d∗})

Answer (for MRA) [1]:

◮ For “generic” x, degree 3 is sufficient, so sample complexity n = Θ(σ^6)
◮ But for a measure-zero set of “bad” signals, need much higher degree (as high as p)

[1] Bandeira, Rigollet, Weed, Optimal rates of estimation for multi-reference alignment, 2017

SLIDE 13

Another viewpoint: mixtures of Gaussians

MRA sample: y = g · x + ε with g ∼ G, ε ∼ N(0, σ^2 I)
The distribution of y is a (uniform) mixture of |G| Gaussians centered at {g · x : g ∈ G}

◮ For infinite groups, a mixture of infinitely-many Gaussians

Method of moments: Estimate moments E[y], E[yy⊤], . . . , E[y^{⊗d}]
De-bias to get moments of signal term: E[y^{⊗k}] → E_g[(g · x)^{⊗k}]

Fact: Moments are equivalent to invariants

◮ E_g[(g · x)^{⊗k}] contains the same information as the degree-k invariant polynomials
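A sketch of this de-biasing for the second and third moments, assuming the noise level σ is known; the identities in the comments follow from the independence of the signal term and the Gaussian noise.

```python
import numpy as np

def debiased_moments(ys, sigma):
    """Estimate E_g[(g.x)^{(x)2}] and E_g[(g.x)^{(x)3}] from samples y = s + eps, s = g.x.

    For eps ~ N(0, sigma^2 I) independent of s:
      E[y(x)y]    = E[s(x)s]    + sigma^2 I
      E[y(x)y(x)y] = E[s(x)s(x)s] + sigma^2 (E[s](x)I + the two other symmetric placements)"""
    n, p = ys.shape
    m1 = ys.mean(axis=0)                                   # E[y] = E[s]
    m2 = np.einsum('ni,nj->ij', ys, ys) / n - sigma**2 * np.eye(p)
    m3 = np.einsum('ni,nj,nk->ijk', ys, ys, ys) / n
    eye = np.eye(p)
    m3 -= sigma**2 * (np.einsum('i,jk->ijk', m1, eye)
                      + np.einsum('j,ik->ijk', m1, eye)
                      + np.einsum('k,ij->ijk', m1, eye))
    return m1, m2, m3
```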

SLIDE 14

Our contributions

Joint work with Ben Blum-Smith, Afonso Bandeira, Amelia Perry, Jonathan Weed [1]

◮ We generalize from MRA to any compact group
◮ Again, the method of invariants/moments is optimal
◮ Independently by [2]
◮ We give an (inefficient) algorithm that achieves optimal sample complexity: solve a polynomial system
◮ To determine what degree of invariants is required, we use invariant theory and algebraic geometry

[1] Bandeira, Blum-Smith, Perry, Weed, W., Estimation under group actions: recovering orbits from invariants, 2017
[2] Abbe, Pereira, Singer, Estimation in the group action channel, 2018

SLIDE 15

Invariant theory

Variables x_1, . . . , x_p (corresponding to the coordinates of x)
The invariant ring R[x]^G is the subring of R[x] := R[x_1, . . . , x_p] consisting of polynomials f such that f(g · x) = f(x) ∀g ∈ G.

◮ Aside: a main result of invariant theory is that R[x]^G is finitely generated

R[x]^G_{≤d} – invariants of degree ≤ d

(Simple) algorithm:

◮ Pick d∗ (to be chosen later)
◮ Using Θ(σ^{2d∗}) samples, estimate invariants up to degree d∗: learn the value f(x) for all f ∈ R[x]^G_{≤d∗}
◮ Solve for an x̂ that is consistent with those values: f(x̂) = f(x) ∀f ∈ R[x]^G_{≤d∗} (a polynomial system of equations)
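One simple way to carry out the final "solve a polynomial system" step numerically, here for the cyclic-shift case with d∗ = 3, is nonlinear least squares with random restarts; this is only an illustrative sketch, not the algorithm analyzed in the paper, and the helper names are hypothetical.

```python
import numpy as np
from scipy.optimize import least_squares

def invariants_upto_deg3(x):
    """Cyclic-shift invariants of degree <= 3 evaluated at x (one feature per lag pattern)."""
    p = len(x)
    deg1 = [x.sum()]
    deg2 = [(x * np.roll(x, -k)).sum() for k in range(p)]
    deg3 = [(x * np.roll(x, -k) * np.roll(x, -l)).sum()
            for k in range(p) for l in range(k, p)]
    return np.array(deg1 + deg2 + deg3)

def solve_for_signal(target_values, p, n_restarts=20, seed=0):
    """Find x_hat whose invariants match the (de-biased) estimates, by least squares."""
    rng = np.random.default_rng(seed)
    best = None
    for _ in range(n_restarts):  # random restarts: the polynomial system is nonconvex
        res = least_squares(lambda x: invariants_upto_deg3(x) - target_values,
                            x0=rng.standard_normal(p))
        if best is None or res.cost < best.cost:
            best = res
    return best.x
```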

SLIDE 16

Example: norm recovery

G = SO(3) acting on R^3 (by rotation)
Samples: noisy, randomly-rotated copies of x ∈ R^3
To learn the orbit, need to learn ‖x‖
Invariant ring is generated by ‖x‖^2 = ∑_i x_i^2

◮ d∗ = 2

Sample complexity Θ(σ^{2d∗}) = Θ(σ^4)
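A minimal sketch of this example: rotations preserve the norm, so de-bias the empirical second moment and take a square root (assumes σ is known).

```python
import numpy as np

def estimate_norm(ys, sigma):
    """Estimate ||x|| from noisy rotated samples y_i = g_i.x + eps_i in R^3.

    Rotations preserve the norm and E||y||^2 = ||x||^2 + 3*sigma^2,
    so subtract the noise contribution before taking the square root."""
    p = ys.shape[1]                       # here p = 3
    m2 = (ys ** 2).sum(axis=1).mean()     # empirical E||y||^2
    return np.sqrt(max(m2 - p * sigma**2, 0.0))
```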

SLIDE 17

Example: learning a “bag of numbers”

G = S_p acting on R^p (by permuting coordinates)
Samples: noisy copies of x ∈ R^p with entries permuted randomly
To learn the orbit, need to learn the multiset {x_i}_{i∈[p]}
Invariants are the symmetric polynomials

◮ Generated by elementary symmetric polynomials:
e_1 = ∑_i x_i,  e_2 = ∑_{i<j} x_i x_j,  e_3 = ∑_{i<j<k} x_i x_j x_k,  . . .

Can’t learn e_p = ∏_{i=1}^p x_i until degree p

◮ d∗ = p, so sample complexity Θ(σ^{2p})
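As an aside illustrating why the elementary symmetric polynomials determine the multiset: by Vieta's formulas they are (up to sign) the coefficients of ∏_i (t − x_i), so the multiset can be read off as polynomial roots. A small sketch:

```python
import numpy as np
from itertools import combinations

def multiset_from_elementary_symmetric(e):
    """Recover the multiset {x_i} from e_1, ..., e_p.

    By Vieta, prod_i (t - x_i) = t^p - e_1 t^{p-1} + e_2 t^{p-2} - ...,
    so the x_i are exactly the roots of that polynomial."""
    p = len(e)
    coeffs = np.concatenate(([1.0], [(-1) ** (k + 1) * e[k] for k in range(p)]))
    return np.sort(np.roots(coeffs).real)

# sanity check on a random bag of numbers (should print True)
x = np.random.default_rng(0).standard_normal(5)
e = np.array([sum(np.prod(c) for c in combinations(x, k)) for k in range(1, 6)])
print(np.allclose(multiset_from_elementary_symmetric(e), np.sort(x)))
```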

SLIDE 18

All invariants determine orbit

Theorem [1]: If G is compact, then for every x ∈ V, the full invariant ring R[x]^G determines x up to orbit.

◮ In the sense that if x, x′ do not lie in the same orbit, there exists f ∈ R[x]^G that separates them: f(x) ≠ f(x′)

Corollary: Suppose that for some d, R[x]^G_{≤d} generates R[x]^G (as an R-algebra). Then R[x]^G_{≤d} determines x up to orbit, and so the sample complexity is O(σ^{2d}).

Problem: This is for worst-case x ∈ V. For MRA (cyclic shifts) this requires d = p, whereas generic x only requires d = 3 [2].

We actually care about whether R[x]^G_{≤d} generically determines R[x]^G

◮ “Generic” means that x lies outside a particular measure-zero “bad” set.

[1] Kač, Invariant theory lecture notes, 1994
[2] Bandeira, Rigollet, Weed, Optimal rates of estimation for multi-reference alignment, 2017

SLIDE 19

Do polynomials generically determine other polynomials?

Say we have A ⊆ B ⊆ R[x]

◮ (Technically need to assume B is finitely generated)

Question: Do the values {a(x) : a ∈ A} generically determine the values {b(x) : b ∈ B}?

◮ Formally: does there exist a full-measure set S ⊆ V such that if x ∈ S (“generic”) then any x′ ∈ V satisfying a(x) = a(x′) ∀a ∈ A also satisfies b(x) = b(x′) ∀b ∈ B?

Definition: Polynomials f_1, . . . , f_m are algebraically independent if there is no nonzero P ∈ R[y_1, . . . , y_m] with P(f_1, . . . , f_m) ≡ 0.

Definition: For U ⊆ R[x], the transcendence degree trdeg(U) is the maximal number of algebraically independent polynomials in U.

SLIDE 20

Do polynomials generically determine other polynomials?

Definition: For U ⊆ R[x], the transcendence degree trdeg(U) is the maximal number of algebraically independent polynomials in U.

Answer: Suppose trdeg(A) = trdeg(B). If x is generic, then the values {a(x) : a ∈ A} determine a finite number of possibilities for the entire collection {b(x) : b ∈ B}.

◮ Formally: for generic x there is a finite list x^{(1)}, . . . , x^{(s)} such that for any x′ satisfying a(x) = a(x′) ∀a ∈ A, there exists i such that b(x^{(i)}) = b(x′) ∀b ∈ B

A determines B (up to finite ambiguity) if A has as many algebraically independent polynomials as B

◮ Intuition: algebraically independent polynomials are “degrees-of-freedom”

SLIDE 21

Testing algebraic independence

Given polynomials f_1, . . . , f_m ∈ R[x_1, . . . , x_p], can you efficiently test whether they are algebraically independent?

Answer: yes!

Theorem (Jacobian criterion): Polynomials f_1, . . . , f_m ∈ R[x_1, . . . , x_p] are algebraically independent if and only if the m × p Jacobian matrix J_{ij} = ∂f_i/∂x_j has full row rank. (Still true if you evaluate J at a generic point x.)

◮ Why: Tests whether the map (x_1, . . . , x_p) → (f_1(x), . . . , f_m(x)) is locally surjective
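A small sympy/numpy sketch of the Jacobian criterion; evaluating the Jacobian at a random point gives its generic rank with probability 1. The helper name and the two examples are illustrative.

```python
import numpy as np
import sympy as sp

def algebraically_independent(polys, variables, seed=0):
    """Jacobian criterion: the m polynomials are algebraically independent
    iff the m x p Jacobian J_ij = d f_i / d x_j has full row rank
    (checked here at a random point, which gives the generic rank)."""
    J = sp.Matrix(polys).jacobian(sp.Matrix(variables))
    rng = np.random.default_rng(seed)
    point = dict(zip(variables, rng.standard_normal(len(variables))))
    J_num = np.array(J.subs(point), dtype=float)
    return np.linalg.matrix_rank(J_num) == len(polys)

# power sums p1, p2, p3 in three variables are independent (prints True);
# {x + y, (x + y)^2} are dependent (prints False)
x, y, z = sp.symbols('x y z')
print(algebraically_independent([x + y + z, x**2 + y**2 + z**2, x**3 + y**3 + z**3], [x, y, z]))
print(algebraically_independent([x + y, x**2 + 2*x*y + y**2], [x, y]))
```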

SLIDE 22

Generic list recovery

Our main result is an efficient procedure that takes the problem setup as input (group G and action on V ) and outputs the degree d∗ of invariants required for generic list recovery.

◮ List recovery: output a finite list x̂^{(1)}, x̂^{(2)}, . . . , one of which (approximately) lies in the orbit of the true x

◮ List recovery may be good enough in practice?

Procedure:

◮ Need to test whether R[x]^G_{≤d} determines R[x]^G (generically)
◮ So need to check if trdeg(R[x]^G_{≤d}) = trdeg(R[x]^G)
◮ trdeg(R[x]^G) = dim(x) − dim(orbit) (d.o.f. needed)
◮ trdeg(R[x]^G_{≤d}) via Jacobian criterion (d.o.f. have)
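A numerical sketch of this procedure for the cyclic-shift group: build a spanning set of invariants of degree ≤ d, estimate the Jacobian rank at a random point, and return the smallest d whose rank reaches trdeg(R[x]^G) = p (dim(orbit) = 0 for a finite group). The function names, finite-difference step, and rank tolerance are arbitrary choices for illustration.

```python
import numpy as np

def cyclic_invariants(x, max_deg):
    """Spanning set of cyclic-shift invariants of degree <= max_deg
    (group averages of monomials, indexed by lag patterns; implemented up to degree 3)."""
    p = len(x)
    feats = [x.sum()]
    if max_deg >= 2:
        feats += [(x * np.roll(x, -k)).sum() for k in range(p)]
    if max_deg >= 3:
        feats += [(x * np.roll(x, -k) * np.roll(x, -l)).sum()
                  for k in range(p) for l in range(k, p)]
    return np.array(feats)

def smallest_sufficient_degree(p, max_deg=3, seed=0, eps=1e-6):
    """Smallest d <= max_deg whose degree-<=d invariants already have transcendence
    degree p, estimated via the Jacobian rank at a random point (finite differences)."""
    rng = np.random.default_rng(seed)
    x0 = rng.standard_normal(p)
    for d in range(1, max_deg + 1):
        f0 = cyclic_invariants(x0, d)
        J = np.array([(cyclic_invariants(x0 + eps * e, d) - f0) / eps
                      for e in np.eye(p)]).T              # (num features) x p Jacobian
        if np.linalg.matrix_rank(J, tol=1e-4) == p:
            return d
    return None                                           # would need degree > max_deg

print(smallest_sufficient_degree(11))   # generically 3 for cyclic MRA (cf. the answer above)
```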

SLIDE 23

Generic list recovery


Comments:

◮ For e.g. MRA (cyclic shifts), need to test each p separately on a computer
◮ Not an efficient algorithm to solve any particular instance
◮ There is also an algorithm to bound the size of the list (or test for unique recovery), but it is not efficient (Gröbner bases)

SLIDE 24

Generalized orbit recovery problem

Extensions:

◮ Post-projection (e.g. cryo-EM):
  ◮ Observe y_i = Π(g_i · x) + ε_i
  ◮ Π : V → W linear
  ◮ ε_i ∼ N(0, σ^2 I)

◮ Heterogeneity (mixture of signals):
  ◮ K signals x^{(1)}, . . . , x^{(K)}
  ◮ Mixing weights (w_1, . . . , w_K) ∈ ∆_K
  ◮ Observe y_i = Π(g_i · x^{(k_i)}) + ε_i
  ◮ k_i ∼ {1, . . . , K} according to w

Same methods apply!

◮ Order-d moments now only give access to a particular subspace of R[x]^G
◮ For heterogeneity, work over a bigger group G^K acting on (x^{(1)}, . . . , x^{(K)}) ∈ V^{⊕K}
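A sketch extending the earlier hypothetical MRA simulator to this generalized model, with a placeholder coordinate projection Π (keep the first few coordinates) and two mixture components; all names and parameters are illustrative.

```python
import numpy as np

def sample_generalized(signals, weights, proj, n, sigma, seed=0):
    """Observations y_i = Pi(g_i . x^(k_i)) + eps_i for cyclic shifts,
    with mixture labels k_i ~ weights and a linear projection Pi (a matrix)."""
    rng = np.random.default_rng(seed)
    K, p = signals.shape
    labels = rng.choice(K, size=n, p=weights)            # k_i ~ w
    shifts = rng.integers(0, p, size=n)                  # g_i ~ Haar(Z/p)
    clean = np.array([np.roll(signals[k], s) for k, s in zip(labels, shifts)])
    return clean @ proj.T + sigma * rng.standard_normal((n, proj.shape[0]))

# toy instance: K = 2 signals in R^12, project onto the first 8 coordinates
rng = np.random.default_rng(0)
signals = rng.standard_normal((2, 12))
proj = np.eye(8, 12)                                     # placeholder Pi : V -> W
ys = sample_generalized(signals, [0.7, 0.3], proj, n=50_000, sigma=1.5)
```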

SLIDE 25

Results: cryo-EM

Our methods show that for cryo-EM, generic list recovery is possible at degree 3
So the information-theoretic sample complexity is Θ(σ^6)

Open: polynomial-time algorithm for cryo-EM

SLIDE 26

Efficient recovery: tensor decomposition

Restrict to finite group
Recall: with O(σ^6) samples, can estimate the third moment: T_3(x) = ∑_{g∈G} (g · x)^{⊗3}

This is an instance of tensor decomposition: given ∑_{i=1}^m a_i^{⊗3} for some a_1, . . . , a_m ∈ R^p, recover {a_i}

For MRA: since m ≤ p (“undercomplete”), can apply Jennrich’s algorithm to decompose the tensor efficiently [1]

◮ Note: unique (not list) recovery

[1] Perry, Weed, Bandeira, Rigollet, Singer, The sample complexity of multi-reference alignment, 2017
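A bare-bones sketch of Jennrich's algorithm for the noiseless, symmetric, undercomplete case (the analysis in [1] handles the noisy setting more carefully); the eigenvector-selection and scale-fitting steps are illustrative implementation choices.

```python
import numpy as np

def jennrich(T, m, seed=0):
    """Decompose T = sum_{i=1}^m a_i^{(x)3} (a_i linearly independent) via Jennrich.

    Contract T with two random vectors to get slices M_u, M_v; the eigenvectors of
    M_u pinv(M_v) with nonzero eigenvalue are the directions a_i (up to scale), and
    the scales are then fixed by a linear least-squares fit to the tensor."""
    p = T.shape[0]
    rng = np.random.default_rng(seed)
    u, v = rng.standard_normal(p), rng.standard_normal(p)
    Mu = np.einsum('ijk,k->ij', T, u)
    Mv = np.einsum('ijk,k->ij', T, v)
    eigvals, eigvecs = np.linalg.eig(Mu @ np.linalg.pinv(Mv))
    idx = np.argsort(-np.abs(eigvals))[:m]                # m largest |eigenvalues|
    dirs = np.real(eigvecs[:, idx]).T                     # rows: unit directions of a_i
    dirs /= np.linalg.norm(dirs, axis=1, keepdims=True)
    # fit coefficients c_i with sum_i c_i d_i^{(x)3} = T, then a_i = cbrt(c_i) d_i
    A = np.stack([np.einsum('i,j,k->ijk', d, d, d).ravel() for d in dirs], axis=1)
    c = np.linalg.lstsq(A, T.ravel(), rcond=None)[0]
    return dirs * np.cbrt(c)[:, None]

# sanity check on a random undercomplete instance: the decomposition should
# reconstruct T up to numerical error (prints True for this instance)
rng = np.random.default_rng(1)
a = rng.standard_normal((4, 10))                          # m = 4 components in R^10
T = sum(np.einsum('i,j,k->ijk', ai, ai, ai) for ai in a)
a_hat = jennrich(T, m=4)
T_hat = sum(np.einsum('i,j,k->ijk', b, b, b) for b in a_hat)
print(np.allclose(T, T_hat, atol=1e-6))
```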

SLIDE 27

Example: heterogeneous MRA

MRA with multiple signals x^{(1)}, . . . , x^{(K)}

T_d(x) = ∑_{k=1}^K ∑_{g∈G} (g · x^{(k)})^{⊗d}

Jennrich’s algorithm works if given the 5th moment: n = O(σ^{10}) [1]
Information-theoretically, the 3rd moment suffices if K ≤ p/6

◮ Can even show unique recovery (upcoming with Joe Kileel)

If the signals x^{(k)} are random (i.i.d. Gaussian), it is conjectured that efficient recovery is possible from the 3rd moment iff K ≤ √p [2]

Theorem (with A. Moitra): if K ≤ √p / polylog(p), then for random signals, efficient recovery is possible from the 3rd moment

◮ Based on random overcomplete 3-tensor decomposition [3]

[1] Perry, Weed, Bandeira, Rigollet, Singer ’17
[2] Boumal, Bendory, Lederman, Singer ’17
[3] Ma, Shi, Steurer ’16

SLIDE 28

Open problems

◮ Analytic results for all problem sizes
◮ Efficiently test if unique recovery is possible
◮ Determine the computational limits
◮ Polynomial-time recovery for all groups

Thanks!
