SLIDE 1

One sketch for all: Fast algorithms for compressed sensing

Martin J. Strauss, University of Michigan. Covers joint work with Anna Gilbert (Michigan), Joel Tropp (Michigan), and Roman Vershynin (UC Davis).

SLIDE 2

Heavy Hitters/Sparse Recovery

Sparse recovery is the idea that noisy sparse signals can be approximately reconstructed, efficiently, from a small number of nonadaptive linear measurements. It is known as “Compress(ed/ive) Sensing,” or as the “Heavy Hitters” problem in databases.

SLIDE 3

Simple Example

Measurements = Φ · s, where the signal s contains a single spike of value 5.3 and the measurement matrix Φ consists of an all-ones row plus one bit-test row per bit of the position index (row i has a 1 in each column whose index has bit i set). The all-ones row reads off the coefficient 5.3; each bit-test row reads either 5.3 or 0, spelling out the spike’s position in binary. Recover position and coefficient of single spike in signal.
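The bit-test decoding can be made concrete; here is a minimal sketch in Python (the exact matrix layout and the tolerance are illustrative assumptions, not the talk’s precise construction):

```python
import numpy as np

def bit_test_matrix(d):
    """One all-ones row, then one row per bit of the position index:
    row i+1 has a 1 in column j iff bit i of j is set."""
    b = int(np.ceil(np.log2(d)))
    rows = [np.ones(d)]
    rows += [np.array([(j >> i) & 1 for j in range(d)], dtype=float)
             for i in range(b)]
    return np.vstack(rows)

def recover_single_spike(meas, tol=1e-9):
    """For a 1-sparse signal: meas[0] is the coefficient; bit i of the
    position is 1 iff bit test i returned a nonzero value."""
    pos = sum(1 << i for i, m in enumerate(meas[1:]) if abs(m) > tol)
    return pos, meas[0]

d = 16
s = np.zeros(d)
s[11] = 5.3                                           # single spike
print(recover_single_spike(bit_test_matrix(d) @ s))   # -> (11, 5.3)
```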

SLIDE 4

In Streaming Algorithms

  • Maintain vector s of frequency counts from transaction stream:

✸ 2 spinach sold, 1 spinach returned, 1 kaopectate sold, ...

  • Recompute top-selling items upon each new sale

Linearity of Φ:

  • Φ(s + ∆s) = Φ(s) + Φ(∆s), as illustrated below.
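Linearity is what makes the sketch maintainable online; a small illustration (the dimensions, item indices, and Bernoulli sketch matrix are made up for the example):

```python
import numpy as np

rng = np.random.default_rng(0)
d = 1024
phi = rng.integers(0, 2, size=(64, d)).astype(float)  # some fixed sketch matrix

sketch = phi @ np.zeros(d)         # sketch of the empty inventory

def sell(sketch, item, count):
    """Phi(s + delta_s) = Phi(s) + Phi(delta_s); a single sale is a
    1-sparse update, so it touches only one column of phi."""
    return sketch + count * phi[:, item]

sketch = sell(sketch, item=7, count=2)     # 2 spinach sold
sketch = sell(sketch, item=7, count=-1)    # 1 spinach returned
sketch = sell(sketch, item=42, count=1)    # 1 kaopectate sold
```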

SLIDE 5

Goals

  • Input: All noisy m-sparse vectors in d dimensions
  • Output: Locations and values of the m spikes, with

    – Error Goal: error proportional to the optimal m-term error

Resources:

  • Measurement Goal: n ≤ m·polylog(d) fixed measurements
  • Algorithmic Goal: computation time poly(m log(d))

    – Time close to the output size m, not the signal length d.

  • Universality Goal: One matrix works for all signals.

SLIDE 6

Overview

  • One sketch for all
  • Goals and Results
  • Chaining Algorithm
  • HHS Algorithm (builds on Chaining)

SLIDE 7

Role of Randomness

Signal is worst-case, not random. Two possible models for random measurement matrix.

SLIDE 8

Random Measurement Matrix “for each” Signal

We present a coin-tossing algorithm. Coins are flipped and the matrix Φ is fixed; independently, the adversary picks the worst signal (without seeing Φ). Then the algorithm runs.

  • Randomness in Φ is needed to defeat the adversary.

SLIDE 9

Universal Random Measurement Matrix

We present a coin-tossing algorithm. Coins are flipped. The matrix Φ is fixed. The adversary picks the worst signal (now knowing Φ). The algorithm runs.

  • Randomness is used to construct a correct Φ efficiently (probabilistic method).

SLIDE 10

Why Universal Guarantee?

Often unnecessary, but needed for iterative schemes. E.g.

  • Inventory s1: 100 spinach, 5 lettuce, 2 bread, 30 back-orders for kaopectate ...
  • Sketch using Φ: 98 spinach, −31 kaopectate
  • Manager: based on the sketch, remove all spinach and lettuce; order 40 kaopectate
  • New inventory s2: 0 spinach, 0 lettuce, 2 bread, 10 kaopectate, ...

s2 depends on the measurement matrix Φ, so there are no guarantees for Φ on s2, and it is too costly to have a separate Φ per sale. Today: universal guarantee.

SLIDE 11

Overview

  • One sketch for all
  • Goals and Results
  • Chaining Algorithm
  • HHS Algorithm (builds on Chaining)

SLIDE 12

Goals

  • Universal guarantee: one sketch for all
  • Fast: decoding time poly(m log(d))
  • Few: optimal number of measurements (up to log factors)

Previous work achieved two out of three:

  Ref.      Univ.   Fast   Few meas.   Technique
  KM         ×       ✓        ✓        comb’l
  D, CRT     ✓       ×        ✓        LP(d)
  CM∗        ✓       ✓        ×        comb’l
  Today      ✓       ✓        ✓        comb’l

∗restrictions apply

SLIDE 13

Results

Two algorithms, Chaining and HHS. (Õ hides factors of log(d)/ε.)

  Alg.       # meas.   Time    # out   Error
  Chaining   Õ(m)      Õ(m)    m       ‖E‖₁ ≤ O(log m) · ‖Eopt‖₁

SLIDE 14

Results

Two algorithms, Chaining and HHS. (Õ hides factors of log(d)/ε.)

  Alg.       # meas.   Time     # out   Error
  Chaining   Õ(m)      Õ(m)     m       ‖E‖₁ ≤ O(log m) · ‖Eopt‖₁
  HHS        Õ(m)      Õ(m²)    O(m)    ‖E‖₂ ≤ (ε/√m) · ‖Eopt‖₁

SLIDE 15

Results

Two algorithms, Chaining and HHS. (Õ hides factors of log(d)/ε.)

  #   Alg.       # meas.   Time     # out   Error
  1   Chaining   Õ(m)      Õ(m)     m       ‖E‖₁ ≤ O(log m) · ‖Eopt‖₁
  2   HHS        Õ(m)      Õ(m²)    O(m)    ‖E‖₂ ≤ (ε/√m) · ‖Eopt‖₁
  3   HHS        Õ(m)      Õ(m²)    m       ‖E‖₂ ≤ ‖Eopt‖₂ + (ε/√m) · ‖Eopt‖₁
  4   HHS        Õ(m)      Õ(m²)    m       ‖E‖₁ ≤ (1 + ε) · ‖Eopt‖₁

(3) and (4) are gotten by truncating the output of HHS.

SLIDE 16

Results

  Ref.     # meas.       Time         Error                             Failure
  K-M      Õ(m)          poly(m)      ‖E‖₂ ≤ (1 + ε) · ‖Eopt‖₂          “for each”
  D, C-T   O(m log d)    d^(1 to 3)   ‖E‖₂ ≤ (ε/√m) · ‖Eopt‖₁           universal
  CM       Õ(m²)         poly(m)      ‖E‖₂ ≤ (ε/√m) · ‖Eopt‖₁           deterministic
  Chg      Õ(m)          Õ(m)         ‖E‖₁ ≤ O(log m) · ‖Eopt‖₁         universal
  HHS      Õ(m)          Õ(m²)        ‖E‖₂ ≤ (ε/√m) · ‖Eopt‖₁           universal

  • Õ and poly(·) hide factors of log(d)/ε.

SLIDE 17

Overview

  • One sketch for all
  • Goals and Results
  • Chaining Algorithm
  • HHS Algorithm (builds on Chaining)

SLIDE 18

Chaining Algorithm—Overview

  • Handle the universal guarantee
  • Group testing

    – Process several spikes at once
    – Reduce noise

  • Process single spike bit-by-bit as above.
  • Iterate on residual.

SLIDE 19

Universal Guarantee

  • Fix m spike positions
  • Succeed except with probability exp(−Ω(m log d))

– succeed “for each” signal

  • Union bound over all spike configurations.

    – At most exp(m log(d)) configurations of spikes.
    – Converts “for each” into the universal model.

SLIDE 20

Noisy Example—Isolation

Each group is defined by a mask:

  signal:        0.1   5.3   −0.1   0.2   6.8
  random mask:     1     1      0     1     0
  product:       0.1   5.3      0   0.2     0

SLIDE 21

Noisy Example

          5.6 · · · 0.2 5.5           =           1 1 1 1 1 1 1 1 · · · · · · · · · · · · · · · · · · · · · · · · 1 1 1 1 1 1 1 1 1 1 1 1           ·                   0.1 5.3 0.2                   Recover position and coefficient of single spike, even with noise. (Mask and bit tests combine into measurements.)

SLIDE 22

Group Testing for Spikes

E.g., m spikes (i, sᵢ) at height 1/m; ‖noise‖₁ = 1/20. (For now.)

  • (i, sᵢ) is a spike if |sᵢ| ≥ (1/m) · ‖noise‖₁.

SLIDE 23

Group Testing for Spikes

E.g., m spikes (i, sᵢ) at height 1/m; ‖noise‖₁ = 1/20. (For now.)

  • (i, sᵢ) is a spike if |sᵢ| ≥ (1/m) · ‖noise‖₁.

Throw the d positions into n = O(m) groups, by Φ.

  • ≥ c₁m of the m spikes are isolated in their groups
  • ≤ c₂m groups have noise ≥ 1/(2m) (see next slide)
  • ≥ (c₁ − c₂)m groups have a unique spike and low noise—recover!

...except with probability e^(−m). Repeat O(log(d)) times: recover Ω(m) spikes except with probability e^(−m log(d)).
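One round of the group-testing step, sketched in Python; the fully independent hash and the exact constants are simplifications (the actual scheme uses limited independence and repeats O(log d) times):

```python
import numpy as np

rng = np.random.default_rng(1)

def one_round(s, m):
    """Hash all d positions into n = O(m) groups; decode each group as
    if it contained a single spike, via bit tests on the global index."""
    d, n = len(s), 4 * m
    group = rng.integers(0, n, size=d)       # simplification: full independence
    b = int(np.ceil(np.log2(d)))
    found = {}
    for g in range(n):
        idx = np.flatnonzero(group == g)
        total = s[idx].sum()
        if abs(total) < 1e-9:
            continue
        pos = 0
        for i in range(b):
            on = ((idx >> i) & 1).astype(bool)     # bit test i within the group
            if abs(s[idx[on]].sum()) > abs(total) / 2:
                pos |= 1 << i
        if pos in idx:                       # sanity check; noisy groups may
            found[pos] = total               # still yield false spikes
    return found

s = np.zeros(64)
s[[3, 17, 40]] = 1.0                         # m = 3 spikes
print(one_round(s, 3))                       # isolated spikes are recovered
```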

SLIDE 24

Noise

  • ‖Φ Eopt‖₁ ≤ ‖Φ‖₁→₁ · ‖Eopt‖₁.
  • We’ll show ‖Φ‖₁→₁ ≤ 1.
  • Thus the total noise contamination is at most the signal noise.
  • At most m/10 buckets get noise more than (10/m) · ‖Eopt‖₁.

Example (each bucket sums the signal entries hashed to it):

  (7, 9, 5)ᵀ = [ 1 0 0 0 0 1 ; 0 0 0 1 1 0 ; 0 1 1 0 0 0 ] · (1, 2, 3, 4, 5, 6)ᵀ

SLIDE 25

We’ve found some spikes

We’ve found (1/4)m spikes.

  • Subtract off the spikes (in the sketch): Φ(s − ∆s) = Φs − Φ(∆s); see the loop sketched below.
  • Recurse on problem of size (3/4)m.
  • Done after O(log(m)) iterations.
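The iteration can be written as a short loop; `decode_round` stands in for the group-testing decoder of the previous slides, and the 3/4 shrink rate is the slide’s constant, so this is a schematic sketch rather than the paper’s exact procedure:

```python
def chaining_outer_loop(sketch, phi, m, decode_round):
    """Find a constant fraction of the remaining spikes, subtract them
    from the sketch by linearity, and recurse; O(log m) rounds total."""
    recovered = {}
    residual = sketch.copy()
    k = m                                    # spikes still outstanding
    while k >= 1:
        for pos, val in decode_round(residual, k).items():
            recovered[pos] = recovered.get(pos, 0.0) + val
            residual -= val * phi[:, pos]    # Phi(s - ds) = Phi(s) - Phi(ds)
        k = k * 3 // 4                       # problem shrinks to ~(3/4)m
    return recovered
```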

But...

SLIDE 26

More Noise Issues

  • ≥ c1m of n groups have unique spikes (of m)
  • ≤ c2m groups have noise ≥ 1/(2m)
  • ≤ c3m groups have false spike

    ✸ Subtract off the large phantom spike
    ✸ Introduces a new (negative) spike (to be found later)

  • Other groups contribute additional noise (never to be found)

    ✸ Spike threshold rises from m⁻¹ to ((3/4)m)⁻¹.

SLIDE 27

More Noise Issues

  • ≥ c1m of n groups have unique spikes (of m)
  • ≤ c2m groups have noise ≥ 1/(2m)
  • ≤ c3m groups have false spike
  • Other groups contribute additional noise (never to be found)

Number of spikes: m → (c1 − c2 − c3)m ≈ (3/4)m. Spike threshold increases—delicate analysis.

  • Need spike (i, sᵢ) with |sᵢ| ≥ Ω(1/(m log m)) · ‖noise‖₁.

✸ Lets noise grow from round to round.

  • Prune carefully to reduce noise.
  • Get log factor in approximation.
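One plausible reading of where the log factor comes from, under the assumption that each of the O(log m) rounds adds error at most O(‖Eopt‖₁) to the residual:

```latex
\[
\|s - \hat{s}\|_1
  \;\le\; \sum_{j=1}^{O(\log m)} \bigl(\text{error added in round } j\bigr)
  \;\le\; O(\log m)\,\|E_{\mathrm{opt}}\|_1 .
\]
```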

SLIDE 28

Drawbacks with Chaining Pursuit

  • log factor in the error
  • the ℓ₁→ℓ₁ error bound is weaker than the standard ℓ₁→ℓ₂

SLIDE 29

Drawbacks with Chaining Pursuit

  • log factor in the error
  • the ℓ₁→ℓ₁ error bound is weaker than the standard ℓ₁→ℓ₂

Two algorithms, Chaining and HHS:

  #   Alg.       # meas.   Time     # out   Error
  1   Chaining   Õ(m)      Õ(m)     m       ‖E‖₁ ≤ O(log m) · ‖Eopt‖₁
  2   HHS        Õ(m)      Õ(m²)    O(m)    ‖E‖₂ ≤ (ε/√m) · ‖Eopt‖₁
  3   HHS        Õ(m)      Õ(m²)    m       ‖E‖₂ ≤ ‖Eopt‖₂ + (ε/√m) · ‖Eopt‖₁
  4   HHS        Õ(m)      Õ(m²)    m       ‖E‖₁ ≤ (1 + ε) · ‖Eopt‖₁

SLIDE 30

Overview

  • Assume limited dynamic range: ‖s‖₂ ≤ d log(d) · ‖Eopt‖₁.

    ✸ E.g., preprocess with the (simplified) Chaining algorithm.

  • While ‖s‖₂ > (ε/√m) · ‖Eopt‖₁, reduce ‖s‖₂ by a factor of 2:

    ✸ Identify a fraction of the spikes
    ✸ Estimate their values.

  • Separating Identification from Estimation eliminates problems caused by false positives.

SLIDE 31

ℓ₂ error

Our focus:

  • ≈ q spikes with magnitude ≈ 1/t
  • Noise ‖Eopt‖₁ = ‖ν‖₁ = 1.

(Try all q’s and t’s in a geometric progression.)

Remark:

  • In the Chaining (ℓ₁ ← ℓ₁) setup, we can assume 1/t ≥ 1/q. (The spike height 1/t is big.)
  • Challenge here: possibly 1/t = 1/√(qm).

SLIDE 32

Double Hashing

Have: q spikes at 1/t; noise 1. Double hashing:

  • Each position goes to 1 group among q. (As in Chaining.)
  • Within each group, each position goes (in expectation) to t/q of the (t/q)² subgroups. (Some log factors suppressed.)
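A sketch of the two-level hashing, assuming for simplicity that q divides t; constants and log factors are suppressed as on the slide:

```python
import numpy as np

rng = np.random.default_rng(2)

def double_hash_measure(s, q, t):
    """Level 1: partition positions into q groups. Level 2: within each
    group, take (t/q)^2 Bernoulli(q/t) masked sums."""
    d = len(s)
    level1 = rng.integers(0, q, size=d)           # position -> group
    r = (t // q) ** 2                             # rows per group
    meas = np.zeros((q, r))
    for g in range(q):
        in_group = (level1 == g)
        for row in range(r):
            mask = rng.random(d) < q / t          # Bernoulli(q/t) mask
            meas[g, row] = s[in_group & mask].sum()
    return meas

s = np.zeros(4096)
s[rng.choice(4096, size=8, replace=False)] = 1.0 / 64   # q=8 spikes at 1/t, t=64
print(double_hash_measure(s, q=8, t=64).shape)          # (8, 64)
```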

SLIDE 33

First Hashing

Have: q spikes at 1/t; noise 1. Throw positions into q buckets, by Φ. As in Chaining, except with probability e^(−q log(d)) = d^(−q):

  • Ω(q) spikes are isolated from the other spikes
  • ‖Φ‖₁→₁ ≤ 1.

    ✸ Thus only O(q) buckets get noise more than 1/q.

SLIDE 34

Second Hashing

Have 1 spike at 1/t; noise ‖ν‖₁ ≤ 1/q. Use r = (t/q)² rows of Bernoulli(q/t): the signal vector has a single spike of height 1/t amid noise entries of size ≈ 1/(dq), and each row of the matrix keeps each position independently with probability q/t.

SLIDE 35

Second Hashing

Have 1 spike at 1/t; noise ‖ν‖₁ ≤ 1/q. Use r = (t/q)² rows of Bernoulli(q/t).

  • Our spike survives in r′ = r · (q/t) = t/q rows.

SLIDE 36

Second Hashing

Have 1 spike at 1/t; noise ‖ν‖₁ ≤ 1/q. Use r = O((t/q)²) rows of Bernoulli(q/t).

  • Our spike survives in r′ = r · (q/t) = t/q rows.
  • On the surviving submatrix, expect r′ · (q/t) = one 1 per other column.

SLIDE 37

Second Hashing

Have 1 spike at 1/t; noise ‖ν‖₁ ≤ 1/q. Except with probability d^(−3):

  • Our spike survives in r′ = r · (q/t) = t/q rows.
  • On the surviving submatrix, expect r′ · (q/t) = one 1 per other column.

Take a union bound over d spikes and d matrix columns. For any noise ‖ν‖₁ = 1/q, some row gets at most the average noise, (1/q)/r′ = 1/t. Can recover a spike of magnitude 1/t from noise 1/(2t).
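The expected counts on this slide follow by direct arithmetic:

```latex
\[
r' \;=\; r\cdot\frac{q}{t} \;=\; \Bigl(\frac{t}{q}\Bigr)^{2}\frac{q}{t} \;=\; \frac{t}{q},
\qquad
r'\cdot\frac{q}{t} \;=\; 1 \;\text{ expected survival per other column},
\qquad
\frac{\nu_1}{r'} \;=\; \frac{1/q}{t/q} \;=\; \frac{1}{t} \;\text{ average noise per surviving row}.
\]
```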

SLIDE 38

Number of Measurements

Number of measurements: q · (t/q)² · log(d) = poly(log(d)/ε) · t²/q, accounting for:

  • First hashing (q rows)
  • Second hashing ((t/q)² rows)
  • Bit tests (log(d) rows)
  • (Several!) omitted factors of log(d) and 1/ε.

Note: q/t² = ‖s‖₂² > (m^(−1/2) · ‖Eopt‖₁)² = 1/m.

So the number of measurements is t²/q ≤ m.
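The final bound, written out (taking ‖Eopt‖₁ = 1 as normalized on the earlier slide):

```latex
\[
q\cdot\Bigl(\frac{t}{q}\Bigr)^{2}\cdot\log d \;=\; \frac{t^{2}}{q}\,\mathrm{polylog}(d),
\qquad
\frac{q}{t^{2}} \;=\; \|s\|_2^2 \;>\; \Bigl(m^{-1/2}\,\|E_{\mathrm{opt}}\|_1\Bigr)^{2} \;=\; \frac{1}{m}
\;\;\Longrightarrow\;\;
\frac{t^{2}}{q} \;<\; m .
\]
```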

SLIDE 39

Cost

Re-measure an O(m)-sparse vector by a matrix with at most Õ(m) rows:

  • Time: m² · poly(log(d)/ε).

Matrix generation, first hashing:

  • Generate m r.v.’s from an m-wise independent family
  • Time: m · polylog(d).

Matrix generation, second hashing:

  • m times, generate m r.v.’s from a 2-wise independent family
  • Time: m² · polylog(d).

An improvement to m^(3/2) is possible here; the bottleneck of m² is in Estimation.

SLIDE 40

Estimation

Have:

  • Set A of positions in signal s.
  • Measurements Φs, for random DFT-row-submatrix Φ.

Want:

  • An estimate ŝ_A for s_A with ‖ŝ_A − s_A‖₂ ≤ ‖s − s_A‖₂ + m^(−1/2) · ‖s − s_A‖₁.

Note: we can assume ‖s − s_A‖₂ is small, by goodness of identification.

SLIDE 41

Estimator

  • ŝ_A = Φ_A⁺ (Φs) (least squares), where Φ_A⁺ is the pseudoinverse of the column submatrix Φ_A.
  • Correctness mostly follows from Candès-Tao and Rudelson-Vershynin.
  • Small space, and runtime O(m²), are immediate.
  • Open: multiply an m × m DFT submatrix by a vector faster than m².
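The estimation step as code; a Gaussian matrix stands in for the DFT-row submatrix, purely for illustration:

```python
import numpy as np

def estimate(phi, meas, A):
    """Least-squares estimate on the identified positions A:
    s_hat_A = pinv(phi_A) @ (phi @ s)."""
    vals, *_ = np.linalg.lstsq(phi[:, A], meas, rcond=None)
    return dict(zip(A, vals))

rng = np.random.default_rng(3)
d, n = 256, 64
phi = rng.standard_normal((n, d)) / np.sqrt(n)   # stand-in for a DFT-row submatrix
s = np.zeros(d)
s[[7, 50, 200]] = [3.0, -2.0, 4.0]
print(estimate(phi, phi @ s, [7, 50, 200]))      # ~exact on noiseless data
```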

SLIDE 42

Recap

New compressed sensing/heavy hitter algorithms that get

  • Universal guarantee
  • Decoding time poly(m log(d))
  • Optimal number of measurements (up to log factors)

Chaining material is based on the paper “Algorithmic Linear Dimension Reduction in the ℓ1 Norm for Sparse Vectors” (available from my homepage). HHS material is based on the paper “One sketch for all: Fast algorithms for compressed sensing” (submitted; available soon), by Gilbert, Strauss, Tropp, and Vershynin.

SLIDE 43

Euclid v. Taxicab

The optimal error vector Eopt = s − s_m is s with its m heavy hitters zeroed out. Our error vector is E = s − ŝ.

  • Ideally, ‖E‖₂ ≤ (1 + ε) · ‖Eopt‖₂.

    ✸ Achievable with the “for each” guarantee
    ✸ Impossible with the universal guarantee (Cohen-Dahmen-DeVore, 2006)

  • The best possible with the universal guarantee is ‖E‖₂ ≤ (ε/√m) · ‖Eopt‖₁ (and related bounds).

SLIDE 44

Alternative Characterization

  • ‖E‖₂ ≤ (1 + ε) · ‖Eopt‖₂ is vacuous unless Eopt ∈ B₂(1).
  • ‖E‖₂ ≤ (ε/√m) · ‖Eopt‖₁ is vacuous unless Eopt ∈ B₁(√m/ε).

Defeat Φ by finding a signal s with s ∈ null(Φ):

  • Any Φ: there is an s ∈ null(Φ) with Eopt ∈ B₂(1).
  • Our Φ: there is no s ∈ null(Φ) with Eopt ∈ B₁(√m/ε).

Today: universal failure guarantee, with ℓ₁ noise.

SLIDE 45

Cor.: Algorithmic Dimension Reduction

Goal: (R^d, ℓ₁) → (R^n, ℓ₁), for n ≪ d. Impossibility results in general (Brinkman and Charikar, 2003).

Chaining algorithm: (X^d_m, ℓ₁) → (R^n, ℓ₁), for n = m·polylog(d), where X^d_m ⊆ R^d is the set of m-sparse signals.

  • Robust to perturbations
  • Compute and invert in time m·polylog(d).
  • Distortion: polylog(m).
  • cf. Charikar and Sahai: distortion (1 + ε) but n = Θ((m/ε)² log(d)).

SLIDE 46

Analysis

‖ŝ_A − s_A‖₂ = ‖Φ_A⁺ Φ s − s_A‖₂
             = ‖Φ_A⁺ Φ (s − s_A)‖₂
             ≤ O(‖s − s_A‖_K)   (Need this!)
             = O(‖s − s_A‖₂ + m^(−1/2) · ‖s − s_A‖₁).

We’ll bound ‖Φ_A⁺ Φ‖_{K→2} by bounding ‖Φ_A⁺‖_{2→2} and ‖Φ‖_{K→2}.

SLIDE 47

Operator bounds

Need to bound ‖Φ_A⁺‖_{2→2} and ‖Φ‖_{K→2}.

Candès-Tao, Rudelson-Vershynin:

  • All size-(2m) column submatrices are near-isometries (RIC)
  • ...so ‖Φ_A⁺‖_{2→2} ≤ 2 immediately.

We show that the RIC implies a bound on ‖Φ‖_{K→2}.

SLIDE 48

K to 2 Bound

If s is q spikes of (near-)equal size, m ≤ q ≤ 2m, then ‖Φs‖₂ ≤ m^(−1/2) · ‖s‖₁.

Suppose ‖Φx‖₂ ≤ m^(−1/2) · ‖x‖₁ and ‖Φy‖₂ ≤ m^(−1/2) · ‖y‖₁, for x and y disjointly supported. Then

  ‖Φ(x + y)‖₂ ≤ ‖Φx‖₂ + ‖Φy‖₂ ≤ m^(−1/2) · (‖x‖₁ + ‖y‖₁) = m^(−1/2) · ‖x + y‖₁ ≤ m^(−1/2) · ‖x + y‖_K.

Combine all groups of size ≥ m this way.

SLIDE 49

K to 2 Bound

If s is q ≤ m spikes of (near-)equal size t, then ‖Φs‖₂ ≤ ‖s‖₂. Do all q = 1, 2, 4, 8, ..., m and the O(log(d)) relevant values of t.

Suppose ‖Φx‖₂ ≤ ‖x‖₂ and ‖Φy‖₂ ≤ ‖y‖₂, for x and y disjointly supported. Then

  ‖Φ(x + y)‖₂ ≤ ‖Φx‖₂ + ‖Φy‖₂ ≤ ‖x‖₂ + ‖y‖₂ ≤ √2 · ‖x + y‖₂,

by Cauchy-Schwarz. We give up a factor polylog(d) in this proof; a slicker proof gives no overhead from the RIC to the K → 2 norm.
