LISTA: Theoretical Linear Convergence, Practical Weights and - - PowerPoint PPT Presentation

lista theoretical linear convergence practical weights
SMART_READER_LITE
LIVE PREVIEW

LISTA: Theoretical Linear Convergence, Practical Weights and - - PowerPoint PPT Presentation

LISTA: Theoretical Linear Convergence, Practical Weights and Thresholds Xiaohan Chen , Jialin Liu , Zhangyang Wang , and Wotao Yin TAMU, CSE UCLA, Math NeurIPS18 Overview Recover sparse x from b := Ax + white


slide-1
SLIDE 1

LISTA: Theoretical Linear Convergence, Practical Weights and Thresholds

Xiaohan Chen⋆, Jialin Liu†, Zhangyang Wang⋆, and Wotao Yin†

⋆ TAMU, CSE † UCLA, Math

NeurIPS’18

slide-2
SLIDE 2

Overview

Recover sparse x∗ from b := Ax∗ + white noise Our methods improve on LISTA (Gregor&LeCun’10) and related work by

learning fewer parameters (faster training) adding support detection (faster recovery) proving linear convergence and robustness (theoretical guarantee)

slide-3
SLIDE 3

Review: ISTA and LISTA

ISTA (iterative soft thresholding) x(k+1) = SoftThresholdθ

  • x(k) + αAT (b − Ax(k))
  • .

α, θ are chosen by hand or cross validation.

slide-4
SLIDE 4

Review: ISTA and LISTA

ISTA (iterative soft thresholding) x(k+1) = SoftThresholdθ

  • x(k) + αAT (b − Ax(k))
  • .

α, θ are chosen by hand or cross validation. LISTA (Learned ISTA) x(k+1) = SoftThresholdθk

  • W k

1 b + W k 2 x(k)

. θk, W k

1 , W k 2 are chosen by stochastic optimization

minimize

{θk,W k

1 ,W k 2 }

  • Ex⋆,bxK(b) − x⋆2

using synthesized (x⋆, b) obeying b = Ax⋆ + white noise.

slide-5
SLIDE 5

Review: ISTA and LISTA

ISTA (iterative soft thresholding) x(k+1) = SoftThresholdθ

  • x(k) + αAT (b − Ax(k))
  • .

α, θ are chosen by hand or cross validation. LISTA (Learned ISTA) x(k+1) = SoftThresholdθk

  • W k

1 b + W k 2 x(k)

. θk, W k

1 , W k 2 are chosen by stochastic optimization

minimize

{θk,W k

1 ,W k 2 }

  • Ex⋆,bxK(b) − x⋆2

using synthesized (x⋆, b) obeying b = Ax⋆ + white noise. Compare: ISTA is slow, no training. LISTA is fast, difficult-to-train.

slide-6
SLIDE 6

Proposed — coupled LISTA

LISTA-CP: couple W k

1 and W k 2 via

W k

1 A + W k 2 = I.

We show: x(k) → x⋆ implies this relation to hold asymptotically.

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 0.5 1 1.5 2 2.5 3

slide-7
SLIDE 7

Proposed — support selection

LISTA-CPSS: support selection

Only the large coordinates pass activations to the next iteration. Ideas from Linearized Bregman iteration (kicking)1 and Fixed-Point Continuation method (FPC)2.

1Stanley Osher et al. ’2011 2Elaine Hale, Wotao Yin, Yin Zhang ’2008

slide-8
SLIDE 8

Robust global linear convergence

Theorem

Fix A, sparsity level s, and noise level σ. There exist {θk, W k

1 } such that LISTA-CP obeys

x(k) − x⋆2 ≤ sC1e−C2k + C3σ, k = 1, 2, . . . where C1, C2, C3 > 0 are constants. LISTA-CPSS improves the constants C2, C3.

slide-9
SLIDE 9

Weight coupling test

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

  • 50
  • 40
  • 30
  • 20
  • 10

ISTA FISTA AMP LISTA LISTA-CP

CP can stabilize intermediate results. CP will not hurt final recovery performance.

slide-10
SLIDE 10

Support selection test (no noise)

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

  • 70
  • 60
  • 50
  • 40
  • 30
  • 20
  • 10

ISTA FISTA AMP LISTA LAMP LISTA-CP LISTA-SS LISTA-CPSS

← LISTA and LISTA-CP ← LISTA-SS ← LISTA-CPSS

slide-11
SLIDE 11

Thank you!

10:45 AM – 12:45 PM Room 210 & 230 AB #163 Welcome to our poster for more details!