Identifiability of Blind Deconvolution with Subspace or Sparsity - - PowerPoint PPT Presentation

identifiability of blind deconvolution with subspace or
SMART_READER_LITE
LIVE PREVIEW

Identifiability of Blind Deconvolution with Subspace or Sparsity - - PowerPoint PPT Presentation

Identifiability of Blind Deconvolution with Subspace or Sparsity Constraints Yanjun Li Joint work with Kiryung Lee and Yoram Bresler Coordinated Science Laboratory Department of Electrical and Computer Engineering University of Illinois,


slide-1
SLIDE 1

Identifiability of Blind Deconvolution with Subspace or Sparsity Constraints

Yanjun Li

Joint work with Kiryung Lee and Yoram Bresler

Coordinated Science Laboratory Department of Electrical and Computer Engineering University of Illinois, Urbana-Champaign Email: yli145@illinois.edu

SPARS 2015 July 6-9, 2015, Cambridge, UK

1

slide-2
SLIDE 2

Blind Deconvolution

PSF

signal: 𝑣 ⊛ filter: 𝑤 = measurement: 𝑨

Both u and v are unknown = ⇒ Ill-posed bilinear inverse problem Solved with “good” priors (e.g., subspace, sparsity) Empirical success in various applications (e.g., blind image deblurring, speech dereverberation, seismic data analysis, etc.) – Theoretical results are limited. = ⇒ The focus of this presentation

2

slide-3
SLIDE 3

Problem Statement

Signal: u0 ∈ Cn Filter: v0 ∈ Cn Measurement: z = u0 ⊛ v0 ∈ Cn find (u, v) s.t. u ⊛ v = z, u ∈ ΩU, v ∈ ΩV. Three scenarios:

1

Subspace constraints

2

Sparsity constraints

3

Mixed constraints

z y x

u ∈

z y x

v ∈

3

slide-4
SLIDE 4

Problem Statement

Signal: u0 ∈ Cn Filter: v0 ∈ Cn Measurement: z = u0 ⊛ v0 ∈ Cn find (u, v) s.t. u ⊛ v = z, u ∈ ΩU, v ∈ ΩV. Three scenarios:

1

Subspace constraints

2

Sparsity constraints

3

Mixed constraints

z y x

u ∈

z y x

v ∈

3

slide-5
SLIDE 5

Problem Statement

Signal: u0 ∈ Cn Filter: v0 ∈ Cn Measurement: z = u0 ⊛ v0 ∈ Cn find (u, v) s.t. u ⊛ v = z, u ∈ ΩU, v ∈ ΩV. Three scenarios:

1

Subspace constraints

2

Sparsity constraints

3

Mixed constraints

z y x

u ∈

z y x

v ∈

3

slide-6
SLIDE 6

Problem Statement

Signal: u0 = Dx0, the columns of D ∈ Cn×m1 form a basis or a frame Filter: v0 = Ey0, the columns of E ∈ Cn×m2 form a basis or a frame Measurement: z = u0 ⊛ v0 = (Dx0) ⊛ (Ey0) ∈ Cn (BD) find (x, y) s.t. (Dx) ⊛ (Ey) = z, x ∈ ΩX , y ∈ ΩY. Three scenarios:

1

Subspace constraints: ΩX = Cm1 and ΩY = Cm2

2

Sparsity constraints: ΩX = {x ∈ Cm1 : x0 ≤ s1} and ΩY = {y ∈ Cm2 : y0 ≤ s2}

3

Mixed constraints: ΩX = {x ∈ Cm1 : x0 ≤ s1} and ΩY = Cm2

4

slide-7
SLIDE 7

Identifiability up to Scaling, and Lifting

Definition (Identifiability up to scaling)

For (BD), the pair (x0, y0) is identifiable up to scaling from the measurement (Dx0) ⊛ (Ey0), if every solution (x, y) satisfies x = σx0 and y = 1

σy0 for some

nonzero scalar σ.

Lifting

Define GDE : Cm1×m2 → Cn such that GDE(xyT ) = (Dx) ⊛ (Ey), and M0 = x0yT

0 ∈ ΩM = {xyT : x ∈ ΩX , y ∈ ΩY}.

(BD) find (x, y), s.t. (Dx) ⊛ (Ey) = z, x ∈ ΩX , y ∈ ΩY. = ⇒ (Lifted BD) find M, s.t. GDE(M) = z, M ∈ ΩM.

5

slide-8
SLIDE 8

Identifiability up to Scaling, and Lifting

Definition (Identifiability up to scaling)

For (BD), the pair (x0, y0) is identifiable up to scaling from the measurement (Dx0) ⊛ (Ey0), if every solution (x, y) satisfies x = σx0 and y = 1

σy0 for some

nonzero scalar σ.

Lifting

Define GDE : Cm1×m2 → Cn such that GDE(xyT ) = (Dx) ⊛ (Ey), and M0 = x0yT

0 ∈ ΩM = {xyT : x ∈ ΩX , y ∈ ΩY}.

(BD) find (x, y), s.t. (Dx) ⊛ (Ey) = z, x ∈ ΩX , y ∈ ΩY. = ⇒ (Lifted BD) find M, s.t. GDE(M) = z, M ∈ ΩM.

5

slide-9
SLIDE 9

Previous Results (all based on a lifting formulation)

Identifiability analysis

◮ [Choudhary and Mitra, 2014]: canonical sparsity constraints

– Lacks sample-complexity type interpretation

Guaranteed recovery algorithms

◮ [Ahmed, Recht, and Romberg, 2014]: nuclear norm minimization ◮ [Ling and Strohmer, 2015]: ℓ1 norm minimization ◮ [Lee, Y. Li, Junge, and Bresler, 2015]: alternating minimization ◮ [Chi, 2015]: atomic norm minimization

Constructive proof of uniqueness – Requires probabilistic assumptions and interpretations

Goal

Identifiability in BD with more general bases or frames Algebraic analysis with minimal and deterministic assumptions Optimality in terms of sample complexities

6

slide-10
SLIDE 10

Previous Results (all based on a lifting formulation)

Identifiability analysis

◮ [Choudhary and Mitra, 2014]: canonical sparsity constraints

– Lacks sample-complexity type interpretation

Guaranteed recovery algorithms

◮ [Ahmed, Recht, and Romberg, 2014]: nuclear norm minimization ◮ [Ling and Strohmer, 2015]: ℓ1 norm minimization ◮ [Lee, Y. Li, Junge, and Bresler, 2015]: alternating minimization ◮ [Chi, 2015]: atomic norm minimization

Constructive proof of uniqueness – Requires probabilistic assumptions and interpretations

Goal

Identifiability in BD with more general bases or frames Algebraic analysis with minimal and deterministic assumptions Optimality in terms of sample complexities

6

slide-11
SLIDE 11

Sample Complexities for Uniqueness in BD

z n

=

D x0 m1 s1

E y0 m2

Mixed constraints

z = (Dx0) ⊛ (Ey0)

Theorem (Generic bases or frames)

The pair (x0, y0) is identifiable up to scaling from (Dx0) ⊛ (Ey0) for almost all D ∈ Cn×m1 and E ∈ Cn×m2 if: (subspace constraints) n ≥ m1m2 (sparsity constraints) n ≥ 2s1s2 (mixed constraints) n ≥ 2s1m2

7

slide-12
SLIDE 12

Proof Sketch (Subspace Constraints, Generic D & E)

Lemma

If n ≥ m1m2, then for almost all D ∈ Cn×m1 and E ∈ Cn×m2, the following matrix GDE has full column rank: GDE vec(xyT ) = (Dx) ⊛ (Ey)

Lemma [Harikumar and Bresler, 1998] “Proof by Example”

Suppose the entries of GDE are polynomials in the entries of D and E. Suppose GDE has full column rank for at least one choice of D and E. Then GDE has full column rank for almost all D and E.

One good choice of D & E for n ≥ m1m2

Fn DFT matrix z = (FnD

  • D

x) ⊙ (FnE

  • E

y) = FnGDE

  • GDE

vec(xyT ) – In frequency domain

  • D =

m1

  • E =

m2

= ⇒

  • GDE =

m1m2 m1m2 n Full rank submatrices 8

slide-13
SLIDE 13

Optimality?

Theorem (Generic bases or frames)

The pair (x0, y0) is identifiable up to scaling from (Dx0) ⊛ (Ey0) for almost all D ∈ Cn×m1 and almost all E ∈ Cn×m2 if: (subspace constraints) n ≥ m1m2 (sparsity constraints) n ≥ 2s1s2 (mixed constraints) n ≥ 2s1m2 Suspect this is suboptimal (# df = m1 + m2 − 1 for subspace constraints) Q: Can we get optimal sample complexities? A: Yes, if we consider more specialized scenarios.

9

slide-14
SLIDE 14

Optimality?

Theorem (Generic bases or frames)

The pair (x0, y0) is identifiable up to scaling from (Dx0) ⊛ (Ey0) for almost all D ∈ Cn×m1 and almost all E ∈ Cn×m2 if: (subspace constraints) n ≥ m1m2 (sparsity constraints) n ≥ 2s1s2 (mixed constraints) n ≥ 2s1m2 Suspect this is suboptimal (# df = m1 + m2 − 1 for subspace constraints) Q: Can we get optimal sample complexities? A: Yes, if we consider more specialized scenarios.

9

slide-15
SLIDE 15

Optimality?

Theorem (Generic bases or frames)

The pair (x0, y0) is identifiable up to scaling from (Dx0) ⊛ (Ey0) for almost all D ∈ Cn×m1 and almost all E ∈ Cn×m2 if: (subspace constraints) n ≥ m1m2 (sparsity constraints) n ≥ 2s1s2 (mixed constraints) n ≥ 2s1m2 Suspect this is suboptimal (# df = m1 + m2 − 1 for subspace constraints) Q: Can we get optimal sample complexities? A: Yes, if we consider more specialized scenarios.

9

slide-16
SLIDE 16

Optimality?

Theorem (Generic bases or frames)

The pair (x0, y0) is identifiable up to scaling from (Dx0) ⊛ (Ey0) for almost all D ∈ Cn×m1 and almost all E ∈ Cn×m2 if: (subspace constraints) n ≥ m1m2 (sparsity constraints) n ≥ 2s1s2 (mixed constraints) n ≥ 2s1m2 Suspect this is suboptimal (# df = m1 + m2 − 1 for subspace constraints) Q: Can we get optimal sample complexities? A: Yes, if we consider more specialized scenarios.

9

slide-17
SLIDE 17

Sub-band Structured Basis

Definition

  • E(:,k) := FnE(:,k) – the DFT of the kth atom (column) in E

Jk – the support of E(:,k)

  • Jk – passband

ℓk := | Jk| – bandwidth DFTs of the atoms in E DFTs of some possible signals

  • E(:,1)
  • E(:,2)
  • E(:,3)

J1 J2 J3

  • Ey1
  • Ey2
  • Ey3

10

slide-18
SLIDE 18

Sub-band Structured Basis

Definition

  • E(:,k) := FnE(:,k) – the DFT of the kth atom (column) in E

Jk – the support of E(:,k)

  • Jk – passband

ℓk := | Jk| – bandwidth DFTs of the atoms in E DFTs of some possible signals

  • E(:,1)
  • E(:,2)
  • E(:,3)
  • J1
  • J2
  • J3
  • Ey1
  • Ey2
  • Ey3

10

slide-19
SLIDE 19

Sub-band Structured Basis

Definition

  • E(:,k) := FnE(:,k) – the DFT of the kth atom (column) in E

Jk – the support of E(:,k)

  • Jk – passband

ℓk := | Jk| – bandwidth

10

slide-20
SLIDE 20

BD with a Sub-band Structured Basis

Blind Deconvolution: given D, E, & z, find x & y

(:,1)

E

(:,2)

E

(:,3)

E Dx

(1)

y

(2)

y

(3)

y z = (Dx) (Ey)

Blind Gain and Phase Calibration

2

J

1

J

3

J

zi = ( Eφ) ⊙ (Axi), column of A – array response support of x – DOA structure of E – sensor groups entry of φ – gain and phase

  • E =

11

slide-21
SLIDE 21

BD with a Sub-band Structured Basis

Blind Deconvolution: given D, E, & z, find x & y

(:,1)

E

(:,2)

E

(:,3)

E Dx

(1)

y

(2)

y

(3)

y z = (Dx) (Ey)

Blind Gain and Phase Calibration

11

slide-22
SLIDE 22

BD with a Sub-band Structured Basis

Sufficient Conditions with (Essentially) Optimal Sample Complexities

  • E(:,1)
  • E(:,2)
  • E(:,3)

ℓ1 ℓ2 ℓ3 n

Theorem (Sub-band structured basis)

Suppose E forms a sub-band structured basis, x0 ∈ Cm1 is nonzero, and all the entries of y0 ∈ Cm2 are nonzero. If the sum of all the bandwidths satisfies (subspace constraints) m2

k=1 ℓk ≥ m1 + m2 − 1

(mixed constraints) m2

k=1 ℓk ≥ 2s1 + m2 − 1

then for almost all D ∈ Cn×m1, the pair (x0, y0) is identifiable up to scaling.

12

slide-23
SLIDE 23

Proof Sketch

Lemma [Y. Li, Lee, & Bresler, 2015] Identifiability in bilinear inverse problems: http://arxiv.org/abs/1501.06120

In (BD), the pair (x0, y0) (x0 = 0, y0 = 0) is identifiable up to scaling if and only if the following two conditions are met:

1

If there exists (x, y) ∈ ΩX × ΩY such that (Dx) ⊛ (Ey) = (Dx0) ⊛ (Ey0), then x = σx0 for some nonzero σ ∈ C.

2

If there exists y ∈ ΩY such that (Dx0) ⊛ (Ey) = (Dx0) ⊛ (Ey0), then y = y0. Condition 2 is easy to verify. Condition 1 relies on the following fact: If D is generic, and (x, y) ∈ ΩX × ΩY satisfies (Dx) ⊛ (Ey) = (Dx0) ⊛ (Ey0), then Px⊥

0 x = 0.

Hence x = σx0 for some scalar σ.

13

slide-24
SLIDE 24

BD with a Sub-band Structured Basis

Necessary Conditions with Optimal Sample Complexities

DFTs of the atoms in E

  • E(:,1)
  • E(:,2)
  • E(:,3)

n

Theorem (Necessary conditions)

If the supports Jk (1 ≤ k ≤ m2) partition the DFT frequency range, then (x0, y0) is identifiable up to scaling only if (subspace constraints) n ≥ m1 + m2 − 1 (mixed constraints) n ≥ s1 + m2 − 1

14

slide-25
SLIDE 25

BD with a Sub-band Structured Basis

Necessary Conditions with Optimal Sample Complexities

DFTs of the atoms in E

  • E(:,1)
  • E(:,2)
  • E(:,3)

n

Theorem (Necessary conditions)

If the supports Jk (1 ≤ k ≤ m2) partition the DFT frequency range, then (x0, y0) is identifiable up to scaling only if (subspace constraints) n ≥ m1 + m2 − 1 n ≥ m1 + m2 − 1 (mixed constraints) n ≥ s1 + m2 − 1 Necessary Conditions n ≥ 2s1 + m2 − 1 Sufficient Conditions

14

slide-26
SLIDE 26

Conclusions

The first algebraic sample complexities for unique blind deconvolution Generic bases or frames:

◮ Subspace constraints:

n ≥ m1m2

◮ Sparsity constraints:

n ≥ 2s1s2

◮ Mixed constraints:

n ≥ 2s1m2

A sub-band structured basis:

◮ Subspace constraints:

n ≥ m1 + m2 − 1 (optimal)

◮ Mixed constraints:

n ≥ 2s1 + m2 − 1 (nearly optimal)

Generic bases or frames ⇒ violated on a set of Lebesgue measure zero Journal version: http://arxiv.org/abs/1505.03399 Blind gain and phase calibration: http://arxiv.org/abs/1501.06120

15

slide-27
SLIDE 27

Thank you!

16

slide-28
SLIDE 28

References

  • A. Ahmed, B. Recht, and J. Romberg. Blind deconvolution using convex programming.

IEEE Trans. Inf. Theory, 60(3):1711–1732, Mar 2014.

  • Y. Chi. Guaranteed blind sparse spikes deconvolution via lifting and convex
  • ptimization. arXiv preprint arXiv:1506.02751, 2015.
  • S. Choudhary and U. Mitra. Sparse blind deconvolution: What cannot be done. In ISIT,

pages 3002–3006. IEEE, June 2014.

  • G. Harikumar and Y. Bresler. FIR perfect signal reconstruction from multiple

convolutions: minimum deconvolver orders. IEEE Trans. Signal Process., 46(1): 215–218, Jan 1998.

  • K. Lee, Y. Li, M. Junge, and Y. Bresler. Stability in blind deconvolution of sparse signals

and reconstruction by alternating minimization. SampTA, 2015.

  • Y. Li, K. Lee, and Y. Bresler. A unified framework for identifiability analysis in bilinear

inverse problems with applications to subspace and sparsity models. arXiv preprint arXiv:1501.06120, 2015.

  • S. Ling and T. Strohmer. Self-calibration and biconvex compressive sensing. arXiv

preprint arXiv:1501.06864, 2015.

17

slide-29
SLIDE 29

Proof Sketch

Lemma [Y. Li, Lee, & Bresler, 2015] Identifiability in bilinear inverse problems: http://arxiv.org/abs/1501.06120

In (BD), the pair (x0, y0) (x0 = 0, y0 = 0) is identifiable up to scaling if and only if the following two conditions are met:

1

If there exists (x, y) ∈ ΩX × ΩY such that (Dx) ⊛ (Ey) = (Dx0) ⊛ (Ey0), then x = σx0 for some nonzero σ ∈ C.

2

If there exists y ∈ ΩY such that (Dx0) ⊛ (Ey) = (Dx0) ⊛ (Ey0), then y = y0. Condition 2 is easy to verify. Condition 1 relies on the following fact: If D is generic, and (x, y) ∈ ΩX × ΩY satisfies (Dx) ⊛ (Ey) = (Dx0) ⊛ (Ey0), then diag( Ey) Dx = ( Dx) ⊙ ( Ey) = ( Dx0) ⊙ ( Ey0) = diag( Ey0) Dx0. Consider the passband Jk, k = 1, 2, · · · , m2, Px⊥

0 x ∈ x⊥

R( D(

Jk,:)∗)

  • x⊥

⊥ = x⊥

  • V⊥

k .

Hence Px⊥

0 x ∈ x⊥

  • V⊥

1

  • V⊥

2

  • · · ·
  • V⊥

m2.

18

slide-30
SLIDE 30

Proof Sketch

Px⊥

0 x ∈ x⊥

  • V⊥

1

  • V⊥

2

  • · · ·
  • V⊥

m2

For a generic matrix D, the subspaces V1, V2, · · · , Vm2 are generic subspaces of x⊥

0 ,

with dim(Vk) = ℓk − 1. If m2

k=1 ℓk ≥ m1 + m2 − 1, i.e., m2 k=1(ℓk − 1) ≥ m1 − 1, then m2

  • k=1

Vk = x⊥

0 ,

span(x0) +

m2

  • k=1

Vk = Cm1, Px⊥

0 x ∈

  • span(x0) +

m2

  • k=1

Vk ⊥ = {0}. Hence x = σx0 for some scalar σ.

19