
slide-1
SLIDE 1

Unbiased Risk Estimation as Parameter Choice Rule for Filter-based Regularization Methods

Frank Werner¹

Statistical Inverse Problems in Biophysics Group, Max Planck Institute for Biophysical Chemistry, Göttingen, and Felix Bernstein Institute for Mathematical Statistics in the Biosciences, University of Göttingen

Chemnitz Symposium on Inverse Problems 2017 (on Tour in Rio)

¹joint work with Housen Li

Frank Werner, MPIbpC Göttingen. Unbiased Risk Estimation, October 30, 2017. Slide 1 / 34

slide-2
SLIDE 2

Outline

1 Introduction
2 A posteriori parameter choice methods
3 Error analysis
4 Simulations
5 Conclusion

slide-3
SLIDE 3

Introduction

slide-4
SLIDE 4

Introduction

Statistical inverse problems

Setting: X, Y Hilbert spaces, T : X → Y bounded, linear

Task: Recover the unknown f ∈ X from noisy measurements Y = Tf + σξ

Noise: ξ is a standard Gaussian white noise process, σ > 0 the noise level

The model has to be understood in a weak sense:

⟨Y, g⟩ := ⟨Tf, g⟩_Y + σ⟨ξ, g⟩ for all g ∈ Y,

with ⟨ξ, g⟩ ∼ N(0, ‖g‖²_Y) and E[⟨ξ, g₁⟩ ⟨ξ, g₂⟩] = ⟨g₁, g₂⟩_Y.

slide-8
SLIDE 8

Introduction

Statistical inverse problems

Assumptions:
  • T is injective and Hilbert-Schmidt (∑ σ²_k < ∞, σ_k singular values)
  • σ is known exactly

As the problem is ill-posed, regularization is needed. Consider filter-based regularization schemes

f̂_α := q_α(T*T) T*Y,  α > 0.

Aim: an a posteriori choice of α such that the rate of convergence (as σ ↘ 0) is order optimal (no loss of log-factors).

Note: Heuristic parameter choice rules might work here as well, as the Bakushinskiĭ veto does not hold in our setting (Becker ’11).
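In a discretized setting the filter estimator is a one-liner once an SVD of T is available. The following is a minimal sketch, not the talk's code: the diagonal toy operator, the function names, and the choice of the Tikhonov filter q_α(λ) = 1/(λ + α) are all illustrative assumptions.

```python
import numpy as np

def filter_estimate(U, s, Vt, Y, alpha):
    """f_alpha = q_alpha(T*T) T* Y with the Tikhonov filter q_alpha(lam) = 1/(lam + alpha).

    U, s, Vt: SVD of the discretized forward operator T = U @ diag(s) @ Vt.
    In the singular basis this is q_alpha(s_k^2) * s_k * <Y, u_k> * v_k.
    """
    lam = s ** 2                                      # spectrum of T*T
    return Vt.T @ ((s / (lam + alpha)) * (U.T @ Y))

# toy mildly ill-posed example: diagonal operator with sigma_k = 1/k
rng = np.random.default_rng(0)
n = 50
T = np.diag(1.0 / np.arange(1.0, n + 1))
U, s, Vt = np.linalg.svd(T)
f_true = 1.0 / np.arange(1.0, n + 1)
sigma = 1e-3
Y = T @ f_true + sigma * rng.standard_normal(n)

f_hat = filter_estimate(U, s, Vt, Y, alpha=1e-4)
```

The filter replaces the unbounded inversion factor 1/s_k by the bounded factor s_k/(s_k² + α), which is what tames the noise amplification.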

slide-12
SLIDE 12

A posteriori parameter choice methods

slide-13
SLIDE 13

A posteriori parameter choice methods

The discrepancy principle

  • For deterministic data: α_DP = max { α > 0 : ‖T f̂_α − Y‖_Y ≤ τσ }
  • But here: Y ∉ Y! Either pre-smoothing (Z := T*Y ∈ X) ...
  • ... or discretization: Y ∈ Rⁿ, ξ ∼ N_n(0, I_n), and choose
    α_DP = max { α > 0 : ‖T f̂_α − Y‖₂ ≤ τσ√n }

Pros:
  • Easy to implement
  • Works for all q_α
  • Order-optimal convergence rates

Cons:
  • How to choose τ ≥ 1?
  • Only meaningful after discretization
  • Early saturation

Davies & Anderssen ’86, Lukas ’95, Blanchard, Hoffmann & Reiß ’16
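Over a candidate grid, α_DP can be found by scanning from large to small α and testing the discretized discrepancy ‖T f̂_α − Y‖₂ ≤ τσ√n. A hedged sketch for the Tikhonov filter on an assumed toy diagonal operator (names and grid are illustrative):

```python
import numpy as np

def alpha_discrepancy(T, Y, sigma, alphas, tau=1.1):
    """Largest candidate alpha with ||T f_alpha - Y||_2 <= tau * sigma * sqrt(n)."""
    n = len(Y)
    U, s, Vt = np.linalg.svd(T)
    for alpha in sorted(alphas, reverse=True):        # try the largest alpha first
        f_alpha = Vt.T @ ((s / (s ** 2 + alpha)) * (U.T @ Y))
        if np.linalg.norm(T @ f_alpha - Y) <= tau * sigma * np.sqrt(n):
            return alpha
    return min(alphas)                                # discrepancy level never reached

rng = np.random.default_rng(0)
n = 50
T = np.diag(1.0 / np.arange(1.0, n + 1))
f_true = 1.0 / np.arange(1.0, n + 1)
sigma = 1e-3
Y = T @ f_true + sigma * rng.standard_normal(n)
alphas = np.logspace(-10, 0, 40)
a_dp = alpha_discrepancy(T, Y, sigma, alphas)
```

Scanning from large α down implements the "max" in the definition: the first candidate that meets the discrepancy level is the largest one that does.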

slide-17
SLIDE 17

A posteriori parameter choice methods

The quasi-optimality criterion

  • Neubauer ’08 (r_α(λ) = 1 − λq_α(λ)): α_QO = argmin_{α>0} ‖r_α(T*T) f̂_α‖_X
  • But for spectral cut-off, r_α(T*T) f̂_α = 0 for all α > 0
  • Alternative formulation for Tikhonov regularization if candidates α_1 < ... < α_m are given:
    n_QO = argmin_{1≤n≤m−1} ‖f̂_{α_n} − f̂_{α_{n+1}}‖_X,  α_QO := α_{n_QO}.

Pros:
  • Easy to implement, very fast
  • No knowledge of σ necessary
  • Order-optimal convergence rates in mildly ill-posed situations

Cons:
  • Only for special q_α
  • Additional assumptions on noise and/or f necessary
  • Performance unclear in severely ill-posed situations

Bauer & Kindermann ’08, Bauer & Reiß ’08, Bauer & Kindermann ’09
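The grid formulation above translates directly into code: compute the Tikhonov estimates on the candidate grid and minimize the distance between consecutive ones. A minimal sketch on an assumed toy diagonal operator (all names are illustrative):

```python
import numpy as np

def alpha_quasi_optimality(T, Y, alphas):
    """alpha_QO = alpha_n minimizing ||f_{alpha_n} - f_{alpha_{n+1}}||
    over an increasing candidate grid (Tikhonov regularization)."""
    U, s, Vt = np.linalg.svd(T)
    est = [Vt.T @ ((s / (s ** 2 + a)) * (U.T @ Y)) for a in alphas]
    diffs = [np.linalg.norm(est[i] - est[i + 1]) for i in range(len(est) - 1)]
    return alphas[int(np.argmin(diffs))]

rng = np.random.default_rng(0)
n = 50
T = np.diag(1.0 / np.arange(1.0, n + 1))
f_true = 1.0 / np.arange(1.0, n + 1)
Y = T @ f_true + 1e-3 * rng.standard_normal(n)
alphas = np.logspace(-10, 0, 40)        # alpha_1 < ... < alpha_m
a_qo = alpha_quasi_optimality(T, Y, alphas)
```

Note that σ is never used, which is the criterion's main practical appeal.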

slide-21
SLIDE 21

A posteriori parameter choice methods

The Lepskiĭ-type balancing principle

  • For given α, the standard deviation of f̂_α can be bounded by std(α) := σ √(Tr(q_α(T*T)² T*T))
  • If candidates α_1 < ... < α_m are given:
    n_LEP = max { j : ‖f̂_{α_j} − f̂_{α_k}‖_X ≤ 4κ std(α_k) for all 1 ≤ k ≤ j }
    and α_LEP = α_{n_LEP}

Pros:
  • Works for all q_α
  • Robust in practice
  • Convergence rates (mildly / severely ill-posed)

Cons:
  • Computationally expensive
  • κ ≥ 1 depends on the decay of σ_k
  • Loss of a log factor compared to the order-optimal rate

Bauer & Pereverzev ’05, Mathé ’06, Mathé & Pereverzev ’06
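Both std(α) and the balancing condition are computable in the singular basis. A sketch for the Tikhonov filter with κ = 1 on an assumed toy operator; the value of κ and the setup are illustrative choices, not the talk's:

```python
import numpy as np

def alpha_lepskii(T, Y, sigma, alphas, kappa=1.0):
    """Lepskii balancing with std(alpha) = sigma * sqrt(Tr(q_alpha(T*T)^2 T*T)).

    Returns alpha_{n_LEP}, where n_LEP is the largest j with
    ||f_{alpha_j} - f_{alpha_k}|| <= 4*kappa*std(alpha_k) for all k <= j.
    """
    U, s, Vt = np.linalg.svd(T)
    lam = s ** 2
    est, std = [], []
    for a in alphas:                        # increasing grid alpha_1 < ... < alpha_m
        q = 1.0 / (lam + a)                 # Tikhonov filter
        est.append(Vt.T @ (q * s * (U.T @ Y)))
        std.append(sigma * np.sqrt(np.sum(q ** 2 * lam)))
    n_lep = 0
    for j in range(len(alphas)):
        if all(np.linalg.norm(est[j] - est[k]) <= 4 * kappa * std[k]
               for k in range(j + 1)):
            n_lep = j
    return alphas[n_lep]

rng = np.random.default_rng(0)
n = 50
T = np.diag(1.0 / np.arange(1.0, n + 1))
f_true = 1.0 / np.arange(1.0, n + 1)
sigma = 1e-3
Y = T @ f_true + sigma * rng.standard_normal(n)
alphas = np.logspace(-10, 0, 40)
a_lep = alpha_lepskii(T, Y, sigma, alphas)
```

The quadratic number of pairwise comparisons over the grid is exactly the "computationally expensive" drawback listed above.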

slide-23
SLIDE 23

A posteriori parameter choice methods

Unbiased risk estimation

  • Dating back to ideas of Mallows ’73 and Stein ’81, let
    r̂(α, Y) := ‖T f̂_α‖²_Y − 2⟨T f̂_α, Y⟩ + 2σ² Tr(T*T q_α(T*T))
    and choose α_URE = argmin_{α>0} r̂(α, Y)
  • Note that E[r̂(α, Y)] = E[‖T f̂_α − Tf‖²_Y] − c with c independent of α (unbiased risk estimation)

For spectral cut-off and in mildly ill-posed situations, this gives order-optimal rates (Chernousova & Golubev ’14). Besides this, only optimality in the image space is known (Li ’87, Lukas ’93, Kneip ’94). Distributional behavior of α_URE: Lucka et al. ’17.

In general: Pros? Cons? Convergence rates? Order optimality?
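r̂(α, Y) is explicit once an SVD is available, so α_URE can be approximated by grid minimization. A minimal sketch with the Tikhonov filter on an assumed toy operator; note that r̂ may be negative, since it estimates the risk only up to the α-independent constant c:

```python
import numpy as np

def ure(T, Y, sigma, alpha, U, s, Vt):
    """r_hat(alpha, Y) = ||T f_alpha||^2 - 2<T f_alpha, Y> + 2 sigma^2 Tr(T*T q_alpha(T*T))."""
    lam = s ** 2
    q = 1.0 / (lam + alpha)                 # Tikhonov filter
    f_alpha = Vt.T @ (q * s * (U.T @ Y))
    Tf = T @ f_alpha
    return Tf @ Tf - 2.0 * (Tf @ Y) + 2.0 * sigma ** 2 * np.sum(lam * q)

rng = np.random.default_rng(0)
n = 50
T = np.diag(1.0 / np.arange(1.0, n + 1))
U, s, Vt = np.linalg.svd(T)
f_true = 1.0 / np.arange(1.0, n + 1)
sigma = 1e-3
Y = T @ f_true + sigma * rng.standard_normal(n)

alphas = np.logspace(-10, 0, 60)
risks = np.array([ure(T, Y, sigma, a, U, s, Vt) for a in alphas])
alpha_ure = alphas[int(np.argmin(risks))]
```

The trace penalty 2σ² Tr(T*T q_α(T*T)) compensates for the data term being evaluated on the same Y used to build f̂_α, which is what makes the estimator unbiased up to c.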

slide-27
SLIDE 27

Error analysis

slide-28
SLIDE 28

Error analysis: A priori parameter choice

Assumptions

Filter:
  α |q_α(λ)| ≤ C′_q  and  λ |q_α(λ)| ≤ C′′_q.

Source condition:
  W_φ := { f ∈ X : f = φ(T*T)w, ‖w‖_X ≤ C }.
Note: for any f ∈ X there exists φ such that f ∈ W_φ.

Qualification condition:
  The function φ is a qualification of q_α if
  sup_{λ∈[0,‖T*T‖]} φ(λ) |1 − λq_α(λ)| ≤ C_φ φ(α)

slide-29
SLIDE 29

Error analysis: A priori parameter choice

Assumptions

Let Σ(x) := #{ k : σ²_k ≥ x } be the counting function of the singular values of T.

Approximation by a smooth surrogate: there exist S ∈ C², α₁ ∈ (0, ‖T*T‖] and C_S ∈ (0, 2) such that
(1) lim_{α↘0} S(α)/Σ(α) = 1 (approximation)
(2) S′ < 0 (decreasing)
(3) lim_{α↗∞} S(α) = lim_{α↗∞} S′(α) = 0 (behavior above σ²₁)
(4) lim_{α↘0} αS(α) = 0 (Hilbert-Schmidt)
(5) αS′(α) is integrable on (0, α₁]
(6) S′′(α)/(−S′(α)) ≤ C_S/α on (0, α₁]

slide-30
SLIDE 30

Error analysis: A priori parameter choice

A priori convergence rates

Bissantz, Hohage, Munk, Ruymgaart ’07

Let α* solve α* φ(α*)² = σ² S(α*).
(i) If φ is a qualification of q_α, then
  sup_{f ∈ W_φ} E[‖f̂_{α*} − f‖²_X] ≲ φ(α*)² = σ² S(α*)/α*  as σ ↘ 0.
(ii) If λ ↦ √λ φ(λ) is a qualification of the filter q_α, then
  sup_{f ∈ W_φ} E[‖T f̂_{α*} − Tf‖²_Y] ≲ α* φ(α*)² = σ² S(α*)  as σ ↘ 0.

slide-31
SLIDE 31

Error analysis: A priori parameter choice

Mildly ill-posed situation: Example

Assume σ²_k ≍ k^{−a}, W_b := { f ∈ X : ∑_{k=1}^∞ k^b f²_k ≤ 1 } with a > 1, b > 0:

Bissantz, Hohage, Munk, Ruymgaart ’07

Let α* ≍ (σ²)^{a/(a+b+1)}.
  • If φ(λ) = λ^{b/2a} is a qualification of q_α, then
    sup_{f ∈ W_b} E[‖f̂_{α*} − f‖²_X] ≲ (σ²)^{b/(a+b+1)}.
  • If φ(λ) = λ^{b/2a+1/2} is a qualification of q_α, then
    sup_{f ∈ W_b} E[‖T f̂_{α*} − Tf‖²_Y] ≲ (σ²)^{(a+b)/(a+b+1)}.

These rates are order optimal over W_b.

slide-32
SLIDE 32

Error analysis: Unbiased risk estimation as parameter choice

Unbiased risk estimation vs. the oracle

Recall that
  r̂(α, Y) := ‖T f̂_α‖²_Y − 2⟨T f̂_α, Y⟩ + 2σ² Tr(T*T q_α(T*T))
is an unbiased estimator for r(α, f) := E[‖T f̂_α − Tf‖²_Y].

In the following, we will compare
  α_URE = argmin_{α>0} r̂(α, Y)  and  α_o = argmin_{α>0} r(α, f).

slide-33
SLIDE 33

Error analysis: Unbiased risk estimation as parameter choice

Additional assumptions

(a) α ↦ {q_α(σ²_k)}_{k=1}^∞ is strictly monotone and continuous as a map R → ℓ².
(b) As α ↘ 0, αq_α(α) ≥ c_q > 0.
(c) For α > 0, the function λ ↦ λq_α(λ) is non-decreasing.

Satisfied by Tikhonov, spectral cut-off, Landweber, iterated Tikhonov and Showalter regularization, under proper parametrization. E.g. Tikhonov with re-parametrization α ↦ √α (q_α(λ) = 1/(√α + λ)) violates (b).

(d) ψ(λ) := λ φ⁻¹(√λ) is convex.
(e) There exists a constant C_q > c_q⁻² such that
  ∫₁^∞ Ψ′(C_q x) exp(−C x/2) dx < ∞
with Ψ(x) := x (S⁻¹(x))², for some explicitly known C > 0.

(d) can always be satisfied by weakening φ; (e) restricts the decay of the singular values.

slide-37
SLIDE 37

Error analysis: Unbiased risk estimation as parameter choice

Oracle inequality

Li & W. ’16: There are positive constants C_i, i = 1, ..., 6, such that for all f ∈ W_φ it holds

E[‖f̂_{α_URE} − f‖²_X] ≤ C₁ ψ⁻¹(C₂ r(α_o, f) + C₃σ²) + C₄σ² + C₅ (r(α_o, f) + σ √(r(α_o, f))) / S⁻¹(C₆ r(α_o, f)/σ²)

as σ ↘ 0.

Gives a comparison of the strong risk under α_URE with the weak risk under the oracle α_o.

slide-38
SLIDE 38

Error analysis: Unbiased risk estimation as parameter choice

Convergence rates

Li & W. ’16: If also λ ↦ √λ φ(λ) is a qualification of the filter q_α, then for α* solving α* φ(α*)² = σ² S(α*) there are C₁, C₂, C₃ > 0 such that

sup_{f ∈ W_φ} E[‖f̂_{α_URE} − f‖²_X] ≤ C₁ σ² S(α*)/α* + C₂ σ² S(α*) / S⁻¹(C₃ S(α*))  as σ ↘ 0.

If there is C₄ > 0 such that S(C₄x) ≥ C₃S(x), then this equals the a priori rate

sup_{f ∈ W_φ} E[‖f̂_{α_URE} − f‖²_X] ≲ φ(α*)² = σ² S(α*)/α*.

slide-40
SLIDE 40

Error analysis: Unbiased risk estimation as parameter choice

Order optimality in mildly ill-posed situations

Assume σ²_k ≍ k^{−a}, W_b := { f ∈ X : ∑_{k=1}^∞ k^b f²_k ≤ 1 } with a > 1, b > 0:

Oracle inequality: for all f ∈ W_b,
  E[‖f̂_{α_URE} − f‖²_X] ≲ r(α_o, f)^{b/(a+b)} + σ^{−2a} r(α_o, f)^{1+a} + σ^{1−2a} r(α_o, f)^{(1+2a)/2}.

Convergence rate: thus, if λ ↦ λ^{b/2a+1/2} is a qualification of q_α, then
  sup_{f ∈ W_b} E[‖f̂_{α_URE} − f‖²_X] ≲ σ^{2b/(a+b+1)},
which is order-optimal.

slide-42
SLIDE 42

Error analysis: Unbiased risk estimation as parameter choice

Unbiased risk estimation: pros and cons

α_URE = argmin_{α>0} ( ‖T f̂_α‖²_Y − 2⟨T f̂_α, Y⟩ + 2σ² Tr(T*T q_α(T*T)) )

Pros:
  • Works for many q_α
  • Order-optimal convergence rates in mildly ill-posed situations
  • No loss of a log factor
  • No tuning parameter

Cons:
  • Computationally expensive
  • Early saturation
  • Performance in severely ill-posed situations unclear

H. Li and F. Werner (2017). Empirical risk minimization as parameter choice rule for general linear regularization methods. Submitted, arXiv: 1703.07809.

slide-43
SLIDE 43

Simulations

slide-44
SLIDE 44

Simulations: Rates of convergence

A mildly ill-posed situation: antiderivative

Let T : L²([0, 1]) → L²([0, 1]) be given by
  (Tf)(x) = ∫₀¹ min{x(1 − y), y(1 − x)} f(y) dy

As (Tf)″ = −f, the singular values σ_k of T satisfy σ_k ≍ k⁻².

We choose
  f(x) = x if 0 ≤ x ≤ 1/2,  1 − x if 1/2 ≤ x ≤ 1.

Fourier coefficients: f_k = (−1)^{k−1}/(4π³k²), so the optimal rate is O(σ^{3/4−ε}) for any ε > 0.
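The kernel min{x(1 − y), y(1 − x)} is the Green's function of −d²/dx² with Dirichlet boundary conditions, whose eigenvalues are (πk)⁻², so the claim σ_k ≍ k⁻² can be checked numerically. A sketch using a midpoint-rule discretization (the grid size is an arbitrary choice, not from the talk):

```python
import numpy as np

n = 200
x = (np.arange(n) + 0.5) / n                           # midpoint grid on [0, 1]
X, Ygrid = np.meshgrid(x, x, indexing="ij")
K = np.minimum(X * (1 - Ygrid), Ygrid * (1 - X)) / n   # kernel times quadrature weight 1/n
s = np.linalg.svd(K, compute_uv=False)

# Green's function of -d^2/dx^2 with Dirichlet BC: sigma_k = 1 / (pi k)^2
rel_err_1 = abs(s[0] * np.pi ** 2 - 1.0)               # compare s_1 with 1/pi^2
rel_err_10 = abs(s[9] * (10 * np.pi) ** 2 - 1.0)       # compare s_10 with 1/(10 pi)^2
```

The agreement degrades for higher modes, as expected for a fixed-grid quadrature.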

slide-48
SLIDE 48

Simulations: Rates of convergence

A mildly ill-posed situation: Tikhonov regularization

Figure: Empirical MISE (against the reference slope σ^{3/4}) and empirical variance of ‖f̂ − f‖²₂ over 10⁴ repetitions, for α_o, α_DP, α_QO, α_LEP, α_URE.

slide-49
SLIDE 49

Simulations: Rates of convergence

A severely ill-posed situation: satellite gradiometry

Let R > 1 and S ⊂ R² be the unit sphere. Given g = ∂²u/∂r² on RS, find f in
  Δu = 0 in R^d \ B,
  u = f on S,
  |u(x)| = O(‖x‖₂⁻¹) as ‖x‖₂ → ∞.

The corresponding T : L²(S, μ) → L²(RS, μ) has singular values σ_k = |k| (|k| + 1) R^{−|k|−2}.

We choose f(x) = π/2 − |x|, x ∈ [−π, π].

The optimal rate of convergence is O((− log σ)^{−3+ε}) for any ε > 0.

slide-53
SLIDE 53

Simulations: Rates of convergence

A severely ill-posed situation: Tikhonov regularization

Figure: Empirical MISE (against the reference slope (− log σ)⁻³) and empirical variance of ‖f̂ − f‖²₂ over 10⁴ repetitions, for α_o, α_DP, α_QO, α_LEP, α_URE.

slide-54
SLIDE 54

Simulations: Rates of convergence

A severely ill-posed situation: backwards heat equation

Let t̄ > 0. Given g = u(·, t̄), find f in
  ∂u/∂t(x, t) = ∂²u/∂x²(x, t) in (−π, π] × (0, t̄),
  u(x, 0) = f(x) on [−π, π],
  u(−π, t) = u(π, t) for t ∈ (0, t̄].

The corresponding T : L²([−π, π]) → L²([−π, π]) has singular values σ_k = exp(−k² t̄).

We choose f(x) = π/2 − |x|, x ∈ [−π, π].

The optimal rate of convergence is O((− log σ)^{−3/2+ε}) for any ε > 0.

slide-58
SLIDE 58

Simulations: Rates of convergence

A severely ill-posed situation: Tikhonov regularization

Figure: Empirical MISE (against the reference slope (− log σ)^{−3/2}) and empirical variance of ‖f̂ − f‖²₂ over 10⁴ repetitions, for α_o, α_DP, α_QO, α_LEP, α_URE.

slide-59
SLIDE 59

Simulations: Efficiency simulations

Efficiency simulations

Measure the efficiency of a parameter choice rule α* by the fraction
  R* := E[‖f̂_{α_o} − f‖²_X] / E[‖f̂_{α*} − f‖²_X]

Numerical approximations of these as functions of σ with different parameters a, ν > 0 in the following setting:
  • σ_k = exp(−ak)
  • f_k = ±k^{−ν} · (1 + N(0, 0.1²))
  • Y_k = σ_k · f_k + N(0, σ²)
  • k = 1, ..., 300, 10⁴ repetitions
  • Tikhonov regularization
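This sequence-space setting is cheap to reproduce. The sketch below runs a small Monte Carlo (50 repetitions instead of the talk's 10⁴, alternating signs assumed for the ± in f_k, and one fixed pair a, ν) and computes the efficiency R_URE of unbiased risk estimation against the oracle:

```python
import numpy as np

rng = np.random.default_rng(0)
a, nu, sigma, m = 0.3, 1.0, 1e-4, 300
k = np.arange(1.0, m + 1)
sk = np.exp(-a * k)                               # singular values sigma_k = exp(-a k)
alphas = np.logspace(-16, 0, 80)

def tikhonov(Y, alpha):
    return sk / (sk ** 2 + alpha) * Y             # Tikhonov in the diagonal sequence model

err_oracle, err_ure = [], []
for _ in range(50):                               # 50 repetitions (the talk used 10^4)
    f = (-1.0) ** k * k ** (-nu) * (1 + 0.1 * rng.standard_normal(m))
    Y = sk * f + sigma * rng.standard_normal(m)
    losses, ure_vals = [], []
    for al in alphas:
        fa = tikhonov(Y, al)
        g = sk * fa                               # T f_alpha in the singular basis
        losses.append(np.sum((fa - f) ** 2))      # true loss, known only in simulation
        ure_vals.append(g @ g - 2 * (g @ Y)
                        + 2 * sigma ** 2 * np.sum(sk ** 2 / (sk ** 2 + al)))
    err_oracle.append(min(losses))                # oracle picks the loss-minimizing alpha
    err_ure.append(losses[int(np.argmin(ure_vals))])

R_ure = np.mean(err_oracle) / np.mean(err_ure)    # efficiency in (0, 1]
```

By construction R* ≤ 1, and the gap to 1 quantifies how much is lost relative to the (unavailable) oracle choice.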

slide-65
SLIDE 65

Simulations: Efficiency simulations

Efficiency simulations: results

Figure: R_QO, R_DP, R_LEP, and R_URE as functions of σ for a = 0.2, ν = 1 (left) and a = 0.3, ν = 1 (right).

slide-66
SLIDE 66

Simulations: Efficiency simulations

Efficiency simulations: results

Figure: R_QO, R_DP, R_LEP, and R_URE as functions of σ for a = 0.4, ν = 1 (left) and a = 0.6, ν = 1 (right).

slide-67
SLIDE 67

Simulations: Efficiency simulations

Efficiency simulations: results

Figure: R_QO, R_DP, R_LEP, and R_URE as functions of σ for a = 0.3, ν = 3 (left) and a = 0.3, ν = 5 (right).

slide-68
SLIDE 68

Conclusion

slide-69
SLIDE 69

Conclusion

Presented results

  • Analysis of a parameter choice based on unbiased risk estimation:
    • oracle inequality
    • convergence rates
    • order optimality in mildly ill-posed situations
  • Numerical comparison:
    • in this specific setting, quasi-optimality outperforms all other methods
    • unbiased risk estimation has higher variance (by design)
    • simulations suggest order optimality of quasi-optimality also in severely ill-posed situations; not clear for unbiased risk estimation

Thank you for your attention!