Supere ffi cient estimation of the intensity of a stationary Poisson - PowerPoint PPT Presentation

Supere ffi cient estimation of the intensity of a stationary Poisson point process via the Stein method Marianne Clausel Joint work with Jean-Franc ¸ois Coeurjolly and J´ erˆ ome Lelong Laboratoire Jean Kuntzmann (LJK), Grenoble Alpes University

Back to the initial ideas of Charles Stein Considered as the father of di ff erent problems related to optimal estimation, stochastic calculus,.. .

Back to the initial ideas of Charles Stein Considered as the father of di ff erent problems related to optimal estimation, stochastic calculus,.. . Everything started in the following context Let X ∼ N ( θ, σ 2 I d ) where I d is the d -dimensional identity matrix. Objective: estimate θ based on a single (for simplicity) observation X .

Back to the initial ideas of Charles Stein Considered as the father of di ff erent problems related to optimal estimation, stochastic calculus,.. . Everything started in the following context Let X ∼ N ( θ, σ 2 I d ) where I d is the d -dimensional identity matrix. Objective: estimate θ based on a single (for simplicity) observation X . � θ − θ � 2 � θ mle = X minimizes MSE( � � � � θ ) = E among unbiased estimators.

Back to the initial ideas of Charles Stein Considered as the father of di ff erent problems related to optimal estimation, stochastic calculus,.. . Everything started in the following context Let X ∼ N ( θ, σ 2 I d ) where I d is the d -dimensional identity matrix. Objective: estimate θ based on a single (for simplicity) observation X . � θ − θ � 2 � θ mle = X minimizes MSE( � � � � θ ) = E among unbiased estimators. Stein (1956) � � � (1 − b ( a + X 2 i ) − 1 ) X i i = 1 ,..., d ⇒ MSE( � θ S ) ≤ MSE( � θ mle ) when d ≥ 3 θ S = James-Stein (1961) � θ JS = X (1 − ( d − 2) / � X � 2 ) ⇒ MSE( � θ JS ) ≤ MSE( � θ mle ) when d ≥ 3 θ = X + g ( X ), g : R d → R d . Stein (1981) key-ingredients for the class: �

θ = X + g ( X ) ( X ∼ N ( θ, σ 2 I d , σ 2 known) MSE of � MSE( � θ ) =

θ = X + g ( X ) ( X ∼ N ( θ, σ 2 I d , σ 2 known) MSE of � � d θ ) = E � X − θ � 2 + E � g ( X ) � 2 + 2 MSE( � E (( X i − θ i ) g i ( X )) i = 1

θ = X + g ( X ) ( X ∼ N ( θ, σ 2 I d , σ 2 known) MSE of � � d θ ) = E � X − θ � 2 + E � g ( X ) � 2 + 2 MSE( � E (( X i − θ i ) g i ( X )) i = 1 E[ Zh ( Z )] = E[ h ′ ( Z )], Z ∼ N (0 , 1) Using an IbP for Gaussian r.v. 1 � d θ mle ) + E � g ( X ) � 2 + 2 σ 2 MSE( � θ ) = MSE( � E ∇ g i ( X ) i = 1

θ = X + g ( X ) ( X ∼ N ( θ, σ 2 I d , σ 2 known) MSE of � � d θ ) = E � X − θ � 2 + E � g ( X ) � 2 + 2 MSE( � E (( X i − θ i ) g i ( X )) i = 1 E[ Zh ( Z )] = E[ h ′ ( Z )], Z ∼ N (0 , 1) Using an IbP for Gaussian r.v. 1 � d θ mle ) + E � g ( X ) � 2 + 2 σ 2 MSE( � θ ) = MSE( � E ∇ g i ( X ) i = 1 Now choose g = σ 2 ∇ log f . Use the well-known fact [based on 2 product and chain–rules ] that for h : R → R , (log( h ) ′ ) 2 + 2(log h ) ′′ =

θ = X + g ( X ) ( X ∼ N ( θ, σ 2 I d , σ 2 known) MSE of � � d θ ) = E � X − θ � 2 + E � g ( X ) � 2 + 2 MSE( � E (( X i − θ i ) g i ( X )) i = 1 E[ Zh ( Z )] = E[ h ′ ( Z )], Z ∼ N (0 , 1) Using an IbP for Gaussian r.v. 1 � d θ mle ) + E � g ( X ) � 2 + 2 σ 2 MSE( � θ ) = MSE( � E ∇ g i ( X ) i = 1 Now choose g = σ 2 ∇ log f . Use the well-known fact [based on 2 product and chain–rules ] that for h : R → R , √ h ) ′′ (log( h ) ′ ) 2 + 2(log h ) ′′ = 4 ( h . Get √  �    � ∇∇ f ( X )     MSE( � θ ) = MSE( �    ≤ MSE( � θ mle ) + 4 σ 2 E θ mle )  �  if ∇∇ f ≤ 0 .  f ( X )

θ = X + g ( X ) ( X ∼ N ( θ, σ 2 I d , σ 2 known) MSE of � � d θ ) = E � X − θ � 2 + E � g ( X ) � 2 + 2 MSE( � E (( X i − θ i ) g i ( X )) i = 1 E[ Zh ( Z )] = E[ h ′ ( Z )], Z ∼ N (0 , 1) Using an IbP for Gaussian r.v. 1 � d θ mle ) + E � g ( X ) � 2 + 2 σ 2 MSE( � θ ) = MSE( � E ∇ g i ( X ) i = 1 Now choose g = σ 2 ∇ log f . Use the well-known fact [based on 2 product and chain–rules ] that for h : R → R , √ h ) ′′ (log( h ) ′ ) 2 + 2(log h ) ′′ = 4 ( h . Get √  �    � ∇∇ f ( X )     MSE( � θ ) = MSE( �   ≤ MSE( �  θ mle ) + 4 σ 2 E θ mle )  �  if ∇∇ f ≤ 0 .  f ( X ) Goal : mimic both steps, derive a Stein estimator for the intensity of a Poisson point process [extension Privault-R´ eveillac (2009), d = 1]

Point processes S : Polish state space of the point process (equipped with the σ -algebra of Borel sets B ). A configuration of points is denoted x = { x 1 , · · · , x n , · · · } . For B ⊂ S : x B = x ∩ B . N lf : space of locally finite configurations, i.e. { x , n ( x B ) = | x B | < ∞} , ∀ B bounded ∈ S } equipped with N lf = σ ( { x ∈ N lf , n ( x B ) = m } ; B ∈ B , B bounded , m ≥ 1). Definition A point process X defined on S is a measurable application defined on some probability space ( Ω , F , P ) with values on N lf . Measurability of X ⇐⇒ N ( B ) = | X B | is a r.v. for any bounded B ∈ B .

Poisson point processes Definition of a Poisson point process with intensity ρ ( · ) . ∀ m ≥ 1, ∀ bounded and disjoint B 1 , · · · , B m ⊂ S , the r.v. X B 1 , · · · , X B m are independent. � N ( B ) ∼ P ( B ρ ( u ) du ) ∀ B ⊂ S , ∀ F ∈ N lf � � � � � n e − B ρ ( u ) du P ( X B ∈ F ) = · · · 1( { ( x 1 , · · · , x n ) ∈ F } ) ρ ( x i ) dx i n ! B B n ≥ 0 i = 1 Notation : X ∼ Poisson( S , ρ ).

Our framework Case considered here ρ ( u ) ≡ θ . X homogeneous Poisson point process with intensity θ . We assume observing X on W ⊂ R d . Given N ( W ) = n , we denote X 1 , . . . , X n the n points in W . The MLE estimate of the intensity θ is � θ = N ( W ) / | W | . Construction of a Stein estimator of θ ?

Poisson functionals S : space of Poisson functionals F defined on Ω by � F = f 0 1 ( N ( W ) = 0) + 1 ( N ( W ) = n ) f n ( X 1 , . . . , X n ) , n ≥ 1 f 0 ∈ R , f n : W n → R d measurable symmetric functions called form functions of F .

Towards a Stein estimator (1) θ mle = N ( W ) / | W | . MLE is defined by � θ mle + Aim : define � θ of the form � θ = � 1 | W | ζ where ζ = ∇ log( F ) . Relation MSE( � θ ) < MSE( � θ mle ) satisfied?  � 2  �     θ mle + 1   MSE( �  �  θ ) = E  | W |∇ log F − θ    � � 1 = MSE( � θ mle ) + E[( ∇ log F ) 2 ] + 2E[( ∇ log F )( N ( W ) − θ | W | )] | W | 2 = ⇒ Need to transform 2E[ G ( N ( W ) − θ | W | )] with G = ∇ log F using a IbP formula. Notion of derivative ?

Malliavin derivatives (1) Di ff erential operator: let π : W 2 → R d � � n D π x F = − 1 ( N ( W ) = n ) ( ∇ x i f n )( X 1 , . . . , X n ) π ( X i , x ) , n ≥ 1 i = 1 where S ′ = � F ∈ S : ’the f n are cont. di ff . in any variable x i ’ � and where ∇ x i f n gradient of x i �→ f n ( . . . , x i , . . . ).

Malliavin derivatives (2) Lemma [product and chain rules] For any x ∈ W , for all F , G ∈ S ′ , g ∈ C 1 b ( R ) we have D π x ( FG ) = ( D π x F ) G + F ( D π D π x g ( F ) = g ′ ( F ) D π x G ) and x F .

Malliavin derivatives (2) Lemma [product and chain rules] For any x ∈ W , for all F , G ∈ S ′ , g ∈ C 1 b ( R ) we have D π x ( FG ) = ( D π x F ) G + F ( D π D π x g ( F ) = g ′ ( F ) D π x G ) and x F . To get an IbP type formula, we need to introduce Dom( D π ) of S ′ as � F ∈ S ′ : ∀ n ≥ 1 and z 1 , . . . , z n ∈ R d Dom( D π ) = � f n + 1 � � z n + 1 ∈ ∂ W ( z 1 , . . . , z n + 1 ) = f n ( z 1 , . . . , z n ) , f 1 � � � � z ∈ ∂ W ( z ) = 0 , (1) Remark: compatibility conditions important to derive a correct Stein estimator.

Integration by parts formula Theorem π ), V : R d → R , V ∈ C 1 ( W , R d ) Let G ∈ Dom( D     ��   �               D π     E x G · V ( x )d x = E  F ∇ · V ( u ) − θ ∇ · V ( u )d u        W W � �� u ∈ X W : = ∇ π, V F � where V : W → R d is defined by V ( u ) = W V ( x ) π ( u , x )d x .

Integration by parts formula Theorem π ), V : R d → R , V ∈ C 1 ( W , R d ) Let G ∈ Dom( D     ��   �               D π     E x G · V ( x )d x = E  F ∇ · V ( u ) − θ ∇ · V ( u )d u        W W � �� u ∈ X W : = ∇ π, V F � where V : W → R d is defined by V ( u ) = W V ( x ) π ( u , x )d x . Main application: let π ( u , x ) = u ⊤ V ( x ), we can find some V ( omit details ) such that V ( u ) = u / d and ∇ · V ( u ) = 1. Then � � n ∇ G = ∇ π, V G = − 1 1 ( N ( W ) = n ) ∇ x i g n ( X 1 , . . . , X n ) · X i d n ≥ 1 i = 1 ⇒ E[ ∇ G ] = E [ G ( N ( W ) − θ | W | )] .

Supere ffi cient estimation of the intensity of a stationary Poisson - PowerPoint PPT Presentation

Supere ffi cient estimation of the intensity of a stationary Poisson point process via the Stein method Marianne Clausel Joint work with Jean-Franc ois Coeurjolly and J er ome Lelong Laboratoire Jean Kuntzmann (LJK), Grenoble Alpes

Lecture 3. Su ffi ciency Lecture 3. Su ffi ciency 1 (114) 3. Su ffi ciency 3.1. Su ffi cient

Immutability, or Putting the Dream Machine to Work The trie memory scheme is ine ffi cient for

Immutability, or Putting the Dream Machine to Work The trie memory scheme is ine ffi cient for

An E ffi cient A ffi ne-Scaling Algorithm for Hyperbolic Programming Jim Renegar joint work

FFI The good, the bad and the ugly Esteban Lorenzano (The Pharo firefighter) Current status of

15 E ffi cient mesh models Steve Marschner CS5625 Spring 2020 Follows chapter 16 in RTR 4e Basics

New tools for intensity interferometry New tools for intensity interferometry First experiment in

Taming the C Monster Haskell FFI Techniques Fraser Tweedale @hackuador May 22, 2018 FFI basics

Solid State Drive Based Energy E ffi cient Cloud Storage Jesus Ramos Alexis Je ff erson Ti ff any

E ffi cient, Cost E ff ective and Sustainable Self-Delivery of Asphalt for Small Works 1

A Large Scale Study of the Small Sample Performance of Random Coe ffi cient Models of Demand

E ffi cient and Incentive-Compatible Liver Exchange Haluk Ergin Tayfun Snmez M. Utku nver U

Point-Voxel CNN for E ffi cient 3D Deep Learning Zhijian Liu* , Haotian Tang* , Yujun Lin , and

E ffi cient use of semidefinite programming for the selection of rotamers in protein conformation

Tips on Writing Papers with Mathematical Content John N. Tsitsiklis May 2019

Ti e E ffi cient Server Audit Problem, Deduplicated Re-execution, and the Web Cheng Tan, Lingfan

Lacc el eration de Nesterov est-elle vraiment une acc el eration ? cois Aujol 1

Algebraic Techniques in Cryptanalysis of Block Ciphers with a bias towards Gr obner bases

POLYNOMIALS, QUADRATIC FORMS AND THE TOPOLOGY OF MANIFOLDS For Erik Kjr Pedersen Andrew

Strategy Synthesis for Multi-dimensional Quantitative Objectives Krishnendu Chatterjee 1 Mickael

Generalizing Vlus formulas and some applications ECC 2010 Romain Coss et 1 , David L ubicz

FALL 2018 PRISME ForumTechnical Meeting PRISME Forum Chair: Olivier Gien VP, Global Head,

Strategy Synthesis for Quantitative Objectives Krishnendu Chatterjee 1 Mickael Randour 2 cois

Numerical resolution of McKean-Vlasov Forward Backward Stochastic Differential Equations using