Topics in U-statistics and Risk Estimation Qing Wang and Bruce G. - PowerPoint PPT Presentation

Outline Introduction Unbiased Variance Estimator Implementation in Risk Estimation Future Work Reference Topics in U-statistics and Risk Estimation Qing Wang and Bruce G. Lindsay March 17, 2011 Qing Wang and Bruce G. Lindsay Department of Statistics,Pennsylvania State University Topics in U-statistics and Risk Estimation

Outline Introduction Unbiased Variance Estimator Implementation in Risk Estimation Future Work Reference Outline Introduction of U-statistics Proposed Unbiased Variance Estimator Implementation in Risk Estimation Future Work Reference Qing Wang and Bruce G. Lindsay Department of Statistics,Pennsylvania State University Topics in U-statistics and Risk Estimation

Outline Introduction Unbiased Variance Estimator Implementation in Risk Estimation Future Work Reference Suppose F is a p -variate distribution function ( p ∈ N + ), denoted as F ( x ) = F ( x (1) , ..., x ( p ) ). We are considering a parameter of interest θ which can be written as a functional of F with the following form: � � θ ( F ) = ... K ( x 1 , x 2 , ..., x m ) dF ( x 1 ) dF ( x 2 ) ... dF ( x m ) where x 1 , ..., x m are all p -variate and K is a symmetric function of m arguments. Given a sample of size n ( n ≥ m , m =size of the kernel) from F , K ( X 1 , X 2 , ..., X m ) is an unbiased estimate of the parameter θ . However, intuition reminds us that there should be some better estimators, since K ( X 1 , X 2 , ..., X m ) does not use up the entire data set. Qing Wang and Bruce G. Lindsay Department of Statistics,Pennsylvania State University Topics in U-statistics and Risk Estimation

Outline Introduction Unbiased Variance Estimator Implementation in Risk Estimation Future Work Reference Definition (Hoeffding (1948)) Let X 1 , X 2 , ..., X n be i.i.d. random variables (vectors) and K ( x 1 , ..., x m ) be a symmetric real-valued function of m arguments, then a U-statistic is defined as: 1 � U n = K ( X i 1 , ..., X i m ) � n � m 1 ≤ i 1 <...< i m ≤ n The unbiasedness of U n follows from the unbiasedness of K . It can be seen that U n is a function of order statistics (which is a set of sufficient statistics). When we are doing nonparametric inference, the set of order statistics is a complete sufficient statistic if the underlying distribution family is large enough (Fraser (1954)). Therefore, a U-statistic is the best unbiased estimator in this context by Rao-Blackwell Theorem. Qing Wang and Bruce G. Lindsay Department of Statistics,Pennsylvania State University Topics in U-statistics and Risk Estimation

Outline Introduction Unbiased Variance Estimator Implementation in Risk Estimation Future Work Reference Example in Risk Estimation As a motivating problem, we consider risk estimation in the context of nonparametric kernel density estimation. Consider a probability density function of a continuous random variable X , denoted as f ( x ). Consider the most visible and used density estimation method, the nonparametric kernel density estimator which is defined for a kernel K as n f h ( x ) = 1 ˆ � K h ( x − X i ) , n i =1 where x ∈ R , h > 0, and K h ( t ) = 1 h K ( t / h ). Qing Wang and Bruce G. Lindsay Department of Statistics,Pennsylvania State University Topics in U-statistics and Risk Estimation

Outline Introduction Unbiased Variance Estimator Implementation in Risk Estimation Future Work Reference One option to select the “optimal” bandwidth is to compute a risk function which measures the “average distance” between ˆ f h ( x ) and f ( x ) in a certain fashion, and the best bandwidth h ⋆ is considered as the one that yields the smallest risk score. In practice, given a dataset one estimates the risk function and uses ˆ h ⋆ . As a U-statistic is the best unbiased estimator for nonparametric inferences, we would like to construct a U-statistic form estimator for the risk that arises from L 2 loss function. Qing Wang and Bruce G. Lindsay Department of Statistics,Pennsylvania State University Topics in U-statistics and Risk Estimation

Outline Introduction Unbiased Variance Estimator Implementation in Risk Estimation Future Work Reference U-statistic Form L 2 Risk Estimator 2 π e − ( x − x 0)2 L 2 loss based on a Gaussian kernel K t ( x , x 0 ) = 1 2 t 2 √ t f h ( · ) based on L 2 loss is defined as: The risk of ˆ � ( f ( x ) − ˆ f h ( x )) 2 dx ] = E X n [ Risk L 2 , n n − 1 ∝ E [ K √ 2 h ( X 1 , X 2 )] − 2 E [ K h ( X 1 , X 2 )] . n Therefore, a U-statistic estimate for the above relative risk can be constructed as 1 � = K L 2 , h ( X i , X j ) , where U L 2 � n � 2 i < j n − 1 K L 2 , h ( x 1 , x 2 ) = K √ 2 h ( x 1 , x 2 ) − 2 K h ( x 1 , x 2 ) . n It can be shown that the above risk estimator U L 2 is identical to UCV estimator. Qing Wang and Bruce G. Lindsay Department of Statistics,Pennsylvania State University Topics in U-statistics and Risk Estimation

Outline Introduction Unbiased Variance Estimator Implementation in Risk Estimation Future Work Reference Asymptotic Behavior of U-statistics As a U-statistic is an unbiased estimator of the parameter of interest, exploring its variance to evaluate the parameter estimation is always crucial and of interest. In the case of risk estimation, we may want to know how precise we estimate the risk function by a U-statistic. Qing Wang and Bruce G. Lindsay Department of Statistics,Pennsylvania State University Topics in U-statistics and Risk Estimation

Topics in U-statistics and Risk Estimation Qing Wang and Bruce G. - PowerPoint PPT Presentation

Outline Introduction Unbiased Variance Estimator Implementation in Risk Estimation Future Work Reference Topics in U-statistics and Risk Estimation Qing Wang and Bruce G. Lindsay March 17, 2011 Qing Wang and Bruce G. Lindsay Department of

Risk Management Workshop 1 Risk management workshop Why do we Risk Risk and need risk

Official Statistics Matt Dray, Assistant Statistician Official Statistics 2 Official

Motion Estimation by Affine Transforms Motion Estimation by Affine Transforms Motion Estimation

Statistics Point Estimation Shiu-Sheng Chen Department of Economics National Taiwan University

EARTHQUAKE LOSS ESTIMATION AND RISK EARTHQUAKE LOSS ESTIMATION AND RISK ASSESSMENT METHODOLOGY

MLSE Channel Estimation MLSE Channel Estimation MLSE Channel Estimation Parametric or Non-

M-Estimation under High-Dimensional Asymptotics DLD, Andrea Montanari 2014-05-01 DLD, Andrea

Part 3. Spectrum Estimation Part 3. Spectrum Estimation 3.2 Parametric Methods for Spectral

Lecture 6. Bayesian estimation Lecture 6. Bayesian estimation 1 (172) 6. Bayesian estimation

On the nature of financial risk: Why risk is so hard to measure and why risk models fail so often

Risk/Reward Risk/Reward If you buy here, what is the target? What is the risk? 1 221

Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 Parameter Estimation Point

Risk Management Ken Haas USA Hockey Atlantic District Risk Manager 215-341-1488 1 Risk

Areal statistics Barry Rowlingson Research Fellow DataCamp Spatial Statistics in R Borders

Dynamic Programming: Estimation ECON 34430: Topics in Labor Markets T. Lamadon (U of Chicago)

Risk Assessment Reduce the workers and Biosafety is an inexact science, and

The Solid Foundation to a Successful College Career Eric Davis, Chad S. Briggs, Amy Fehr-Davis,

EMA Multiplicity Workshop Case study: a phase 3 study with 2 doses and secondary endpoints

Artificial Intelligence Based Control Power Optimization on Tailless Aircraft Frank H. Gern NASA

CENTENNIAL CHALLENGE A 100 - YEAR CELEBRATION Whitney Forman - Cook Communications Director

On the Length of the Longest Common Subsequence Peter Rabinovitch Summary Consider two

Anytime Best+Depth-First Search for Bounding Marginal MAP Radu Marinescu Junkyu Lee, Alex Ihler

Adult Social Care & Health Council Plan Priorities Report to Health Scrutiny Committee, 20 th

Lessons Learnt from Indonesia 35 Manila M-women Working Group November, 2013 Key Barriers for

Sambuz

Useful Links

Newsletter

Mail Us

Topics in U-statistics and Risk Estimation Qing Wang and Bruce G. - PowerPoint PPT Presentation

Outline Introduction Unbiased Variance Estimator Implementation in Risk Estimation Future Work Reference Topics in U-statistics and Risk Estimation Qing Wang and Bruce G. Lindsay March 17, 2011 Qing Wang and Bruce G. Lindsay Department of

Risk Management Workshop 1 Risk management workshop Why do we Risk Risk and need risk

Official Statistics Matt Dray, Assistant Statistician Official Statistics 2 Official

Motion Estimation by Affine Transforms Motion Estimation by Affine Transforms Motion Estimation

Statistics Point Estimation Shiu-Sheng Chen Department of Economics National Taiwan University

EARTHQUAKE LOSS ESTIMATION AND RISK EARTHQUAKE LOSS ESTIMATION AND RISK ASSESSMENT METHODOLOGY

MLSE Channel Estimation MLSE Channel Estimation MLSE Channel Estimation Parametric or Non-

M-Estimation under High-Dimensional Asymptotics DLD, Andrea Montanari 2014-05-01 DLD, Andrea

Part 3. Spectrum Estimation Part 3. Spectrum Estimation 3.2 Parametric Methods for Spectral

Lecture 6. Bayesian estimation Lecture 6. Bayesian estimation 1 (172) 6. Bayesian estimation

On the nature of financial risk: Why risk is so hard to measure and why risk models fail so often

Risk/Reward Risk/Reward If you buy here, what is the target? What is the risk? 1 221

Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 Parameter Estimation Point

Risk Management Ken Haas USA Hockey Atlantic District Risk Manager 215-341-1488 1 Risk

Areal statistics Barry Rowlingson Research Fellow DataCamp Spatial Statistics in R Borders

Dynamic Programming: Estimation ECON 34430: Topics in Labor Markets T. Lamadon (U of Chicago)

Risk Assessment Reduce the workers and Biosafety is an inexact science, and

The Solid Foundation to a Successful College Career Eric Davis, Chad S. Briggs, Amy Fehr-Davis,

EMA Multiplicity Workshop Case study: a phase 3 study with 2 doses and secondary endpoints

Artificial Intelligence Based Control Power Optimization on Tailless Aircraft Frank H. Gern NASA

CENTENNIAL CHALLENGE A 100 - YEAR CELEBRATION Whitney Forman - Cook Communications Director

On the Length of the Longest Common Subsequence Peter Rabinovitch Summary Consider two

Anytime Best+Depth-First Search for Bounding Marginal MAP Radu Marinescu Junkyu Lee, Alex Ihler

Adult Social Care &amp; Health Council Plan Priorities Report to Health Scrutiny Committee, 20 th

Lessons Learnt from Indonesia 35 Manila M-women Working Group November, 2013 Key Barriers for

Sambuz

Useful Links

Newsletter

Mail Us

Adult Social Care & Health Council Plan Priorities Report to Health Scrutiny Committee, 20 th