SLIDE 1

Beta kernels and transformed kernels: applications to copulas and quantiles

Arthur Charpentier

Université Rennes 1
arthur.charpentier@univ-rennes1.fr
http://freakonometrics.blog.free.fr/
Statistical seminar, Université Laval, Québec, April 2011

SLIDE 2

Agenda

  • General introduction and kernel based estimation

Copula density estimation

  • Kernel based estimation and bias
  • Beta kernel estimation
  • Transforming observations

Quantile estimation

  • Parametric estimation
  • Semiparametric estimation, extreme value theory
  • Nonparametric estimation
  • Transforming observations


SLIDE 3

Moving histogram to estimate a density

A natural way to estimate a density at x from an i.i.d. sample {X1, · · · , Xn} is to count (and then normalize) how many observations fall in a neighborhood of x, e.g. such that |x − Xi| ≤ h.

SLIDE 4

Kernel based estimation

Instead of a step function 1(|x − Xi| ≤ h), consider smoother functions,

$$\hat f_h(x) = \frac{1}{n}\sum_{i=1}^{n} K_h(x - X_i) = \frac{1}{nh}\sum_{i=1}^{n} K\left(\frac{x - X_i}{h}\right),$$

where K(·) is a kernel, i.e. a non-negative real-valued integrable function such that $\int_{-\infty}^{+\infty} K(u)\, du = 1$ (so that $\hat f_h(\cdot)$ is a density), and K(·) is symmetric, i.e. K(−u) = K(u).
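A minimal R sketch of this estimator, with a Gaussian kernel (base R's density() implements the same idea with automatic bandwidth selection):

```r
# f_h(x) = (1/(nh)) * sum K((x - X_i)/h), here with K = dnorm.
kde <- function(x, X, h) sapply(x, function(x0) mean(dnorm((x0 - X) / h)) / h)
X <- rnorm(100)
kde(c(-1, 0, 1), X, h = 0.3)   # compare with density(X, bw = 0.3)
```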

SLIDE 5

SLIDE 6

Standard kernels are

  • uniform (rectangular): $K(u) = \frac{1}{2}\,\mathbf{1}_{\{|u|\le 1\}}$
  • triangular: $K(u) = (1 - |u|)\,\mathbf{1}_{\{|u|\le 1\}}$
  • Epanechnikov: $K(u) = \frac{3}{4}(1 - u^2)\,\mathbf{1}_{\{|u|\le 1\}}$
  • Gaussian: $K(u) = \frac{1}{\sqrt{2\pi}}\exp\left(-\frac{1}{2}u^2\right)$
SLIDE 7
SLIDE 8

A word about copulas

A 2-dimensional copula is a 2-dimensional cumulative distribution function restricted to [0, 1]² with standard uniform margins.

(Panels: the copula cumulative distribution function and its level curves; the copula density and its level curves.)

If C is twice differentiable, let c denote the density of the copula.

SLIDE 9

Sklar's theorem: Let C be a copula, and FX and FY two marginal distributions; then F(x, y) = C(FX(x), FY(y)) is a bivariate distribution function, with F ∈ F(FX, FY). Conversely, if F ∈ F(FX, FY), there exists C such that F(x, y) = C(FX(x), FY(y)). Further, if FX and FY are continuous, then C is unique, and given by

$$C(u, v) = F(F_X^{-1}(u), F_Y^{-1}(v)) \quad \text{for all } (u, v) \in [0, 1] \times [0, 1].$$

C will then be called the copula of F, or the copula of (X, Y).

SLIDE 10

Motivation

Example (Loss-ALAE): consider the following dataset, where the Xi's are loss amounts (paid to the insured) and the Yi's are the allocated expenses. Denote by Ri and Si the respective ranks of Xi and Yi. Set $U_i = R_i/n = \hat F_X(X_i)$ and $V_i = S_i/n = \hat F_Y(Y_i)$. Figure 1 shows the log-log scatterplot (log Xi, log Yi) and the associated copula-based scatterplot (Ui, Vi). Figure 2 is simply a histogram of the (Ui, Vi), which is a nonparametric estimation of the copula density. Note that the histogram suggests strong dependence in the upper tails (the interesting part in an insurance/reinsurance context).
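A short R sketch of this rank transformation (loss and alae are hypothetical vectors standing in for the dataset):

```r
n <- length(loss)
U <- rank(loss) / n              # U_i = R_i / n
V <- rank(alae) / n              # V_i = S_i / n
par(mfrow = c(1, 2))
plot(log(loss), log(alae))       # log-log scatterplot (Figure 1, left)
plot(U, V)                       # copula-type scatterplot (Figure 1, right)
```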

SLIDE 11

Figure 1 – Loss-ALAE scatterplots: log-log (log(LOSS) versus log(ALAE)) and copula type (probability level of LOSS versus probability level of ALAE).

SLIDE 12

Figure 2 – Loss-ALAE, histogram of the copula type transformation.

SLIDE 13

Why nonparametrics instead of parametrics?

In parametric estimation, assume that the copula density cθ belongs to some given family C = {cθ, θ ∈ Θ}. The tail behavior will then crucially depend on the tail behavior of the copulas in C.

Example: the table below shows the probability that both X and Y exceed high thresholds ($X > F_X^{-1}(p)$ and $Y > F_Y^{-1}(p)$) for usual copula families, where the parameter θ is obtained using maximum likelihood techniques.

p        Clayton     Frank       Gaussian    Gumbel      Clayton∗    max/min
90%      1.93500%    2.73715%    4.73767%    4.82614%    5.66878%      2.93
95%      0.51020%    0.78464%    1.99195%    2.30085%    2.78677%      5.46
99%      0.02134%    0.03566%    0.27337%    0.44246%    0.55102%     25.82
99.9%    0.00021%    0.00037%    0.01653%    0.04385%    0.05499%    261.85

Probability of exceedances, for given parametric copulas, τ = 0.5.

SLIDE 14

Figure 3 shows the evolution of $p \mapsto \mathbb{P}(X > F_X^{-1}(p),\ Y > F_Y^{-1}(p))$. If the original model is a multivariate Student vector (X, Y), the associated probability is the upper line. If either the marginal distributions are misfitted (e.g. a Gaussian assumption) or the dependence structure is misspecified (e.g. a Gaussian assumption), the probabilities are always underestimated.

SLIDE 15

Figure 3 – $p \mapsto \mathbb{P}(X > F_X^{-1}(p),\ Y > F_Y^{-1}(p))$ when (X, Y) is a Student t random vector, and when either the margins or the dependence structure is misspecified. (Left panel: joint probability of exceeding high quantiles; right panel: ratios of exceedance probabilities; curves for Student/Gaussian dependence structures combined with Student/Gaussian margins.)

SLIDE 16

Kernel estimation for bounded support density

Consider a kernel based estimation of a density f,

$$\hat f(x) = \frac{1}{nh}\sum_{i=1}^{n} K\left(\frac{x - X_i}{h}\right),$$

where K is a kernel function, given an n-sample X1, ..., Xn of a positive random variable (Xi ∈ [0, ∞)). If K is a symmetric kernel, then

$$\mathbb{E}(\hat f(0)) = \frac{1}{2} f(0) + O(h).$$
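A quick R illustration of this boundary bias, with exponential data (so f(0) = 1, and the standard kernel estimate is roughly halved at 0):

```r
set.seed(1)
x <- rexp(100)
d <- density(x)                       # standard KDE, no boundary correction
plot(d, main = "KDE of exponential data")
curve(dexp(x), add = TRUE, lty = 2)   # true density; note the gap near 0
```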

SLIDE 17

Figure 4 – Density estimation of an exponential density, 100 points.

SLIDE 18

Figure 5 – Density estimation of an exponential density, 10,000 points.

SLIDE 19

Figure 6 – Density estimation of a uniform density on [0, 1].

SLIDE 20

How to get a proper estimation on the border

Several techniques have been introduced to get a better estimation on the border:
– boundary kernels (Müller (1991)),
– mirror image modification (Deheuvels & Hominal (1989), Schuster (1985)),
– transformed kernels (Devroye & Györfi (1981), Wand, Marron & Ruppert (1991)).
In the particular case of densities on [0, 1]:
– Beta kernels (Brown & Chen (1999), Chen (1999, 2000)),
– averages of histograms, inspired by Dersko (1998).

SLIDE 21

Local kernel density estimators

The idea is that the bandwidth h(x) can be different for each point x at which f(x) is estimated. Hence,

$$\hat f(x, h(x)) = \frac{1}{n\,h(x)}\sum_{i=1}^{n} K\left(\frac{x - X_i}{h(x)}\right)$$

(see e.g. Loftsgaarden & Quesenberry (1965)).

Variable kernel density estimators

The idea is that the bandwidth h can be replaced by n values α(Xi). Hence,

$$\hat f(x, \alpha) = \frac{1}{n}\sum_{i=1}^{n} \frac{1}{\alpha(X_i)}\, K\left(\frac{x - X_i}{\alpha(X_i)}\right)$$

(see e.g. Abramson (1982)).

SLIDE 22

The transformed kernel estimate

The idea was developed in Devroye & Györfi (1981) for univariate densities. Consider a transformation T : R → [0, 1], strictly increasing, continuously differentiable, one-to-one, with a continuously differentiable inverse.

Set Y = T(X). Then Y has density $f_Y(y) = f_X(T^{-1}(y)) \cdot (T^{-1})'(y)$. If $f_Y$ is estimated by $\hat f_Y$, then $f_X$ is estimated by

$$\hat f_X(x) = \hat f_Y(T(x)) \cdot T'(x).$$
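A minimal R sketch, taking T = pnorm as an illustrative choice (any smooth, strictly increasing T mapping R to [0, 1] works; $\hat f_Y$ is here a plain Gaussian KDE):

```r
# f_X(x) = f_Y(T(x)) * T'(x)
tk_density <- function(x, X, trans = pnorm, dtrans = dnorm) {
  fY <- density(trans(X))                  # estimate f_Y on the transformed scale
  approx(fY$x, fY$y, xout = trans(x))$y * dtrans(x)
}
tk_density(c(-1, 0, 1), rnorm(200))        # should be close to dnorm(c(-1, 0, 1))
```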

SLIDE 23

Figure 7 – The transformed kernel principle (with a Φ−1-transformation).

SLIDE 24

The Beta Kernel estimate

The Beta-kernel based estimator of a density with support [0, 1], at point x, is obtained using beta kernels, which yields

$$\hat f(x) = \frac{1}{n}\sum_{i=1}^{n} K\left(X_i,\ \frac{x}{b} + 1,\ \frac{1-x}{b} + 1\right),$$

where K(·, α, β) denotes the density of the Beta distribution with parameters α and β,

$$K(x, \alpha, \beta) = \frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}\, x^{\alpha-1}(1-x)^{\beta-1}\, \mathbf{1}_{\{x \in [0,1]\}}.$$
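A direct R transcription of this estimator (dbeta is the Beta density, so it carries the normalizing constants):

```r
beta_kde <- function(x, X, b) {
  sapply(x, function(x0)
    mean(dbeta(X, shape1 = x0 / b + 1, shape2 = (1 - x0) / b + 1)))
}
u <- seq(0, 1, by = 0.01)
plot(u, beta_kde(u, runif(100), b = 0.05), type = "l")  # little boundary bias
```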

SLIDE 25

Figure 8 – The beta-kernel estimate (panels: Gaussian kernel approach; Beta kernel approach; density estimation using beta kernels).

SLIDE 26

Copula density estimation: the boundary problem

Let (U1, V1), ..., (Un, Vn) denote a sample with support [0, 1]², with density c(u, v) assumed to be twice continuously differentiable on (0, 1)². If K denotes a symmetric kernel with support [−1, +1], then in any corner (e.g. (0, 0)),

$$\mathbb{E}(\hat c(0, 0, h)) = \frac{1}{4}\, c(0, 0) - \frac{1}{2}\left[c_1(0, 0) + c_2(0, 0)\right] \int_0^1 \omega K(\omega)\, d\omega \cdot h + o(h),$$

in the interior of the borders (e.g. u = 0 and v ∈ (0, 1)),

$$\mathbb{E}(\hat c(0, v, h)) = \frac{1}{2}\, c(0, v) - c_1(0, v) \int_0^1 \omega K(\omega)\, d\omega \cdot h + o(h),$$

and in the interior ((u, v) ∈ (0, 1) × (0, 1)),

$$\mathbb{E}(\hat c(u, v, h)) = c(u, v) + \frac{1}{2}\left[c_{1,1}(u, v) + c_{2,2}(u, v)\right] \int_{-1}^{1} \omega^2 K(\omega)\, d\omega \cdot h^2 + o(h^2).$$

On the borders there is a multiplicative bias, and the order of the bias is O(h) (while it is O(h²) in the interior).

SLIDE 27

If K denotes a symmetric kernel with support [−1, +1], then in any corner (e.g. (0, 0)),

$$\mathrm{Var}(\hat c(0, 0, h)) = c(0, 0)\left(\int_0^1 K(\omega)^2 d\omega\right)^2 \cdot \frac{1}{nh^2} + o\left(\frac{1}{nh^2}\right),$$

in the interior of the borders (e.g. u = 0 and v ∈ (0, 1)),

$$\mathrm{Var}(\hat c(0, v, h)) = c(0, v)\left(\int_{-1}^{1} K(\omega)^2 d\omega\right)\left(\int_0^1 K(\omega)^2 d\omega\right) \cdot \frac{1}{nh^2} + o\left(\frac{1}{nh^2}\right),$$

and in the interior ((u, v) ∈ (0, 1) × (0, 1)),

$$\mathrm{Var}(\hat c(u, v, h)) = c(u, v)\left(\int_{-1}^{1} K(\omega)^2 d\omega\right)^2 \cdot \frac{1}{nh^2} + o\left(\frac{1}{nh^2}\right).$$

SLIDE 28

Figure 9 – Theoretical density of Frank copula.

SLIDE 29

Figure 10 – Estimated density of Frank copula, using standard Gaussian (independent) kernels, h = h∗.

SLIDE 30

Transformed kernel technique

Consider the kernel estimator of the density of the (Xi, Yi) = (G−1(Ui), G−1(Vi))'s, where G is a strictly increasing distribution function with a differentiable density. Since the density f is continuous, twice differentiable and bounded above, for all (x, y) ∈ R²,

$$\hat f(x, y) = \frac{1}{nh^2}\sum_{i=1}^{n} K\left(\frac{x - X_i}{h}\right) K\left(\frac{y - Y_i}{h}\right)$$

satisfies $\mathbb{E}(\hat f(x, y)) = f(x, y) + O(h^2)$, as long as $\int \omega K(\omega)\, d\omega = 0$. The variance is

$$\mathrm{Var}(\hat f(x, y)) = \frac{f(x, y)}{nh^2}\left(\int K(\omega)^2 d\omega\right)^2 + o\left(\frac{1}{nh^2}\right),$$

and asymptotic normality can be obtained,

$$\sqrt{nh^2}\left(\hat f(x, y) - \mathbb{E}(\hat f(x, y))\right) \xrightarrow{\ \mathcal{L}\ } \mathcal{N}\left(0,\ f(x, y)\left(\int K(\omega)^2 d\omega\right)^2\right).$$

SLIDE 31

Since f(x, y) = g(x) g(y) c[G(x), G(y)], (1) can be inverted as

$$c(u, v) = \frac{f(G^{-1}(u), G^{-1}(v))}{g(G^{-1}(u))\, g(G^{-1}(v))}, \quad (u, v) \in [0, 1] \times [0, 1], \qquad (2)$$

and one gets, substituting $\hat f$ in (2),

$$\hat c(u, v) = \frac{1}{nh^2 \cdot g(G^{-1}(u)) \cdot g(G^{-1}(v))} \sum_{i=1}^{n} K\left(\frac{G^{-1}(u) - G^{-1}(U_i)}{h}\right) K\left(\frac{G^{-1}(v) - G^{-1}(V_i)}{h}\right). \qquad (3)$$

Therefore,

$$\mathbb{E}(\hat c(u, v, h)) = c(u, v) + \frac{O(h^2)}{g(G^{-1}(u))\, g(G^{-1}(v))}.$$
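A minimal R sketch of estimator (3), with G = Φ (the Gaussian normalization of Figure 11) and a Gaussian kernel:

```r
# U, V are pseudo-observations in (0, 1); returns c_hat(u, v).
tk_copula <- function(u, v, U, V, h) {
  X <- qnorm(U); Y <- qnorm(V)                  # G^{-1}(U_i), G^{-1}(V_i)
  fhat <- mean(dnorm((qnorm(u) - X) / h) *
               dnorm((qnorm(v) - Y) / h)) / h^2 # kernel estimate of f
  fhat / (dnorm(qnorm(u)) * dnorm(qnorm(v)))    # divide by g(G^-1(u)) g(G^-1(v))
}
```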

SLIDE 32

Similarly,

$$\mathrm{Var}(\hat c(u, v, h)) = \frac{1}{g(G^{-1}(u))\, g(G^{-1}(v))} \cdot \frac{c(u, v)}{nh^2}\left(\int K(\omega)^2 d\omega\right)^2 + \frac{1}{g(G^{-1}(u))^2\, g(G^{-1}(v))^2}\, o\left(\frac{1}{nh^2}\right).$$

SLIDE 33

Figure 11 – Estimated density of Frank copula, using a Gaussian kernel, after a Gaussian normalization.

SLIDE 34

Figure 12 – Estimated density of Frank copula, using a Gaussian kernel, after a Student normalization, with 5 degrees of freedom.

SLIDE 35

Figure 13 – Estimated density of Frank copula, using a Gaussian kernel, after a Student normalization, with 3 degrees of freedom.

SLIDE 36

Bivariate Beta kernels

The Beta-kernel based estimator of the copula density at point (u, v) is obtained using product beta kernels, which yields

$$\hat c(u, v) = \frac{1}{n}\sum_{i=1}^{n} K\left(X_i,\ \frac{u}{b} + 1,\ \frac{1-u}{b} + 1\right) \cdot K\left(Y_i,\ \frac{v}{b} + 1,\ \frac{1-v}{b} + 1\right),$$

where K(·, α, β) denotes the density of the Beta distribution with parameters α and β.
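The corresponding R sketch, reusing dbeta for the product beta kernels (X and Y are the [0, 1]-valued coordinates of the sample):

```r
beta_copula <- function(u, v, X, Y, b) {
  mean(dbeta(X, u / b + 1, (1 - u) / b + 1) *
       dbeta(Y, v / b + 1, (1 - v) / b + 1))
}
```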

SLIDE 37

Figure 14 – Shape of bivariate Beta kernels K(·, x/b + 1, (1 − x)/b + 1) × K(·, y/b + 1, (1 − y)/b + 1) for b = 0.2, at (x, y) ∈ {0, 0.2, 0.5} × {0, 0.2, 0.5}.

SLIDE 38

Assume that the copula density c is twice differentiable on [0, 1] × [0, 1], and let (u, v) ∈ [0, 1] × [0, 1]. The bias of $\hat c(u, v)$ is of order b, i.e.

$$\mathbb{E}(\hat c(u, v)) = c(u, v) + Q(u, v) \cdot b + o(b),$$

where

$$Q(u, v) = (1 - 2u)\, c_1(u, v) + (1 - 2v)\, c_2(u, v) + \frac{1}{2}\left[u(1-u)\, c_{1,1}(u, v) + v(1-v)\, c_{2,2}(u, v)\right].$$

The bias here is O(b) everywhere, while it is O(h²) using standard kernels.

The variance of $\hat c(u, v)$ is, in corners (e.g. (0, 0)),

$$\mathrm{Var}(\hat c(0, 0)) = \frac{1}{nb^2}\left[c(0, 0) + o(n^{-1})\right],$$

and in the interior of the borders (e.g. u = 0 and v ∈ (0, 1)),

$$\mathrm{Var}(\hat c(0, v)) = \frac{1}{2nb^{3/2}\sqrt{\pi v(1-v)}}\left[c(0, v) + o(n^{-1})\right],$$

SLIDE 39

and in the interior ((u, v) ∈ (0, 1) × (0, 1)),

$$\mathrm{Var}(\hat c(u, v)) = \frac{1}{4nb\,\pi\sqrt{u(1-u)\,v(1-v)}}\left[c(u, v) + o(n^{-1})\right].$$

Remark. From those properties, an (asymptotically) optimal bandwidth b can be deduced, minimizing the asymptotic mean squared error,

$$b^* \equiv \left[\frac{1}{16\pi n\, Q(u, v)^2} \cdot \frac{1}{\sqrt{u(1-u)\,v(1-v)}}\right]^{1/3}.$$

Note (see Charpentier, Fermanian & Scaillet (2005)) that all those results can be obtained in dimension d ≥ 2.

Example. For n = 100 simulated data from Frank copula, the optimal bandwidth is b ∼ 0.05.

SLIDE 40

Figure 15 – Estimated density of Frank copula, Beta kernels, b = 0.1.

SLIDE 41

Figure 16 – Estimated density of Frank copula, Beta kernels, b = 0.05.

SLIDE 42

Figure 17 – Density estimation on the diagonal, standard Gaussian kernel estimator (n = 100, 1,000, 10,000).

SLIDE 43

Figure 18 – Density estimation on the diagonal, transformed kernel estimator (Gaussian; n = 100, 1,000, 10,000).

SLIDE 44

Figure 19 – Density estimation on the diagonal, Beta kernel estimator (b = 0.05 for n = 100, b = 0.02 for n = 1,000, b = 0.005 for n = 10,000).

SLIDE 45

Copula density estimation

Gijbels & Mielniczuk (1990): given an i.i.d. sample, a natural estimate for the normed density is obtained using the transformed sample $(\hat F_X(X_1), \hat F_Y(Y_1)), ..., (\hat F_X(X_n), \hat F_Y(Y_n))$, where $\hat F_X$ and $\hat F_Y$ are the empirical distribution functions of the margins. The copula density can be constructed as some density estimate based on this sample (Behnen, Husková & Neuhaus (1985) investigated the kernel method). The natural kernel type estimator $\hat c$ of c is

$$\hat c(u, v) = \frac{1}{nh^2}\sum_{i=1}^{n} K\left(\frac{u - \hat F_X(X_i)}{h},\ \frac{v - \hat F_Y(Y_i)}{h}\right), \quad (u, v) \in [0, 1]^2.$$

"This estimator is not consistent in the points on the boundary of the unit square."

SLIDE 46

Copula density estimation and pseudo-observations

Example: in linear regression, residuals are pseudo-observations,

$$\varepsilon_i = H(X_i, Y_i) = Y_i - \alpha - \beta X_i, \qquad \hat\varepsilon_i = \hat H_n(X_i, Y_i) = Y_i - \hat\alpha_n - \hat\beta_n X_i.$$

Example: when dealing with copulas, ranks (Ui, Vi) yield pseudo-observations,

$$(U_i, V_i) = H(X_i, Y_i) = (F_X(X_i), F_Y(Y_i)), \qquad (\hat U_i, \hat V_i) = \hat H_n(X_i, Y_i) = (\hat F_X(X_i), \hat F_Y(Y_i))$$

(see Genest & Rivest (1993)). More formally, let X1, ..., Xn denote a series of observations of X (∈ X), stationary and ergodic. Let H : X → R^d and set εi = H(Xi) (non-observable). If H is estimated by $\hat H_n$, then $\hat\varepsilon_i = \hat H_n(X_i)$ are called pseudo-observations.

SLIDE 47

Let $\hat K_n$ denote the empirical distribution function of those pseudo-observations,

$$\hat K_n(t) = \frac{1}{n}\sum_{i=1}^{n} \mathbf{1}(\hat\varepsilon_i \le t), \quad t \in \mathbb{R}^d.$$

Further, if K denotes the distribution function of ε = H(X), define the empirical process based on pseudo-observations,

$$\mathbb{K}_n(t) = \sqrt{n}\left(\hat K_n(t) - K(t)\right).$$

As proved in Ghoudi & Rémillard (1998, 2004), this empirical process converges weakly. Figure 20 shows scatterplots when margins are known (i.e. the $(F_X(X_i), F_Y(Y_i))$'s) and when margins are estimated (i.e. the $(\hat F_X(X_i), \hat F_Y(Y_i))$'s). Note that the pseudo-sample is more "uniform", in the sense of a lower discrepancy (as in quasi-Monte Carlo techniques, see e.g. Niederreiter (1992)).

SLIDE 48

Figure 20 – Observations and pseudo-observations: 500 simulated observations (Xi, Yi) from Frank copula and the associated pseudo-sample $(\hat F_X(X_i), \hat F_Y(Y_i))$.

SLIDE 49

Because samples are more "uniform" using ranks and pseudo-observations, the variance of the estimator of the density at a given point (u, v) ∈ (0, 1) × (0, 1) is usually smaller. For instance, Figure 21 shows the impact of considering pseudo-observations, i.e. substituting $\hat F_X$ and $\hat F_Y$ for the unknown marginal distributions FX and FY. The dotted line shows the density of $\hat c(u, v)$ from an n = 100 sample (Ui, Vi) (from Frank copula), and the solid line shows the density of $\hat c(u, v)$ from the sample $(\hat F_U(U_i), \hat F_V(V_i))$ (i.e. ranks of the observations).

SLIDE 50

Figure 21 – The impact of estimating from pseudo-observations (n = 100; distribution of the estimation of the density $\hat c(u, v)$).

SLIDE 51

Roots of ‘transformed kernel’

SLIDE 52

SLIDE 53

SLIDE 54

SLIDE 55

SLIDE 56

Using a parametric approach

If FX ∈ F = {Fθ, θ ∈ Θ} (assumed to be continuous), $q_X(\alpha) = F_\theta^{-1}(\alpha)$, and thus a natural estimator is

$$\hat q_X(\alpha) = F_{\hat\theta}^{-1}(\alpha), \qquad (4)$$

where $\hat\theta$ is an estimator of θ (maximum likelihood, moment estimator, ...).

SLIDE 57

Using the Gaussian distribution

A natural idea (found in classical financial models) is to assume a Gaussian distribution: if X ∼ N(µ, σ), then the α-quantile is simply

$$q(\alpha) = \mu + \Phi^{-1}(\alpha)\,\sigma,$$

where Φ−1(α) is obtained from statistical tables (or any statistical software), e.g. Φ−1(α) = 1.64 if α = 95%, or 2.33 if α = 99%.

Definition 1. Given an n-sample {X1, · · · , Xn}, the (Gaussian) parametric estimation of the α-quantile is

$$\hat q_n(\alpha) = \hat\mu + \Phi^{-1}(\alpha)\,\hat\sigma.$$
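A one-line R sketch of Definition 1:

```r
gaussian_quantile <- function(x, alpha) mean(x) + qnorm(alpha) * sd(x)
gaussian_quantile(rnorm(1000, mean = 1, sd = 2), alpha = 0.95)
```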

SLIDE 58

Using a parametric model

Actually, if the Gaussian model does not fit very well, it is still possible to use a Gaussian approximation. If the variance is finite, (X − E(X))/σ might be close to the Gaussian distribution; thus, consider the so-called Cornish-Fisher approximation, i.e.

$$Q(X, \alpha) \sim \mathbb{E}(X) + z_\alpha \sqrt{V(X)}, \qquad (5)$$

where

$$z_\alpha = \Phi^{-1}(\alpha) + \frac{\zeta_1}{6}\left[\Phi^{-1}(\alpha)^2 - 1\right] + \frac{\zeta_2}{24}\left[\Phi^{-1}(\alpha)^3 - 3\Phi^{-1}(\alpha)\right] - \frac{\zeta_1^2}{36}\left[2\Phi^{-1}(\alpha)^3 - 5\Phi^{-1}(\alpha)\right],$$

where ζ1 is the skewness of X and ζ2 the excess kurtosis, i.e.

$$\zeta_1 = \frac{\mathbb{E}([X - \mathbb{E}(X)]^3)}{\mathbb{E}([X - \mathbb{E}(X)]^2)^{3/2}} \quad \text{and} \quad \zeta_2 = \frac{\mathbb{E}([X - \mathbb{E}(X)]^4)}{\mathbb{E}([X - \mathbb{E}(X)]^2)^{2}} - 3. \qquad (6)$$

SLIDE 59

Using a parametric model

Definition 2. Given an n-sample {X1, · · · , Xn}, the Cornish-Fisher estimation of the α-quantile is

$$\hat q_n(\alpha) = \hat\mu + \hat z_\alpha \hat\sigma, \quad \text{where} \quad \hat\mu = \frac{1}{n}\sum_{i=1}^{n} X_i \quad \text{and} \quad \hat\sigma = \sqrt{\frac{1}{n-1}\sum_{i=1}^{n} (X_i - \hat\mu)^2},$$

and

$$\hat z_\alpha = \Phi^{-1}(\alpha) + \frac{\hat\zeta_1}{6}\left[\Phi^{-1}(\alpha)^2 - 1\right] + \frac{\hat\zeta_2}{24}\left[\Phi^{-1}(\alpha)^3 - 3\Phi^{-1}(\alpha)\right] - \frac{\hat\zeta_1^2}{36}\left[2\Phi^{-1}(\alpha)^3 - 5\Phi^{-1}(\alpha)\right], \qquad (7)$$

where $\hat\zeta_1$ is the natural estimator of the skewness of X, and $\hat\zeta_2$ the natural estimator of the excess kurtosis, i.e.

$$\hat\zeta_1 = \frac{\sqrt{n(n-1)}}{n-2}\cdot\frac{\sqrt{n}\sum_{i=1}^{n}(X_i - \hat\mu)^3}{\left(\sum_{i=1}^{n}(X_i - \hat\mu)^2\right)^{3/2}} \quad \text{and} \quad \hat\zeta_2 = \frac{n-1}{(n-2)(n-3)}\left[(n+1)\hat\zeta_2' + 6\right],$$

where

$$\hat\zeta_2' = \frac{n\sum_{i=1}^{n}(X_i - \hat\mu)^4}{\left(\sum_{i=1}^{n}(X_i - \hat\mu)^2\right)^{2}} - 3.$$
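A minimal R sketch of Definition 2, transcribing the estimators above directly:

```r
cornish_fisher <- function(x, alpha) {
  n <- length(x); m <- mean(x); s <- sd(x)
  z <- qnorm(alpha)
  g1 <- sqrt(n * (n - 1)) / (n - 2) *
        sqrt(n) * sum((x - m)^3) / sum((x - m)^2)^(3/2)     # skewness
  g2p <- n * sum((x - m)^4) / sum((x - m)^2)^2 - 3
  g2 <- (n - 1) / ((n - 2) * (n - 3)) * ((n + 1) * g2p + 6) # excess kurtosis
  za <- z + g1 / 6 * (z^2 - 1) + g2 / 24 * (z^3 - 3 * z) -
        g1^2 / 36 * (2 * z^3 - 5 * z)
  m + za * s
}
cornish_fisher(rlnorm(1000), alpha = 0.95)
```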

SLIDE 60

Parametric estimators and model error

Figure 22 – Estimation of Value-at-Risk, model error. (Left: theoretical lognormal density versus fitted lognormal and fitted gamma; right: theoretical Student density versus fitted Student and fitted Gaussian.)

SLIDE 61

Using a semiparametric model

Given an n-sample {Y1, . . . , Yn}, let Y1:n ≤ Y2:n ≤ . . . ≤ Yn:n denote the associated order statistics. For u large enough, Y − u given Y > u has a Generalized Pareto distribution with parameters ξ and β (Pickands-Balkema-de Haan theorem). If u = Yn−k:n for k large enough, and if ξ > 0, denote by $\hat\beta_k$ and $\hat\xi_k$ the maximum likelihood estimators of the Generalized Pareto distribution fitted to the sample {Yn−k+1:n − Yn−k:n, ..., Yn:n − Yn−k:n}; then

$$\hat Q(Y, \alpha) = Y_{n-k:n} + \frac{\hat\beta_k}{\hat\xi_k}\left[\left(\frac{n}{k}(1 - \alpha)\right)^{-\hat\xi_k} - 1\right]. \qquad (8)$$

An alternative is to use Hill's estimator if ξ > 0,

$$\hat Q(Y, \alpha) = Y_{n-k:n}\left(\frac{n}{k}(1 - \alpha)\right)^{-\hat\xi_k}, \qquad \hat\xi_k = \frac{1}{k}\sum_{i=1}^{k} \log Y_{n+1-i:n} - \log Y_{n-k:n}. \qquad (9)$$
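A minimal R sketch of the Hill-based quantile (9), with k the number of upper order statistics used:

```r
hill_quantile <- function(y, alpha, k) {
  ys <- sort(y); n <- length(ys)
  xi <- mean(log(ys[(n - k + 1):n])) - log(ys[n - k])   # Hill estimator of xi
  ys[n - k] * ((n / k) * (1 - alpha))^(-xi)             # eq. (9)
}
hill_quantile(1 / runif(1000), alpha = 0.99, k = 100)   # Pareto toy sample
```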

SLIDE 62

On nonparametric estimation for quantiles

For a continuous distribution, $q(\alpha) = F_X^{-1}(\alpha)$; thus, a natural idea is to consider

$$\hat q(\alpha) = \hat F_X^{-1}(\alpha),$$

for some nonparametric estimation of FX.

Definition 3. The empirical cumulative distribution function Fn, based on the sample {X1, . . . , Xn}, is

$$F_n(x) = \frac{1}{n}\sum_{i=1}^{n} \mathbf{1}(X_i \le x).$$

Definition 4. The kernel based cumulative distribution function, based on the sample {X1, . . . , Xn}, is

$$\hat F_n(x) = \frac{1}{nh}\sum_{i=1}^{n}\int_{-\infty}^{x} k\left(\frac{X_i - t}{h}\right) dt = \frac{1}{n}\sum_{i=1}^{n} K\left(\frac{x - X_i}{h}\right),$$

where $K(x) = \int_{-\infty}^{x} k(t)\, dt$, k being a (symmetric) kernel and h the bandwidth.

SLIDE 63

Smoothing nonparametric estimators

Two techniques have been considered to smooth the estimation of quantiles, either implicit or explicit:

  • consider a linear combination of order statistics.

The classical empirical quantile estimate is simply

$$Q_n(p) = F_n^{-1}\left(\frac{i}{n}\right) = X_{i:n} = X_{[np]:n}, \quad \text{where } [\cdot] \text{ denotes the integer part.} \qquad (10)$$

The estimator is simple to obtain, but depends only on one observation. A natural extension is to use (at least) two observations when np is not an integer. The weighted empirical quantile estimate is then defined as

$$\tilde Q_n(p) = (1 - \gamma)\, X_{[np]:n} + \gamma\, X_{[np]+1:n}, \quad \text{where } \gamma = np - [np].$$

SLIDE 64

Figure 23 – Several quantile estimators in R (type = 1, 3, 5, 7; quantile level against probability level).
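The estimators compared in Figure 23 are all available through the type argument of R's quantile() function:

```r
x <- rexp(100)
sapply(c(1, 3, 5, 7), function(tp) quantile(x, probs = 0.95, type = tp))
```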

SLIDE 65

Smoothing nonparametric estimators

In order to increase efficiency, L-statistics can be considered, i.e.

$$Q_n(p) = \sum_{i=1}^{n} W_{i,n,p}\, X_{i:n} = \sum_{i=1}^{n} W_{i,n,p}\, F_n^{-1}\left(\frac{i}{n}\right) = \int_0^1 F_n^{-1}(t)\, k(p, h, t)\, dt, \qquad (11)$$

where Fn is the empirical distribution function of FX, k is a kernel and h a bandwidth. This expression can be written equivalently as

$$Q_n(p) = \sum_{i=1}^{n}\left[\int_{(i-1)/n}^{i/n} k\left(\frac{t - p}{h}\right) dt\right] X_{(i)} = \sum_{i=1}^{n}\left[\mathbb{I}K\left(\frac{i/n - p}{h}\right) - \mathbb{I}K\left(\frac{(i-1)/n - p}{h}\right)\right] X_{(i)}, \qquad (12)$$

where again $\mathbb{I}K(x) = \int_{-\infty}^{x} k(t)\, dt$. The idea is to give more weight to order statistics X(i) such that i is close to pn.

SLIDE 66

Figure 24 – Quantile estimator as weighted sum of order statistics.

SLIDE 67

Figure 25 – Quantile estimator as weighted sum of order statistics.

SLIDE 68

Figure 26 – Quantile estimator as weighted sum of order statistics.

SLIDE 69

Figure 27 – Quantile estimator as weighted sum of order statistics.

SLIDE 70

Figure 28 – Quantile estimator as weighted sum of order statistics.

SLIDE 71

Smoothing nonparametric estimators

E.g. the so-called Harrell-Davis estimator is defined as

$$Q_n(p) = \sum_{i=1}^{n}\left[\int_{(i-1)/n}^{i/n} \frac{\Gamma(n+1)}{\Gamma((n+1)p)\,\Gamma((n+1)q)}\, y^{(n+1)p-1}(1-y)^{(n+1)q-1}\, dy\right] X_{i:n}, \quad \text{with } q = 1 - p;$$

  • find a smooth estimator for FX, and then find (numerically) the inverse.

The α-quantile is defined as the solution of FX ◦ qX(α) = α. If $\tilde F_n$ denotes a continuous estimate of F, then a natural estimate for qX(α) is $\tilde q_n(\alpha)$ such that $\tilde F_n \circ \tilde q_n(\alpha) = \alpha$, obtained using e.g. a Gauss-Newton algorithm.
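A minimal R sketch of the Harrell-Davis estimator: the bracketed integrals above are increments of the Beta((n+1)p, (n+1)(1−p)) c.d.f., so pbeta gives the weights directly.

```r
harrell_davis <- function(x, p) {
  n <- length(x)
  i <- seq_len(n)
  w <- pbeta(i / n, (n + 1) * p, (n + 1) * (1 - p)) -
       pbeta((i - 1) / n, (n + 1) * p, (n + 1) * (1 - p))
  sum(w * sort(x))                     # weighted sum of order statistics
}
harrell_davis(rexp(200), p = 0.95)
```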

SLIDE 72

Improving Beta kernel estimators

Problem: the convergence is not uniform, and there is a large second-order bias at the borders, i.e. at 0 and 1. Chen (1999) proposed a modified Beta 2 kernel estimator, based on

$$k_2(u; b; t) = \begin{cases} k_{t/b,\ (1-t)/b}(u), & \text{if } t \in [2b, 1 - 2b],\\ k_{\rho_b(t),\ (1-t)/b}(u), & \text{if } t \in [0, 2b),\\ k_{t/b,\ \rho_b(1-t)}(u), & \text{if } t \in (1 - 2b, 1], \end{cases}$$

where

$$\rho_b(t) = 2b^2 + 2.5 - \sqrt{4b^4 + 6b^2 + 2.25 - t^2 - \frac{t}{b}}.$$

SLIDE 73

Non-consistency of Beta kernel estimators

Problem: k(0, α, β) = k(1, α, β) = 0. So if there is point mass at 0 or 1, the estimator becomes inconsistent:

$$\hat f_b(x) = \frac{1}{n}\sum_{i=1}^{n} k\left(X_i,\ 1 + \frac{x}{b},\ 1 + \frac{1-x}{b}\right) = \frac{1}{n}\sum_{X_i \neq 0,1} k\left(X_i,\ 1 + \frac{x}{b},\ 1 + \frac{1-x}{b}\right)$$

$$= \frac{n - n_0 - n_1}{n}\cdot\frac{1}{n - n_0 - n_1}\sum_{X_i \neq 0,1} k\left(X_i,\ 1 + \frac{x}{b},\ 1 + \frac{1-x}{b}\right) \approx \left(1 - \mathbb{P}(X = 0) - \mathbb{P}(X = 1)\right)\cdot f_0(x), \quad x \in [0, 1],$$

and therefore $\hat F_b(x) \approx (1 - \mathbb{P}(X = 0) - \mathbb{P}(X = 1)) \cdot F_0(x)$; we may then have problems finding a 95% or 99% quantile, since the total mass will be lower.

SLIDE 74

Non-consistency of Beta kernel estimators

Gouriéroux & Monfort (2007) proposed

$$\hat f_b^{(1)}(x) = \frac{\hat f_b(x)}{\int_0^1 \hat f_b(t)\, dt}, \quad \text{for all } x \in [0, 1].$$

It is called macro-β, since the correction is performed globally. Gouriéroux & Monfort (2007) also proposed

$$\hat f_b^{(2)}(x) = \frac{1}{n}\sum_{i=1}^{n} \frac{k_\beta(X_i; b; x)}{\int_0^1 k_\beta(X_i; b; t)\, dt}, \quad \text{for all } x \in [0, 1].$$

It is called micro-β, since the correction is performed locally.
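A minimal R sketch of the macro-β correction, reusing the beta_kde sketch above (the integral over [0, 1] is approximated on a grid):

```r
macro_beta <- function(x, X, b) {
  grid <- seq(0, 1, length.out = 501)
  beta_kde(x, X, b) / mean(beta_kde(grid, X, b))  # renormalize to total mass 1
}
```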

SLIDE 75

Transforming observations?

In the context of density estimation, Devroye & Györfi (1985) suggested using a so-called transformed kernel estimate. Given a random variable Y, if H is a strictly increasing function, then the p-quantile of H(Y) is equal to H(q(Y; p)). An idea is thus to transform the initial observations {X1, · · · , Xn} into a sample {Y1, · · · , Yn}, where Yi = H(Xi), and then to use a beta-kernel based estimator, if H : R → [0, 1]. Then

$$\hat q_n(X; p) = H^{-1}(\hat q_n(Y; p)).$$

In the context of density estimation, fX(x) = fY(H(x)) H′(x). As mentioned in Devroye & Györfi (1985) (p. 245), "for a transformed histogram estimate, the optimal H gives a uniform [0, 1] density and should therefore be equal to H(x) = F(x), for all x".
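A minimal sketch of this quantile strategy with an illustrative parametric H (a fitted lognormal c.d.f.; for simplicity the [0, 1]-quantile step below uses quantile() rather than a beta-kernel estimator):

```r
tk_quantile <- function(x, p) {
  m <- mean(log(x)); s <- sd(log(x))
  y <- plnorm(x, m, s)                    # Y_i = H(X_i), in [0, 1]
  qlnorm(quantile(y, probs = p), m, s)    # q_n(X; p) = H^-1(q_n(Y; p))
}
tk_quantile(rlnorm(500), p = 0.95)
```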

SLIDE 76

Transforming observations? A Monte Carlo study

Assume that the sample {X1, · · · , Xn} has been generated from Fθ0 (from a family F = (Fθ, θ ∈ Θ)). Four transformations will be considered:
– H = $F_{\hat\theta}$ (based on a maximum likelihood procedure),
– H = Fθ0 (theoretical optimal transformation),
– H = Fθ with θ < θ0 (heavier tails),
– H = Fθ with θ > θ0 (lighter tails).

SLIDE 77

Figure 29 – F(Xi) versus $F_{\hat\theta}(X_i)$, i.e. PP plot.

SLIDE 78

Figure 30 – Nonparametric estimation of the density of the $F_{\hat\theta}(X_i)$'s.

SLIDE 79

Figure 31 – Nonparametric estimation of the quantile function, $F_{\hat\theta}^{-1}(q)$.

SLIDE 80

Figure 32 – F(Xi) versus $F_{\theta_0}(X_i)$, i.e. PP plot.

SLIDE 81

Figure 33 – Nonparametric estimation of the density of the $F_{\theta_0}(X_i)$'s.

SLIDE 82

Figure 34 – Nonparametric estimation of the quantile function, $F_{\theta_0}^{-1}(q)$.

SLIDE 83

Figure 35 – F(Xi) versus Fθ(Xi), i.e. PP plot, θ < θ0 (heavier tails).

SLIDE 84

Figure 36 – Estimation of the density of the Fθ(Xi)'s, θ < θ0 (heavier tails).

SLIDE 85

Figure 37 – Estimation of the quantile function, $F_\theta^{-1}(q)$, θ < θ0 (heavier tails).

SLIDE 86

Figure 38 – F(Xi) versus Fθ(Xi), i.e. PP plot, θ > θ0 (lighter tails).

SLIDE 87

Figure 39 – Estimation of the density of the Fθ(Xi)'s, θ > θ0 (lighter tails).

SLIDE 88

Figure 40 – Estimation of the quantile function, $F_\theta^{-1}(q)$, θ > θ0 (lighter tails).

SLIDE 89

A universal distribution for losses

Buch-Larsen, Nielsen, Guillén & Bolancé (2005) considered the generalized Champernowne distribution to model insurance claims, i.e. positive variables,

$$F_{\alpha,M,c}(y) = \frac{(y+c)^\alpha - c^\alpha}{(y+c)^\alpha + (M+c)^\alpha - 2c^\alpha},$$

where α > 0, c ≥ 0 and M > 0. The associated density is then

$$f_{\alpha,M,c}(y) = \frac{\alpha\,(y+c)^{\alpha-1}\left[(M+c)^\alpha - c^\alpha\right]}{\left[(y+c)^\alpha + (M+c)^\alpha - 2c^\alpha\right]^2}.$$
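A direct R transcription of these two formulas (pchamp and dchamp are hypothetical names):

```r
pchamp <- function(y, a, M, c = 0)
  ((y + c)^a - c^a) / ((y + c)^a + (M + c)^a - 2 * c^a)
dchamp <- function(y, a, M, c = 0)
  a * (y + c)^(a - 1) * ((M + c)^a - c^a) /
    ((y + c)^a + (M + c)^a - 2 * c^a)^2
```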

SLIDE 90

A Monte Carlo study to compare those nonparametric estimators

As in Buch-Larsen, Nielsen, Guillén & Bolancé (2005), 4 distributions were considered:
– normal distribution,
– Weibull distribution,
– log-normal distribution,
– mixture of Pareto and log-normal distributions.

SLIDE 91

Figure 41 – Distribution of the 95% quantile estimators for the mixture (lognormal/Pareto) distribution, n = 200, and associated box-plots for the 11 quantile estimators (R benchmark, Epanechnikov, Harrell-Davis, Padgett, Park, Beta 1, MACRO-Beta 1, MICRO-Beta 1, Beta 2, MACRO-Beta 2, MICRO-Beta 2).

SLIDE 92

Figure 42 – Comparing MSE ratios for 6 estimators, the normal distribution case (panels: HD (Harrell-Davis), B1 (Beta 1), MACB1 (MACRO-Beta 1), PRK (Park); n = 50, 100, 200, 500).

SLIDE 93

Figure 43 – Comparing MSE ratios for 6 estimators, the Weibull distribution case (panels: HD (Harrell-Davis), MACB1 (MACRO-Beta 1), MICB1 (MICRO-Beta 1), PRK (Park); n = 50, 100, 200, 500).

SLIDE 94

Figure 44 – Comparing MSE ratios for 9 estimators, the lognormal distribution case (panels: HD, MACB1, MICB1, B1, PRK, MACB2, MICB2, B2; n = 50, 100, 200, 500).

SLIDE 95

Figure 45 – Comparing MSE ratios for 9 estimators, the mixture distribution case (panels: HD, MACB1, MICB1, B1, PRK, MACB2, MICB2, B2; n = 50, 100, 200, 500).

SLIDE 96

Some references

Charpentier, A., Fermanian, J.-D. & Scaillet, O. (2006). The estimation of copulas: theory and practice. In: Copula Methods in Derivatives and Risk Management: From Credit Risk to Market Risk. Risk Books.

Charpentier, A. & Oulidi, A. (2008). Beta kernel estimation for Value-at-Risk of heavy-tailed distributions. In revision, Journal of Computational Statistics and Data Analysis.

Charpentier, A. & Oulidi, A. (2007). Estimating allocations for Value-at-Risk portfolio optimization. To appear in Mathematical Methods of Operations Research.

Chen, S. X. (1999). A Beta kernel estimator for density functions. Computational Statistics & Data Analysis, 31, 131-145.

Devroye, L. & Györfi, L. (1981). Nonparametric Density Estimation: The L1 View. Wiley.

Gouriéroux, C. & Monfort, A. (2006). (Non) consistency of the Beta kernel estimator for recovery rate distribution. CREST DP 2006-32.