The Many Faces of a Simple Identity Larry Goldstein University of - PowerPoint PPT Presentation

Introduction Poisson Normal Other Distributions Concentration Poincaré and Malliavin Shrinkage and SURE
The Many Faces of a Simple Identity
Larry Goldstein, University of Southern California
ICML Workshop, June 15th 2019

Guided Tour

In the Beginning

Itinerary
1. Stein Identity
2. Distributional Approximation
3. Concentration
4. Second order Poincaré Inequalities, and Malliavin Calculus
5. Shrinkage, Unbiased Risk Estimation

Poisson Distribution (Chen 1975)
A non-negative integer valued random variable $W$ is distributed $\mathcal{P}_\lambda$ if and only if
$$E[W f(W)] = \lambda E[f(W+1)] \quad \text{for all } f \in \mathcal{F}.$$
For any $W \ge 0$ with mean $\lambda \in (0,\infty)$, the size bias distribution is that of $W^s$ satisfying
$$E[W f(W)] = \lambda E[f(W^s)] \quad \text{for all } f \in \mathcal{F}.$$
Restatement: $W^s =_d W + 1$ if and only if $W \sim \mathcal{P}_\lambda$.
$$d_{TV}(W, \mathcal{P}_\lambda) \le (1 - e^{-\lambda})\, E|(W^s - 1) - W|.$$
Applications, e.g., to matchings in molecular sequence analysis.
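
The Poisson characterization can be checked numerically. The following is a minimal sketch, not part of the talk: the helper names `poisson_pmf` and `stein_chen_sides`, the truncation point `kmax`, and the test function are all illustrative assumptions. It evaluates both sides of E[W f(W)] = λ E[f(W + 1)] by summing against the Poisson probability mass function.

```python
import math

def poisson_pmf(k: int, lam: float) -> float:
    """P(W = k) for W ~ Poisson(lam), computed in log space for stability."""
    return math.exp(-lam + k * math.log(lam) - math.lgamma(k + 1))

def stein_chen_sides(f, lam: float, kmax: int = 200):
    """Return (E[W f(W)], lam * E[f(W + 1)]), truncating both sums at kmax.

    kmax is an assumed cutoff; the neglected Poisson tail must be negligible.
    """
    lhs = sum(k * f(k) * poisson_pmf(k, lam) for k in range(kmax + 1))
    rhs = lam * sum(f(k + 1) * poisson_pmf(k, lam) for k in range(kmax + 1))
    return lhs, rhs

# A bounded test function, chosen only for illustration.
lhs, rhs = stein_chen_sides(lambda k: 1.0 / (1.0 + k), lam=2.5)
print(lhs, rhs)  # the two values should agree up to rounding
```

Replacing the Poisson mass function by any other law with mean λ would, in general, break the agreement, which is the "only if" half of the characterization.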

$$d_{TV}(W, \mathcal{P}_\lambda) \le (1 - e^{-\lambda})\, E|(W^s - 1) - W|$$
Simple Example: Let
$$W = \sum_{i=1}^n X_i \quad \text{with } \lambda = E[W],$$
the sum of independent Bernoullis with $p_i = E[X_i] \in (0,1)$. Then
$$W^s = W - X_I + 1, \quad \text{where } P(I = i) = p_i/\lambda, \ I \text{ independent of } X_1, \ldots, X_n.$$
Then
$$d_{TV}(W, \mathcal{P}_\lambda) \le (1 - e^{-\lambda})\, E X_I = \frac{1 - e^{-\lambda}}{\lambda} \sum_{i=1}^n p_i^2.$$
If $p_i = \lambda/n$ then the bound specializes to $\lambda(1 - e^{-\lambda})/n$.
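
For equal success probabilities the sum is Binomial(n, λ/n), so the bound can be compared with the exact total variation distance. This is a sketch under stated assumptions: the function names are hypothetical, and total variation is taken as the supremum over sets, that is, half the L1 distance between the mass functions.

```python
import math

def binom_pmf(k: int, n: int, p: float) -> float:
    """Binomial(n, p) mass at k, in log space to avoid overflow (needs 0 < p < 1)."""
    log_comb = math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
    return math.exp(log_comb + k * math.log(p) + (n - k) * math.log(1.0 - p))

def poisson_pmf(k: int, lam: float) -> float:
    """Poisson(lam) mass at k, in log space."""
    return math.exp(-lam + k * math.log(lam) - math.lgamma(k + 1))

def tv_binomial_poisson(n: int, lam: float) -> float:
    """Exact total variation distance between Binomial(n, lam/n) and Poisson(lam)."""
    p = lam / n
    l1 = sum(abs(binom_pmf(k, n, p) - poisson_pmf(k, lam)) for k in range(n + 1))
    # Poisson mass above n, where the binomial has no mass.
    tail = max(1.0 - sum(poisson_pmf(k, lam) for k in range(n + 1)), 0.0)
    return 0.5 * (l1 + tail)

def size_bias_bound(n: int, lam: float) -> float:
    """The specialized bound lam * (1 - exp(-lam)) / n."""
    return lam * (1.0 - math.exp(-lam)) / n

for n in (10, 50, 200):
    print(n, tv_binomial_poisson(n, 1.0), size_bias_bound(n, 1.0))
```

Both columns decay at rate 1/n, with the exact distance staying below the bound.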

The Big Question

Stein Identity for Standard Gaussian
Let $Y$ be normal $N(\theta, \sigma^2)$ with density
$$\phi_{\theta,\sigma^2}(t) = e^{-(t-\theta)^2/2\sigma^2} / \sqrt{2\pi\sigma^2}.$$
Then a random variable $W$ has the same distribution as $Y$ if and only if
$$E[(W - \theta) f(W)] = \sigma^2 E[f'(W)] \quad \text{for all } f \in \mathcal{F},$$
where $\mathcal{F}$ is some sufficiently rich class of smooth functions, for example:
1. All functions $f$ for which the two sides above exist.
2. All functions in $\mathrm{Lip}_1 = \{ f : |f(x) - f(y)| \le |x - y| \}$.
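
The normal identity can be checked by numerical integration. The sketch below is illustrative only: `normal_expectation`, the midpoint rule, the integration window, and the choice f = sin are assumptions made here, not part of the talk. For f = sin the right side also has the closed form σ² cos(θ) exp(−σ²/2), which gives an independent check.

```python
import math

def normal_expectation(g, theta: float, sigma: float,
                       half_width: float = 12.0, steps: int = 20000) -> float:
    """Midpoint-rule approximation of E[g(W)] for W ~ N(theta, sigma^2).

    Integrates over theta +/- half_width * sigma; the mass outside is negligible.
    """
    a = theta - half_width * sigma
    h = 2.0 * half_width * sigma / steps
    norm = math.sqrt(2.0 * math.pi * sigma ** 2)
    total = 0.0
    for i in range(steps):
        t = a + (i + 0.5) * h
        density = math.exp(-((t - theta) ** 2) / (2.0 * sigma ** 2)) / norm
        total += g(t) * density * h
    return total

theta, sigma = 0.7, 1.5
# Left side: E[(W - theta) f(W)] with f = sin.
lhs = normal_expectation(lambda w: (w - theta) * math.sin(w), theta, sigma)
# Right side: sigma^2 E[f'(W)] with f' = cos.
rhs = sigma ** 2 * normal_expectation(math.cos, theta, sigma)
print(lhs, rhs)
```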

Proof of Stein Identity; Standard normal case
Direction: normality of $W$ implies equality for all $f \in \mathcal{F}$. Some say integration by parts: with $\phi(t) = e^{-t^2/2}/\sqrt{2\pi}$,
$$t\phi(t) = -\phi'(t), \quad \text{hence} \quad E[W f(W)] = E[f'(W)].$$
This requires restricting to a finite interval, resulting in boundary terms, on which conditions will be needed for taking the limit.
Use Fubini as Stein did, breaking into positive and negative parts:
$$\int_0^\infty f'(w)\phi(w)\,dw = -\int_0^\infty \int_w^\infty f'(w)\phi'(t)\,dt\,dw = \int_0^\infty t\phi(t) \int_0^t f'(w)\,dw\,dt = \int_0^\infty t\phi(t)[f(t) - f(0)]\,dt.$$
Combining with the portion on $(-\infty, 0]$, obtain
$$E[f'(W)] = E[W(f(W) - f(0))] = E[W f(W)].$$

Stein Equation
For a given class of functions $\mathcal{H}$ (e.g. $\mathrm{Lip}_1$), and distributions of random variables $X$ and $Y$, let (e.g. Wasserstein distance)
$$d_{\mathcal{H}}(X, Y) = \sup_{h \in \mathcal{H}} |Eh(X) - Eh(Y)|.$$
Given a mean zero, variance 1 random variable $W$, a standard normal $Z$, and a test function $h$ in a class $\mathcal{H}$, bound the difference $Eh(W) - Eh(Z)$.
Now, reason as follows: since this difference and $E[f'(W) - W f(W)]$ are both zero when $W$ is normal, let's equate them.

Stein Equation (1)

Stein Equation and Couplings
Stein equation for the standard normal:
$$f'(x) - x f(x) = h(x) - Eh(Z).$$
Now, to compute the expectation of the right hand side involving $h$ in order to bound $d_{\mathcal{H}}(W, Z)$, let's solve a differential equation for $f$ and compute the expectation $E[f'(W) - W f(W)]$ of the left. This would at first glance appear to make the problem harder. However, there is only one random variable in this expectation, rather than two. The left hand side expectation can be handled using constructions of auxiliary random variables, that is, couplings.
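
The step "solve a differential equation for f" can be made explicit with an integrating factor. The derivation below is a standard sketch added for illustration; the notation $f_h$ is introduced here and does not appear in the slides.

```latex
% Multiply the Stein equation by the integrating factor e^{-x^2/2}:
\frac{d}{dx}\Bigl( e^{-x^2/2} f(x) \Bigr)
  = e^{-x^2/2}\bigl( f'(x) - x f(x) \bigr)
  = e^{-x^2/2}\bigl( h(x) - Eh(Z) \bigr).
% Integrate from -\infty to x and solve for f:
f_h(x) = e^{x^2/2} \int_{-\infty}^{x} \bigl( h(t) - Eh(Z) \bigr) e^{-t^2/2} \, dt .
% Since \int_{-\infty}^{\infty} ( h(t) - Eh(Z) ) e^{-t^2/2} \, dt = 0,
% the same solution can also be written as an integral over the upper tail:
f_h(x) = - e^{x^2/2} \int_{x}^{\infty} \bigl( h(t) - Eh(Z) \bigr) e^{-t^2/2} \, dt .
```

The two equivalent forms are what keep $f_h$ bounded as $x \to \pm\infty$ despite the growing factor $e^{x^2/2}$.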

Extend Stein Identity
One direction of the Stein identity: for $W$ with $E[W] = 0$ and $\mathrm{Var}(W) = 1$,
$$E[W f(W)] = E[f'(W)] \quad \text{for all } f \in \mathcal{F} \qquad (1)$$
only if $W \sim N(0, 1)$. So if $W$ has any other distribution, (1) does not hold.
Can we modify the identity, or make some similar identity, so that it holds for a different distribution of $W$?

Some Options
Feel free to add to the list!
1. Stein's exchangeable pair
2. Stein Kernels
3. Size Bias
4. Zero Bias
5. Score function

Stein Kernels and Zero Bias Coupling
Modify the right hand side of the identity
$$E[W f(W)] = E[f'(W)] \quad \text{for all } f \in \mathcal{F}$$
in some way to accommodate a non-normal distribution.
Stein Kernel (Cacoullos and Papathanasiou '92):
$$E[W f(W)] = E[T f'(W)] \quad \text{for all } f \in \mathcal{F}$$
Zero Bias (G. and Reinert '97):
$$E[W f(W)] = E[f'(W^*)] \quad \text{for all } f \in \mathcal{F}$$
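
A small worked case may help fix the zero bias idea. If W takes the values +1 and −1 with probability 1/2 each, then E[W f(W)] = (f(1) − f(−1))/2, which by the fundamental theorem of calculus equals E[f'(U)] for U uniform on [−1, 1]; so the uniform law on [−1, 1] serves as W* here. The sketch below checks this numerically; the function names and the test function are illustrative assumptions.

```python
import math

def rademacher_lhs(f) -> float:
    """E[W f(W)] for W = +1 or -1 with probability 1/2 each."""
    return 0.5 * (1.0 * f(1.0) + (-1.0) * f(-1.0))

def uniform_rhs(f_prime, steps: int = 20000) -> float:
    """E[f'(U)] for U ~ Uniform[-1, 1], approximated by the midpoint rule."""
    h = 2.0 / steps
    integral = sum(f_prime(-1.0 + (i + 0.5) * h) for i in range(steps)) * h
    return integral / 2.0  # the uniform density on [-1, 1] is 1/2

# Test function and its derivative, chosen only for illustration.
f = lambda x: x ** 3 + math.sin(x)
f_prime = lambda x: 3.0 * x ** 2 + math.cos(x)

print(rademacher_lhs(f), uniform_rhs(f_prime))
```

Note that W is discrete while the law used for W* is continuous, so the two cannot be equal; a coupling of the two is what the distance bounds are built on.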

Use of Stein Kernels: $E[W f(W)] = E[T f'(W)]$
Given $h \in \mathcal{H}$ let $f$ be the unique bounded solution to
$$f'(x) - x f(x) = h(x) - Eh(Z).$$
Then, using Stein kernels, for $\mathcal{H} = \{ h : \mathbb{R} \to [0, 1] \}$,
$$|Eh(W) - Eh(Z)| = |E[f'(W) - W f(W)]| = |E[f'(W) - T f'(W)]| = |E[(1 - T) f'(W)]| \le \|f'\|\, E|T - 1| \le 2 E|T - 1|.$$
Taking the supremum over this choice of $\mathcal{H}$ on the left hand side yields
$$d_{TV}(W, Z) \le 2 E|T - 1|,$$
a bound on the total variation distance.

Use of Zero Bias Coupling: $E[W f(W)] = E[f'(W^*)]$
Given $h \in \mathcal{H}$ let $f$ be the unique bounded solution to
$$f'(x) - x f(x) = h(x) - Eh(Z).$$
Then, using zero bias, for $\mathcal{H} = \mathrm{Lip}_1$,
$$|Eh(W) - Eh(Z)| = |E[f'(W) - W f(W)]| = |E[f'(W) - f'(W^*)]| \le \|f''\|\, E|W - W^*|.$$
Using $\|f''\| \le 2$ for $h \in \mathrm{Lip}_1$, taking the infimum over all couplings on the right, and then the supremum over this choice of $\mathcal{H}$ on the left hand side yields
$$d_1(W, Z) \le 2\, d_1(W, W^*),$$
a bound on the Wasserstein distance.

Other Distributions
Classical: Poisson, Gamma, Binomial, Multinomial, Beta, Stable laws, Rayleigh, ...
Not so classical: PRR distribution, Dickman distribution, ...
Dickman characterizations: for $W \ge 0$ and independent $U \sim U[0, 1]$,
$$W^s =_d W + U \quad \text{and} \quad W =_d U(W + 1).$$
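
The second Dickman characterization suggests a simple way to simulate. Unrolling W = U(W + 1) with fresh independent uniforms gives the series U1 + U1 U2 + U1 U2 U3 + ..., which the sketch below truncates. The function name, the truncation length, the seed, and the sample size are assumptions made for illustration. Taking expectations in W = U(W + 1) gives E[W] = 1, and the size bias relation with f(x) = x then gives E[W²] = E[W + U] = 3/2, which the sample moments can be compared against.

```python
import random

def sample_dickman(rng: random.Random, terms: int = 60) -> float:
    """Approximate draw from the fixed point of W =_d U (W + 1).

    Uses the truncated series U1 + U1*U2 + U1*U2*U3 + ...; the neglected
    remainder has mean 2**(-terms), which is negligible for terms = 60.
    """
    total, product = 0.0, 1.0
    for _ in range(terms):
        product *= rng.random()
        total += product
    return total

rng = random.Random(0)  # fixed seed so the run is reproducible
samples = [sample_dickman(rng) for _ in range(20000)]
mean = sum(samples) / len(samples)
second_moment = sum(w * w for w in samples) / len(samples)
print(mean, second_moment)  # compare with 1 and 3/2
```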
