S2F2 - Hauptseminar Stochastische Prozesse und Stochastische Analysis (WS2019/20)
Große Abweichungen / Large Deviations
M. Gubinelli / N. Barashkov
Topic. The theory of large deviations deals in a systematic way with the computation of probabilities of "exponentially unlikely" events. It has become one of the most important tools of probability theory and permits the treatment of numerous applied problems. In this seminar we will work through the main foundations of the theory and also get to know some interesting applications. The basis is the book "A Weak Convergence Approach to the Theory of Large Deviations" by Paul Dupuis and Richard S. Ellis.

Prerequisites. At least an introduction to probability theory (Einführung in die W-Theorie), ideally also some stochastic processes.

Literature. Dupuis, Paul, and Richard S. Ellis. 1997. A Weak Convergence Approach to the Theory of Large Deviations. Wiley Series in Probability and Statistics. John Wiley & Sons, Inc., New York. https://doi.org/10.1002/9781118165904.
Introduction
Wolf's dice data
Rudolph Wolf (1816–1893, Swiss astronomer)

∑_i (N_i − pN)² / N ≈ (76.87)²

"die Würfelseiten nicht als gleichmögliche Fälle sich darstellen" (the faces of the die do not present themselves as equally likely cases)
References:
- https://en.wikipedia.org/wiki/Edwin_Thompson_Jaynes
- http://bayes.wustl.edu/etj/articles/entropy.concentration.pdf
Boltzmann's discovery: Sanov's theorem
For sequences (X_n)_{n≥1} of iid variables on the finite set 𝒳 = {1, …, N} with common law ρ ∈ Π(𝒳) we can define the empirical vector L_n with values in the compact metrizable space Π(𝒳) = {p ∈ [0,1]^N : p_1 + ⋯ + p_N = 1} as

L_n(i) = (1/n) ∑_{k=1}^n 1_{X_k = i} = #{1 ≤ k ≤ n : X_k = i}/n

and let µ_n be the law of L_n (thus µ_n ∈ Π(Π(𝒳))). Relative entropy of ν w.r.t. µ:

H(ν ∣ µ) = ∑_{i=1}^N ν(i) log( ν(i) / µ(i) ).
- Theorem. (Sanov) For every Borel set A ⊂ Π(𝒳) the sequence (µ_n)_n satisfies

−inf_{ν ∈ A°} H(ν ∣ ρ) ≤ liminf_n (1/n) log µ_n(A°) ≤ limsup_n (1/n) log µ_n(Ā) ≤ −inf_{ν ∈ Ā} H(ν ∣ ρ).

That is, (µ_n)_n satisfies a large deviation principle on Π(𝒳) with rate function I(ν) = H(ν ∣ ρ).
This formulation of large deviations was introduced by Donsker and Varadhan.
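A quick numerical illustration of Sanov's theorem (a sketch; the two-letter alphabet and the values n = 2000, a = 0.7 are illustrative choices). For 𝒳 = {1, 2}, ρ = (1/2, 1/2) and A = {ν : ν(1) ≥ a}, the exact value of (1/n) log µ_n(A), computed from the binomial distribution, is already close to −inf_{ν ∈ A} H(ν ∣ ρ) = −H(Ber(a) ∣ Ber(1/2)):

```python
import math

def rel_entropy(nu, rho):
    """Relative entropy H(nu | rho) of two distributions on a finite set."""
    return sum(a * math.log(a / b) for a, b in zip(nu, rho) if a > 0)

def log_binom_pmf(n, k, p):
    """log P(Binomial(n, p) = k), computed stably via lgamma."""
    return (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
            + k * math.log(p) + (n - k) * math.log(1 - p))

def logsumexp(xs):
    m = max(xs)
    return m + math.log(sum(math.exp(x - m) for x in xs))

# Two-point alphabet, rho = (1/2, 1/2); the event A = {nu : nu(1) >= a}.
n, p, a = 2000, 0.5, 0.7
log_mu_n_A = logsumexp([log_binom_pmf(n, k, p)
                        for k in range(math.ceil(a * n), n + 1)])
lhs = log_mu_n_A / n                          # (1/n) log mu_n(A)
rhs = -rel_entropy([a, 1 - a], [p, 1 - p])    # -inf_{nu in A} H(nu | rho)
```

Here lhs ≤ rhs exactly (the Chernoff bound), and the two agree up to an O(log n / n) correction.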
Laplace's principle
- Theorem. (Laplace principle) (µ_n)_n has large deviations on 𝒴 with rate n and rate function I iff

lim_n (1/n) log ∫_𝒴 e^{−n f(x)} µ_n(dx) = −inf_{x ∈ 𝒴} (f(x) + I(x))

for all bounded (Lipschitz) continuous f : 𝒴 → ℝ.
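A numerical sketch of the Laplace principle for Bernoulli empirical means (the concrete choices n = 2000, p = 1/2 and the test function f(x) = (x − 0.8)² are illustrative, not from the seminar text). The left-hand side is computed exactly from the binomial pmf, the infimum on a fine grid, with I = J_p the Bernoulli rate function from Cramér's theorem:

```python
import math

def J(x, p=0.5):
    """Cramer rate function J_p(x) = H(Ber(x) | Ber(p))."""
    def t(u, v):
        return 0.0 if u <= 0 else u * math.log(u / v)
    return t(x, p) + t(1 - x, 1 - p)

def f(x):
    """A bounded Lipschitz test function on [0, 1] (arbitrary choice)."""
    return (x - 0.8) ** 2

n, p = 2000, 0.5
# Exact value of (1/n) log E[exp(-n f(S_n / n))] from the binomial pmf.
logs = [math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
        + k * math.log(p) + (n - k) * math.log(1 - p) - n * f(k / n)
        for k in range(n + 1)]
m = max(logs)
lhs = (m + math.log(sum(math.exp(x - m) for x in logs))) / n
# -inf_x (f(x) + J(x)), approximated on a fine grid.
rhs = -min(f(i / 10000) + J(i / 10000) for i in range(10001))
```

Already at n = 2000 the two sides agree to a few parts in a thousand.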
- Example. Revisiting dice throwing. Observed entropy of Wolf's data:

h = H(L̂_n ∣ ρ) = 0.0067696,  n = 200000.

By large deviations: ℙ(H(L_n ∣ ρ) ≥ h) ≈ e^{−nh} ≈ 10^{−588}!!
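A sanity check of the order of magnitude, using only the n and h quoted above:

```python
import math

n, h = 200000, 0.0067696
log10_prob = -n * h / math.log(10)   # log10 of exp(-n h)
print(round(log10_prob))             # -588
```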
Gibbsian conditioning
In the previous setting fix some integer k ≥ 1 and consider the law µ_n ∈ Π(𝒳^k) of (X_1, …, X_k) conditional on an event involving L_n = n^{−1} ∑_{i=1}^n δ_{X_i}, the empirical measure of the vector (X_1, …, X_n):

µ_n(f) = ∫_{𝒳^k} f(x) µ_n(dx) = E[f(X_1, …, X_k) ∣ L_n ∈ B]

where B ∈ ℬ(Π(𝒳)). We will work with k = 1, the generalization to higher k being easy.
- Lemma. Assume that B is closed and inf_{B°} H_ρ = min_B H_ρ = H_ρ(ν̂) for a unique ν̂; then µ_n(f) → ν̂(f).

Interesting case: B = {ν ∈ Π(𝒳) : ν(φ) ∈ [e, e + δ]}. Take δ > 0 small and e ∈ ℝ such that E[φ(X_1)] < e < sup φ, so that ν(φ) ≈ e is atypical for ρ: by the LLN, L_n(φ) → E[φ(X_1)] a.s. Let λ ∈ ℝ and introduce the "tilted" measures ρ_λ = e^{λφ} ρ / Z(λ) with Z(λ) = ρ(e^{λφ}), and observe that

H_ρ(ν) = H_{ρ_λ}(ν) + λ ν(φ) − log Z(λ).

Choosing λ > 0 such that ρ_λ(φ) = e we get

H_ρ(ρ_λ) = λe − log Z(λ) = min_{ν : ν(φ) ∈ [e, e+δ]} [H_{ρ_λ}(ν) + λ ν(φ)] − log Z(λ) = min_B H_ρ,

so ν̂ = ρ_λ.
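A sketch of the tilting construction for a fair die with φ(i) = i and a hypothetical atypical mean e = 4.5 (all concrete values here are illustrative choices, not from the text). Since λ ↦ ρ_λ(φ) is increasing, the tilt with ρ_λ(φ) = e can be found by bisection:

```python
import math

# Fair die rho, observable phi(i) = i, hypothetical atypical mean e = 4.5.
rho = [1 / 6] * 6
phi = [1, 2, 3, 4, 5, 6]
target = 4.5

def tilted(lam):
    """Tilted measure rho_lambda(i) = exp(lam * phi(i)) rho(i) / Z(lam)."""
    w = [math.exp(lam * x) * r for x, r in zip(phi, rho)]
    Z = sum(w)
    return [v / Z for v in w]

def mean(nu):
    """nu(phi), the mean of phi under nu."""
    return sum(x * q for x, q in zip(phi, nu))

# Bisection on lambda: mean(tilted(lam)) is increasing in lam.
lo, hi = 0.0, 5.0
for _ in range(100):
    mid = (lo + hi) / 2
    if mean(tilted(mid)) < target:
        lo = mid
    else:
        hi = mid
nu_hat = tilted((lo + hi) / 2)   # the conditioned limit law of the lemma
```

The resulting ν̂ puts increasing weight on the larger faces, the cheapest way (in relative entropy) to realize the atypical mean.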
Physical interpretation
Consider an assembly of n independent particles, each of them characterized by some quantity X_i, i = 1, …, n, taking values in 𝒳 (e.g. energy, momentum, position, etc.), and assume that the allowed configurations of the whole system are those compatible with a given mean value of some function φ : 𝒳 → ℝ: ∑_i φ(X_i)/n ≈ e (e.g. energy per particle, density, etc.). This constraint is macroscopic in the sense that it involves only an average over all the particles. Then, in the limit of an infinite system (n → ∞; in reality n ∼ 10²³), the configurations of a very small subsystem of size k (in our model k is fixed as n → ∞) are described by iid configurations, each particle being distributed as ρ_λ, the Gibbs distribution compatible with the macroscopic constraint. This is the mathematical basis of statistical mechanics.
Jupiter's red spot
Can be mathematically understood via large deviations (see the bachelor thesis of Adrian Rieckert)
Mogulskii theorem
Let (X_n)_{n≥1} be an iid sequence of Bernoulli(p) r.v.'s and X^n = (X_1, …, X_n). Let

F_n(x_1, …, x_n)(θ) = ∑_{i=1}^n x_i 1_{θ ∈ [(i−1)/n, i/n)}

so that F_n(X^n) is a random element of 𝒦 = {f ∈ L^∞([0,1]) : ‖f‖_{L^∞} ≤ 1}, and we denote by µ_n its law. On 𝒦 we define a distance by taking a countable dense subset {φ_k}_{k≥1} of the unit ball of L^1 and letting

d(f, g) = ∑_{k≥1} 2^{−k} |φ_k(f) − φ_k(g)|,   where φ_k(f) := ∫_0^1 φ_k(θ) f(θ) dθ.

Another possible distance is given by

d(f, g) = sup_{0≤t≤1} |∫_0^t (f(θ) − g(θ)) dθ|.
Let J_p(x) = H(Ber(x) ∣ Ber(p)) = x log(x/p) + (1 − x) log((1 − x)/(1 − p)).
- Theorem. (Mogulskii) The sequence (µ_n)_n obeys the LDP on 𝒦 with rate function

I(f) = ∫_0^1 J_p(f(θ)) dθ.
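The rate functional is easy to evaluate numerically; a sketch using the midpoint rule (the profiles below are illustrative choices). The typical profile f ≡ p has zero cost, while for p = 1/2 the ramp f(θ) = θ costs ∫_0^1 J_{1/2}(θ) dθ = log 2 − 1/2:

```python
import math

def J(x, p):
    """J_p(x) = H(Ber(x) | Ber(p))."""
    def t(u, v):
        return 0.0 if u <= 0 else u * math.log(u / v)
    return t(x, p) + t(1 - x, 1 - p)

def rate(f, p=0.5, m=10000):
    """Midpoint-rule approximation of I(f) = int_0^1 J_p(f(theta)) d theta."""
    return sum(J(f((i + 0.5) / m), p) for i in range(m)) / m

I_const = rate(lambda th: 0.5)   # the typical (LLN) profile: zero cost
I_ramp = rate(lambda th: th)     # a nontrivial profile with positive cost
```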
Large deviations for random walks
Let (X_n)_{n≥1} be a sequence of iid Bernoulli(p) random variables. Consider the process S_n = X_1 + ⋯ + X_n with S_0 = 0. Define a continuous random function φ^n on [0,1] by

φ^n(t) = S_k/n + (S_{k+1} − S_k)(t − k/n)   for k/n ≤ t < (k+1)/n,

and denote by µ_n the law of φ^n. Let ℒ be the subset of C([0,1]) such that f ∈ ℒ if and only if f(0) = 0 and |f(t) − f(s)| ≤ |t − s| for all 0 ≤ s ≤ t ≤ 1. Observe that φ^n is a piecewise linear function with φ^n(k/n) = S_k/n.
- Theorem. The sequence (µ_n)_n obeys the LDP on ℒ with rate function

I(f) = ∫_0^1 J_p(f′(s)) ds,

where f′(s) is the derivative of f ∈ ℒ (which exists almost everywhere since f is Lipschitz).
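For the straight path f(t) = at the rate reduces to I(f) = J_p(a), recovering Cramér's theorem. A numerical sketch (the values n, p, a are arbitrary illustrative choices) comparing −(1/n) log ℙ(S_n = an) with J_p(a):

```python
import math

def J(x, p):
    """J_p(x) = H(Ber(x) | Ber(p))."""
    def t(u, v):
        return 0.0 if u <= 0 else u * math.log(u / v)
    return t(x, p) + t(1 - x, 1 - p)

n, p, a = 5000, 0.5, 0.7
k = round(a * n)                       # S_n = k realizes the path f(t) = a t
log_pmf = (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
           + k * math.log(p) + (n - k) * math.log(1 - p))
empirical_rate = -log_pmf / n          # -(1/n) log P(S_n = a n)
theoretical_rate = J(a, p)             # I(f) = int_0^1 J_p(a) ds = J_p(a)
```

The discrepancy is the subexponential local-CLT prefactor, of order (log n)/n.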
Seminarplan (week / topic / speaker)

1. Large deviations in terms of the Laplace principle (1.1-1.2)
2. Basic results in the theory (1.3)
3. Properties of relative entropy (1.4)
4. Γ-convergence and Gibbsian conditioning (notes)
5. Sanov's theorem: statement and representation formula (2.1-2.3)
6. Lower and upper bounds (2.4-2.5)
7. Mogulskii's theorem: representation formula (3.1-3.2)
8. Upper bound and rate function (3.3)
9. Statement of the theorem and proof + Cramér's theorem + comments (3.4-3.5-3.6)
10. Random walk model, representation formula + compactness (5.2-5.3)
11. Upper bound and rate function (6.2)
12. Lower bound and statement of the theorem (6.5)
13. Markov chains, representation formula + compactness (8.2)
14. Upper bound and rate function (8.3-8.4)
15. Properties of rate function and lower bound (8.5-8.6)
16. ???