SLIDE 1

How to estimate a density on a spider web?

Dominique Picard

2020

SLIDE 2

Spider web from exhibition On Air Tomas Saraceno

SLIDE 3

spider web

SLIDE 4

spider web

SLIDE 5

Spider

SLIDE 6

Density estimation problem

  • One observes X1, . . . , Xn, i.i.d. random variables defined on a space M with common density f.
  • M is equipped with a distance ρ and a measure µ.
  • Our aim is to estimate the density f.
SLIDE 7

M = spider web

We identify M with a graph (simple, undirected, no loops) with T vertices and its edges. The adjacency matrix is given by Aij = 1 if there is an edge between i and j, and 0 otherwise. M is naturally equipped with a (geodesic) distance

$$\rho(x, y) = \inf_{\gamma \subset M,\ \gamma(0) = x,\ \gamma(L) = y} \ \sum_{i=0}^{L-1} A_{\gamma(i)\gamma(i+1)},$$

the infimum being taken over paths γ in M joining x to y.

SLIDE 8

M graph Laplacian

We compute the Laplacian matrix L of size T × T as L = D − A, where A is the adjacency matrix of the graph and D is the (diagonal) degree matrix (popularity of each vertex):

$$D_{ii} = \sum_{j \neq i} A_{ij}$$

is the degree of vertex i.
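In code this construction is immediate; a minimal NumPy sketch (the 4-cycle adjacency matrix is an illustrative stand-in for an actual web graph):

```python
import numpy as np

# Adjacency matrix of a small toy graph (a 4-cycle); a real spider web
# would be encoded the same way, just with more vertices.
A = np.array([[0, 1, 0, 1],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [1, 0, 1, 0]], dtype=float)

D = np.diag(A.sum(axis=1))  # degree matrix: D_ii = sum_{j != i} A_ij
L = D - A                   # graph Laplacian L = D - A

# L is symmetric positive semi-definite; its smallest eigenvalue is 0,
# with the constant vector as eigenvector.
eigenvalues = np.linalg.eigvalsh(L)
```

For the 4-cycle the eigenvalues come out as 0, 2, 2, 4, the increasing sequence λ1 ≤ . . . ≤ λT used on the next slide.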

SLIDE 9

Spectral decomposition of L

L has eigenvalues λ1 ≤ . . . ≤ λT and (normed) eigenvectors V¹, . . . , V^T, with V^j = (V^j_1, . . . , V^j_T).

Our estimator is, for x a point (vertex) of the graph:

$$\hat K_\delta(x) := \frac{1}{n} \sum_{i=1}^{n} \sum_{j=1}^{T} \Phi(\delta\sqrt{\lambda_j})\, V^j_{X_i} V^j_x = \sum_{j=1}^{T} \Phi(\delta\sqrt{\lambda_j})\, V^j_x \left( \frac{1}{n} \sum_{i=1}^{n} V^j_{X_i} \right).$$

δ is the bandwidth and Φ is a 'Littlewood-Paley function': Φ is a C∞(R) real-valued function with the following properties: supp(Φ) ⊂ [0, 1] and Φ(λ) = 1 for λ ∈ [0, 1/2].
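A sketch of this estimator in NumPy. The smooth cutoff below is one valid choice of Littlewood-Paley function (built from exp(−1/x)); the graph and the sample are whatever the application provides:

```python
import numpy as np

def phi(t):
    """A C-infinity Littlewood-Paley cutoff: 1 on [0, 1/2], 0 outside [0, 1]."""
    t = np.atleast_1d(np.asarray(t, dtype=float))
    out = np.zeros_like(t)
    out[t <= 0.5] = 1.0
    mid = (t > 0.5) & (t < 1.0)
    a = np.exp(-1.0 / (1.0 - t[mid]))   # smooth transition on (1/2, 1)
    b = np.exp(-1.0 / (t[mid] - 0.5))
    out[mid] = a / (a + b)
    return out

def spectral_kde(L, samples, delta):
    """hat{K}_delta at every vertex x: sum_j Phi(delta*sqrt(lambda_j)) V^j_x
    times the empirical coefficient (1/n) sum_i V^j_{X_i}."""
    lam, V = np.linalg.eigh(L)                # columns V[:, j] = eigenvector V^j
    weights = phi(delta * np.sqrt(np.clip(lam, 0.0, None)))
    coeff = V[samples, :].mean(axis=0)        # (1/n) sum_i V^j_{X_i}
    return V @ (weights * coeff)
```

As δ → 0 all weights tend to 1 and the estimate reduces to the empirical frequency of each vertex; larger δ damps the high-frequency eigenvectors and smooths the estimate along the graph.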

SLIDE 10

Theoretical part: Kernel and wavelet density estimators on manifolds and more general metric spaces

Issued from a joint work:

  • G. Cleanthous, A. Georgiadis, G. Kerkyacharian, P. Petrushev, D. P.

SLIDE 11

Setting and motivation

We assume that (M, ρ, µ) is a metric measure space equipped with a distance ρ and a measure µ. X1, . . . , Xn are i.i.d. random variables on M with a common density function (pdf) f with respect to the measure µ. Our purpose is to estimate the density f. To an estimator $\hat f_n$ of f, we associate its risk

$$R_n(\hat f, f, p) = E_f \left( \int_M |\hat f_n(x) - f(x)|^p\, \mu(dx) \right)^{1/p}, \quad 1 \le p < \infty,$$

as well as its L∞ risk: $R_n(\hat f, f, \infty) = E_f \|\hat f_n - f\|_\infty$. We will operate in the following setting. Most of the material can be found in an extended form in the papers [4, 8] of Coulhon, Kerkyacharian and Petrushev.

SLIDE 12

Doubling condition: setting the dimension

  • C1. We assume that the metric space (M, ρ, µ) satisfies the so-called doubling volume condition:

$$\mu(B(x, 2r)) \le c_0\, \mu(B(x, r)) \quad \text{for any } x \in M \text{ and } r > 0, \qquad (1)$$

where B(x, r) := {y ∈ M : ρ(x, y) < r} and c0 > 1 is a constant. The above implies that there exist constants c′0 ≥ 1 and d > 0 such that

$$\mu(B(x, \lambda r)) \le c_0' \lambda^d\, \mu(B(x, r)) \quad \text{for any } x \in M,\ r > 0,\ \lambda > 1. \qquad (2)$$

The least d such that (2) holds is the so-called homogeneous dimension of (M, ρ, µ).

SLIDE 13

We can additionally assume that (M, ρ, µ) is a compact measure space with µ(M) < ∞ satisfying the following condition:

  • C1A. Ahlfors regular volume condition: there exist constants c1, c2 > 0 and d > 0 such that

$$c_1 r^d \le \mu(B(x, r)) \le c_2 r^d \quad \text{for any } x \in M \text{ and } 0 < r \le \operatorname{diam}(M).$$

The doubling condition is precisely related to the metric entropy. For ε > 0 we define, as usual, the covering number N(ε, M) as the smallest number of balls of radius ε covering M.

Lemma

Under the condition C1A, if M is compact, there exist constants c′ > 0, c′′ > 0 and ε0 > 0 such that

$$c' \left(\frac{1}{\varepsilon}\right)^{d} \le N(\varepsilon, M) \le 2^d c'' \left(\frac{1}{\varepsilon}\right)^{d}, \quad \text{for all } 0 < \varepsilon \le \varepsilon_0.$$

SLIDE 14

spider web

SLIDE 15

Smooth operator: setting regularity, constructing a kernel

One rather standard method in density estimation is the kernel method: considering a family of functions indexed by δ > 0, Kδ : M × M → R, the associated kernel density estimator is defined by

$$\hat K_\delta(x) := \frac{1}{n} \sum_{i=1}^{n} K_\delta(X_i, x), \quad x \in M. \qquad (3)$$

In R^d, an important family is the family of translation kernels

$$K_\delta(x, y) = \Big[\frac{1}{\delta}\Big]^d G\Big(\frac{x - y}{\delta}\Big),$$

where G is a function R^d → R. When M is a more involved set, such as a spider web, a manifold, or a set of graphs or of matrices, the simple operations of translation and dilation may not be meaningful. Hence, even finding a family of kernels to start with might be a difficulty.
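As a concrete Euclidean instance of (3), a one-dimensional translation-kernel sketch (the Gaussian choice of G and the bandwidth value are illustrative):

```python
import numpy as np

def translation_kde(samples, x, delta):
    """hat{K}_delta(x) = (1/n) sum_i (1/delta) G((x - X_i)/delta), d = 1,
    with G the standard Gaussian density (an illustrative choice)."""
    u = (x[:, None] - samples[None, :]) / delta        # (x - X_i) / delta
    G = np.exp(-u**2 / 2) / np.sqrt(2 * np.pi)         # Gaussian kernel
    return G.mean(axis=1) / delta                      # average over the sample

rng = np.random.default_rng(0)
samples = rng.normal(size=500)                         # X_1..X_n ~ N(0, 1)
grid = np.linspace(-3, 3, 61)
f_hat = translation_kde(samples, grid, delta=0.3)
```

The estimate is non-negative and integrates to 1, as any kernel density estimate with a density kernel G must.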

SLIDE 16

When dealing with a kernel estimation method, it is standard to consider two quantities, the bias term and the stochastic term:

$$b_\delta(f) := \|E_f \hat K_\delta - f\|_p, \qquad \xi_\delta := \|\hat K_\delta - E_f \hat K_\delta\|_p.$$

The analysis of the first term bδ(f) is linked to the approximation properties of the family $E_f \hat K_\delta$, which are in turn often linked to regularity properties of the function f. It is standardly proved (see e.g. [7]) that in R^d, if Kδ is a translation family with mild properties on K, then polynomial rates of approximation are obtained for functions with Besov regularity.

SLIDE 17

Regularity spaces

Hence, an important issue becomes finding spaces of regularity associated with a possibly complex set M. On a compact metric space (M, ρ) one can always define the scale of s-Lipschitz spaces through the norm

$$\|f\|_{\mathrm{Lip}_s} := \|f\|_\infty + \sup_{x \neq y} \frac{|f(x) - f(y)|}{\rho(x, y)^s}, \quad 0 < s \le 1. \qquad (4)$$

In Euclidean spaces a function can be much more regular than Lipschitz, for instance differentiable at different orders, or belonging to some Sobolev or Besov space. When M is a set with no obvious notion of differentiability, one can make the observation that in R^d, or on Riemannian manifolds, regularity properties can also be expressed via the associated Laplacian. The Laplacian itself is an operator of order 2, but its square root is of order 1 and can be interpreted as a substitute for derivation.

SLIDE 18

Smooth operator

Our main assumption is that the space (M, ρ, µ) is complemented by an essentially self-adjoint non-negative operator L on L2(M, µ), mapping real-valued to real-valued functions, such that the associated semigroup Pt = e^{−tL} consists of integral operators with a (heat) kernel pt(x, y) obeying the conditions mentioned in the sequel. When the spectrum is discrete,

$$p_t(x, y) := \sum_k e^{-t\lambda_k} P_k(x, y), \quad \text{with } P_k(x, y) = \sum_i v_i^{\lambda_k}(x)\, v_i^{\lambda_k}(y),$$

where $v_i^{\lambda_k}$, i = 1, . . . , dim(Eλk), is an orthonormal basis of Eλk.

SLIDE 19

  • C3. Gaussian localization: there exist constants c4, c5 > 0 such that

$$|p_t(x, y)| \le \frac{c_4 \exp\!\big(-\frac{c_5\, \rho^2(x, y)}{t}\big)}{\big(|B(x, \sqrt{t})|\, |B(y, \sqrt{t})|\big)^{1/2}} \quad \text{for } x, y \in M,\ t > 0. \qquad (5)$$

  • C4. Hölder continuity: there exists a constant α > 0 such that

$$\big|p_t(x, y) - p_t(x, y')\big| \le c_4 \left(\frac{\rho(y, y')}{\sqrt{t}}\right)^{\alpha} \frac{\exp\!\big(-\frac{c_5\, \rho^2(x, y)}{t}\big)}{\big(|B(x, \sqrt{t})|\, |B(y, \sqrt{t})|\big)^{1/2}} \qquad (6)$$

for x, y, y′ ∈ M and t > 0, whenever ρ(y, y′) ≤ √t.

  • C5. Markov property:

$$\int_M p_t(x, y)\, d\mu(y) = 1 \quad \text{for } x \in M \text{ and } t > 0. \qquad (7)$$

SLIDE 20

Associated regularity spaces

Regularity classes defined through approximation properties:

$$\Sigma_u = \bigoplus_{\sqrt{\lambda_k} \le u} H_{\lambda_k}, \qquad B^s_p = \Big\{ f \in L^p : \inf_{\phi \in \Sigma_u} \|f - \phi\|_p \le C u^{-s} \Big\}$$

(usually denoted $B^s_{p,\infty}$). Moreover, one can develop a theory of "wavelets" which completely characterizes the previous regularity classes.

SLIDE 21

Typical examples: M = R^d

Here dµ is the Lebesgue measure and ρ is the Euclidean distance on R^d. In this case we consider the operator

$$-L(f)(x) = \Delta f(x) = \sum_{j=1}^{d} \partial_j^2 f(x) = \operatorname{div}(\nabla f)(x)$$

defined on the space D(R^d) of C∞ functions with compact support. As is well known, the operator L is positive and essentially self-adjoint. The associated semigroup e^{tΔ} is given by the operator with the Gaussian kernel

$$p_t(x, y) = (4\pi t)^{-d/2} \exp\left(-\frac{|x - y|^2}{4t}\right).$$
SLIDE 22

Periodic case on M = [−1, 1]

Here dµ is the Lebesgue measure and ρ is the Euclidean distance on the circle. The operator is L(f) = −f′′, defined on the set of infinitely differentiable periodic functions. It has eigenvalues k²π² for k ∈ N0 and eigenspaces

$$\ker(L) = H_0 = \operatorname{span}\Big\{\tfrac{1}{\sqrt{2}}\Big\}, \qquad \ker(L - k^2\pi^2) = H_k = \operatorname{span}\{\cos k\pi x,\ \sin k\pi x\}.$$
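This spectral picture is easy to confirm numerically; a finite-difference check that cos(kπx) is an eigenfunction of L = −d²/dx² on the periodic interval with eigenvalue (kπ)² (k and the grid size are arbitrary illustrative choices):

```python
import numpy as np

# f(x) = cos(k*pi*x) should satisfy -f'' = (k*pi)^2 f on periodic [-1, 1].
k = 3
m = 2000
x = np.linspace(-1, 1, m, endpoint=False)
h = x[1] - x[0]
f = np.cos(k * np.pi * x)
# second-order central difference with periodic wrap-around
f_second = (np.roll(f, -1) - 2 * f + np.roll(f, 1)) / h**2
residual = np.max(np.abs(-f_second - (k * np.pi) ** 2 * f))
```

The residual is of order h², the accuracy of the central difference, confirming the eigenvalue relation.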

SLIDE 23

Unit sphere M = S^{d−1} in R^d, d ≥ 3

This is the most famous Riemannian manifold, with the structure induced from R^d. Here dµ is the Lebesgue measure on S^{d−1}, ρ is the geodesic distance on S^{d−1}: ρ(ξ, η) = arccos⟨ξ, η⟩_{R^d}, and L := −∆0, with ∆0 the Laplace-Beltrami operator on S^{d−1}. The spectral decomposition of the operator L can be described as follows:

$$L^2(S^{d-1}, \mu) = \bigoplus_k E_{\lambda_k}, \qquad E_{\lambda_k} = \ker(L - \lambda_k \operatorname{Id}), \quad \lambda_k = k(k + d - 2).$$

Here Eλk is the restriction to S^{d−1} of the harmonic homogeneous polynomials of degree k (spherical harmonics). We have

$$\dim(E_{\lambda_k}) = \binom{d + k - 1}{d - 1} - \binom{d + k - 3}{d - 1}.$$
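The dimension formula can be sanity-checked against the classical special cases: for d = 3, spherical harmonics of degree k have dimension 2k + 1, and for d = 4 the dimension is (k + 1)², consistent with the SU(2) slide below:

```python
from math import comb

def dim_harmonics(d, k):
    """dim(E_{lambda_k}) on the sphere S^{d-1}: C(d+k-1, d-1) - C(d+k-3, d-1)."""
    return comb(d + k - 1, d - 1) - comb(d + k - 3, d - 1)
```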
SLIDE 24

Lie group of matrices: M = SU(2)

This example is interesting in astrophysical problems, especially in the measurements associated with the CMB, where instead of only measuring the intensity of the radiation we also measure its polarisation (spin). By definition

$$SU(2) := \left\{ \begin{pmatrix} a & b \\ -\bar b & \bar a \end{pmatrix} : a, b \in \mathbb{C},\ |a|^2 + |b|^2 = 1 \right\}.$$

Thus q ∈ SU(2) ⊂ M(2, C), with q^{−1} = q* and det(q) = 1. This is a compact group which, topologically, is the sphere S³ ⊂ R⁴, with

$$\rho_{SU(2)}(x, y) = \arccos\big(\tfrac{1}{2}\operatorname{Tr}[x y^*]\big).$$

The eigenvalues of L = −∆ are λk = k(k + 2) and the dimension of the respective eigenspaces Eλk is (k + 1)².

SLIDE 25

Construction of kernel density estimators on the metric measure space M

To explain our construction of kernel estimators we begin with the classical example of the periodic case on M = [−1, 1]. A commonly admitted nonparametric estimator is of the form

$$\hat f_T(x) = \frac{1}{2} + \frac{1}{n} \sum_{i=1}^{n} \sum_{1 \le k \le T} \cos k\pi(x - X_i).$$

It falls into the category of orthogonal series estimators. It is well known that these estimators have nice L² properties but can drastically fail in Lp, p ≠ 2, or locally. In our setting we will replace $\hat f_T$ by a 'smoothed version':

$$\hat\Phi_\delta(x) = \frac{1}{2} + \frac{1}{n} \sum_{i=1}^{n} \sum_{k \ge 1} \Phi(\delta k) \cos k\pi(x - X_i) =: \frac{1}{n} \sum_{i=1}^{n} \Phi_\delta(x, X_i),$$

where Φ is a Littlewood-Paley function.
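A NumPy sketch of this smoothed cosine-series estimator (the particular C∞ cutoff Φ, the bandwidth, and the uniform test sample are illustrative choices):

```python
import numpy as np

def lp_bump(t):
    """A smooth cutoff with supp in [0, 1], equal to 1 on [0, 1/2]
    (one of many valid Littlewood-Paley functions)."""
    t = np.asarray(t, dtype=float)
    out = np.zeros_like(t)
    out[t <= 0.5] = 1.0
    mid = (t > 0.5) & (t < 1.0)
    a = np.exp(-1.0 / (1.0 - t[mid]))   # smooth transition built from exp(-1/x)
    b = np.exp(-1.0 / (t[mid] - 0.5))
    out[mid] = a / (a + b)
    return out

def smoothed_series_kde(samples, x, delta):
    """hat{Phi}_delta(x) = 1/2 + (1/n) sum_i sum_k Phi(delta k) cos(k pi (x - X_i))."""
    kmax = int(np.ceil(1.0 / delta))        # Phi(delta k) vanishes beyond 1/delta
    k = np.arange(1, kmax + 1)
    w = lp_bump(delta * k)                  # Phi(delta k)
    diffs = x[:, None, None] - samples[None, :, None]     # shape (x, i, 1)
    terms = np.cos(np.pi * k[None, None, :] * diffs)      # cos(k pi (x - X_i))
    return 0.5 + (w * terms).sum(axis=2).mean(axis=1)

rng = np.random.default_rng(1)
samples = rng.uniform(-1, 1, 5000)          # illustrative: true density f = 1/2
grid = np.linspace(-1, 1, 101)
f_hat = smoothed_series_kde(samples, grid, delta=0.25)
```

For uniform samples on [−1, 1] the estimate hovers around the true density 1/2, the cosine terms averaging out.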

SLIDE 26

In analogy with this case, replacing the circle by M and the Laplacian by the operator −L:

Definition

Let X1, . . . , Xn be i.i.d. random variables on M in our setting.

$$\hat\Phi_\delta(x) := \frac{1}{n} \sum_{i=1}^{n} \Phi(\delta\sqrt{L})(X_i, x), \quad x \in M,$$

$$\Phi(\delta\sqrt{L})(x, y) := \int_0^\infty \Phi(\delta\sqrt{\lambda})\, dE_\lambda(x, y) = \sum_k \Phi(\delta\sqrt{\lambda_k})\, P_k(x, y),$$

with $P_k(x, y) = \sum_i v_i^{\lambda_k}(x)\, v_i^{\lambda_k}(y)$, where $v_j^{\lambda_k}$, j = 1, . . . , dim(Eλk), is an orthonormal basis of Eλk, when the spectrum is discrete. Φ is a 'Littlewood-Paley' function.

SLIDE 27

Discrete spectrum

Let X1, . . . , Xn be i.i.d. random variables on M in our setting.

$$\hat\Phi_\delta(x) := \frac{1}{n} \sum_{i=1}^{n} \sum_k \Phi(\delta\sqrt{\lambda_k}) \sum_j v_j^{\lambda_k}(x)\, v_j^{\lambda_k}(X_i), \quad x \in M,$$

where $v_j^{\lambda_k}$, j = 1, . . . , dim(Eλk), is an orthonormal basis of Eλk, when the spectrum is discrete. Φ is a 'Littlewood-Paley' function.

SLIDE 28

Spectral decomposition of L

L has eigenvalues λ1 ≤ . . . ≤ λT and (normed) eigenvectors V¹, . . . , V^T, with V^j = (V^j_1, . . . , V^j_T).

Our estimator is, for x a point of the graph:

$$\hat K_\delta(x) := \frac{1}{n} \sum_{i=1}^{n} \sum_{j=1}^{T} \Phi(\delta\sqrt{\lambda_j})\, V^j_{X_i} V^j_x.$$

δ is the bandwidth and Φ is a 'Littlewood-Paley function': a C∞(R) real-valued function with supp(Φ) ⊂ [0, 1] and Φ(λ) = 1 for λ ∈ [0, 1/2].

SLIDE 29

Examples of Littlewood-Paley kernel estimators

  • For the sphere, we get the following estimator:

$$\hat\Phi_\delta(x) = \frac{1}{n} \sum_{i=1}^{n} \sum_k \Phi\big(\delta\sqrt{k(k + d - 2)}\big)\, L_k(\langle x, X_i\rangle_{\mathbb{R}^d}), \quad \forall\, x \in S^{d-1},$$

where $L_k(x) = \frac{1}{|S^{d-1}|}\Big(1 + \frac{k}{\nu}\Big) C_k^{\nu}(x)$, with ν = (d − 2)/2 and $C_k^{\nu}$ the Gegenbauer polynomial.

  • For SU(2), we get the following estimator:

$$\hat\Phi_\delta(x) = \frac{1}{n} \sum_{i=1}^{n} \sum_k \Phi\big(\delta\sqrt{k(k + 2)}\big)\, L_k\big(\tfrac{1}{2}\operatorname{Tr}[X_i x^*]\big), \quad \forall\, x \in SU(2),$$

with $L_k(x) = \frac{1}{|S^3|}(1 + k)\, C_k^{1}(x)$.

SLIDE 30

Upper bound results

Theorem

(i) If 2 ≤ p < ∞, then, for 0 < δ ≤ 1,

$$E\|\hat\Phi_\delta - f\|_p \le \frac{c(p, f)}{(n\delta^d)^{1/2}} + \|\Phi(\delta\sqrt{L})f - f\|_p.$$

(ii) If 1 ≤ p < 2 and supp(f) ⊂ B(x0, R) for some x0 ∈ M and R > 0, then, for 0 < δ ≤ 1,

$$E\|\hat\Phi_\delta - f\|_p \le \frac{c(p)}{(n\delta^d)^{1/2}}\, |B(x_0, R)|^{\frac{1}{p} - \frac{1}{2}} + \|\Phi(\delta\sqrt{L})f - f\|_p.$$

(iii) There exists a constant c such that for any q ≥ 2 and 0 < δ ≤ 1 we have

$$E\|\hat\Phi_\delta - f\|_\infty \le c\, \delta^{-\frac{d}{q}} \left[ \frac{q}{(n\delta^d)^{1 - \frac{1}{q}}} + \frac{q^{1/2}}{(n\delta^d)^{1/2}}\, \|f\|_\infty^{\frac{1}{2} - \frac{1}{q}} \right] + \|\Phi(\delta\sqrt{L})f - f\|_\infty.$$

SLIDE 31

Minimax-type results

Theorem

(i) If 2 ≤ p < ∞ and δ = n^{−1/(2s+d)}, then

$$\sup_{f \in B^s_{p\tau}(m)} E\|\hat\Phi_\delta - f\|_p \le c\, n^{-\frac{s}{2s+d}}, \qquad (8)$$

(ii) If 1 ≤ p < 2, x0 ∈ M, R > 0, and δ = n^{−1/(2s+d)}, then

$$\sup_{f \in B^s_{p\tau}(m, x_0, R)} E\|\hat\Phi_\delta - f\|_p \le c\, n^{-\frac{s}{2s+d}}, \qquad (9)$$

(iii) If δ = (log n / n)^{1/(2s+d)}, then

$$\sup_{f \in B^s_{\infty\tau}(m)} E\|\hat\Phi_\delta - f\|_\infty \le c \left(\frac{\log n}{n}\right)^{\frac{s}{2s+d}}. \qquad (10)$$

SLIDE 32

Adaptation

$$\delta = \Big(\frac{\log n}{n}\Big)^{\frac{1}{2s+d}}, \qquad \delta = n^{-\frac{1}{2s+d}}$$

These choices suppose knowledge of the regularity s. How to do without it? Different methods:

  • Lepski's method
  • Penalization, Bayesian methods
  • Wavelet thresholding
SLIDE 33

Adaptation: Lepski's method

$$\hat\delta(x) := \sup\Big\{\delta : |\hat\Phi_\delta(x) - \hat\Phi_{\delta'}(x)| \le c\sqrt{\frac{\log n}{n[\delta']^d}}, \ \forall\, \delta' \le \delta\Big\}$$
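A schematic version of this selection rule on a finite bandwidth grid (the grid, the constant c, and the generic `estimate` callback are illustrative; the slides do not fix them):

```python
import numpy as np

def lepski_bandwidth(estimate, deltas, x, n, d, c=1.0):
    """Pick the largest delta in `deltas` such that the estimate at x stays
    within c*sqrt(log n / (n * delta'^d)) of the estimates at all smaller
    bandwidths delta'.  `estimate(delta, x)` returns hat{Phi}_delta(x)."""
    deltas = np.sort(np.asarray(deltas, dtype=float))   # increasing order
    vals = np.array([estimate(d_, x) for d_ in deltas])
    best = deltas[0]                                    # smallest is always admissible
    for j in range(1, len(deltas)):
        thresholds = c * np.sqrt(np.log(n) / (n * deltas[:j] ** d))
        if np.all(np.abs(vals[j] - vals[:j]) <= thresholds):
            best = deltas[j]
        else:
            break
    return best
```

If the estimates barely move across bandwidths, the rule returns the largest (smoothest) δ; a jump at some scale stops the search there.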

SLIDE 34

Adaptation: Bayesian methods with heat kernel prior (Castillo-Kerkyacharian-P.)

Prior: I- Gaussian process

$$K_t(x, y) = P_t(x, y) = \sum_k e^{-t\lambda_k} \sum_l e^l_k(x)\, e^l_k(y)$$

$$W^t(x) = \sum_k e^{-t\lambda_k} \sum_l e^l_k(x)\, \xi_{kl}$$

$$W^T(x) = \sum_k e^{-T\lambda_k/2} \sum_l \xi_{kl}\, e^l_k(x)$$

with T of density proportional to $t^{-a} e^{-t^{-d/2}} \log^q(1/t)$, q = 1 + d/2, and ξkl i.i.d. N(0, 1). W^T is seen as a prior on (L², ‖·‖₂).

Suppose f0 ∈ B^s_{2,∞}(M) (Ahlfors condition). For R large enough, as n → +∞,

$$P^n_{f_0}\Big(\Pi\Big(\rho(f, f_0) \ge R \Big(\frac{\log n}{n}\Big)^{s/(2s+d)} \,\Big|\, X^{(n)}\Big)\Big) \to 0.$$
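On the circle of the earlier periodic example one can actually draw from a truncated version of this prior, using the eigenpairs λk = (kπ)² and ek ∈ {cos kπx, sin kπx}; t, the truncation level, and the seed are illustrative choices:

```python
import numpy as np

# One sample path of the (truncated) heat-kernel Gaussian prior on [-1, 1].
rng = np.random.default_rng(0)
t, K = 0.05, 50
x = np.linspace(-1, 1, 400)
W = np.zeros_like(x)
for k in range(1, K + 1):
    lam = (k * np.pi) ** 2                 # eigenvalue of L = -d^2/dx^2
    xi_c, xi_s = rng.standard_normal(2)    # i.i.d. N(0, 1) coefficients
    W += np.exp(-t * lam / 2) * (xi_c * np.cos(k * np.pi * x)
                                 + xi_s * np.sin(k * np.pi * x))
```

The factor e^{−tλk/2} damps high frequencies, so larger t yields smoother prior draws.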

SLIDE 35

Adaptation: wavelet thresholding

We have some restrictions on the space:

1. (M, ρ) is compact.
2. (M, ρ) is an Ahlfors regular space: ∃ 0 < a < b < ∞, ∃ 0 < d < ∞, such that for all x ∈ M and 0 < r < diam(M),

$$a r^d \le \mu(B(x, r)) \le b r^d.$$

From the wavelet construction we know

$$f = \sum_{j \ge 0} \sum_{\xi \in X_j} \beta_{j,\xi}\, \psi_{j,\xi}; \qquad \beta_{j,\xi} = \int_M f\, \tilde\psi_{j,\xi} = E(\tilde\psi_{j,\xi}(X)); \quad X \sim f d\mu.$$

SLIDE 36

Let X1, X2, . . . , Xn be i.i.d. with X ∼ f(x)dx, 0 ≤ f ≤ A (A ≥ 4, A known). We suppose f ∈ B^s_{r,τ}, 1 ≤ r, τ ≤ ∞, 0 < s. The parameters s, r, τ are unknown, but we assume s′ = s − d/r > 0. Let us define β̂j,ξ and Jn by

$$\hat\beta_{j,\xi} = \frac{1}{n} \sum_{i=1}^{n} \tilde\psi_{j,\xi}(X_i), \qquad b^{-J_n d} \sim \frac{\log n}{n}.$$

As well, let

$$\lambda_n = \kappa \sqrt{\frac{\log n}{n}}, \qquad \kappa = c\sqrt{8A},$$

where c is a universal constant, and define the estimator $\hat f_n$ of f:

$$\hat f_n = \sum_{0 \le j \le J_n} \sum_{\xi \in X_j} \hat\beta_{j,\xi}\, 1_{\{|\hat\beta_{j,\xi}| > \lambda_n\}}\, \psi_{j,\xi}.$$
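The thresholding rule itself is easy to sketch; below, a toy version on [0, 1] with the Haar basis standing in for the wavelet system {ψj,ξ} of the slides (the basis, Jn, and κ are illustrative assumptions):

```python
import numpy as np

def haar_psi(j, k, x):
    """L2-normalized Haar wavelet psi_{j,k} on [0, 1]."""
    y = 2**j * x - k
    return 2**(j / 2) * (((0 <= y) & (y < 0.5)).astype(float)
                         - ((0.5 <= y) & (y < 1)).astype(float))

def thresholded_estimate(samples, x, J, kappa=1.0):
    """hat{f}_n(x): keep only empirical coefficients exceeding lambda_n,
    plus the coarse term (father wavelet = indicator of [0, 1])."""
    n = len(samples)
    lam = kappa * np.sqrt(np.log(n) / n)            # threshold lambda_n
    est = np.ones_like(x)                           # coarse term for [0, 1] data
    for j in range(J + 1):
        for k in range(2**j):
            beta = haar_psi(j, k, samples).mean()   # empirical coefficient
            if abs(beta) > lam:                     # hard thresholding
                est += beta * haar_psi(j, k, x)
    return est
```

For uniform samples the empirical detail coefficients are pure noise, fall below λn, and the estimate stays flat, which is exactly the adaptive behavior the threshold is designed for.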

Then there exist constants C(p, r, s), C(r, s), depending only on the parameters in parentheses, such that:

SLIDE 37

1.

$$E(\|\hat f_n - f\|_\infty) \le C(s, r)\, \|f\|_{B^s_{r,\tau}}^{\frac{d}{2(s - d(\frac{1}{r} - \frac{1}{2}))}} \left(\frac{\log n}{n}\right)^{\frac{s - d/r}{2(s - d(\frac{1}{r} - \frac{1}{2}))}}.$$

2. If 1 ≤ p < ∞ and s ≥ (dp/2)(1/r − 1/p),

$$E(\|\hat f_n - f\|_p) \le C(p, r, s)\, \|f\|_{B^s_{r,\tau}}^{\frac{d}{2s+d}}\, \log n \left(\frac{\log n}{n}\right)^{\frac{s}{2s+d}}.$$

3. If 2 ≤ p < ∞ and s < (dp/2)(1/r − 1/p),

$$E(\|\hat f_n - f\|_p) \le C(p, r, s)\, \|f\|_{B^s_{r,\tau}}^{\frac{d(\frac{1}{2} - \frac{1}{p})}{s - d(\frac{1}{r} - \frac{1}{2})}}\, \log n \left(\frac{\log n}{n}\right)^{\frac{s - d(\frac{1}{r} - \frac{1}{p})}{2(s - d(\frac{1}{r} - \frac{1}{2}))}}.$$

SLIDE 38

Theoretical part: Kernel and wavelet density estimators on manifolds and more general metric spaces

References

[1] P. Baldi, G. Kerkyacharian, D. Marinucci, D. Picard, Adaptive density estimation for directional data using needlets, Ann. Statist. 37 (2009), no. 6A, 3362-3395.
[2] I. Castillo, G. Kerkyacharian, D. Picard, Thomas Bayes' walk on manifolds, Probab. Theory Relat. Fields 158 (2014), no. 3-4, 665-710.
[3] R. Coifman, G. Weiss, Analyse Harmonique Non-commutative sur Certains Espaces Homogènes, Lecture Notes in Math. Vol. 242, Springer, Berlin, 1971.
[4] T. Coulhon, G. Kerkyacharian, P. Petrushev, Heat kernel generated frames in the setting of Dirichlet spaces, J. Fourier Anal. Appl. 18 (2012), no. 5, 995-1066.
[5] D. L. Donoho, I. M. Johnstone, G. Kerkyacharian, D. Picard, Density estimation by wavelet thresholding, Ann. Statist. 24 (1996), no. 2, 508-539.

SLIDE 39

References (continued)

[6] M. Frazier, B. Jawerth, G. Weiss, Littlewood-Paley Theory and the Study of Function Spaces, CBMS No. 79, AMS, 1991.
[7] W. Härdle, G. Kerkyacharian, D. Picard, A. Tsybakov, Wavelets, Approximation, and Statistical Applications, Lecture Notes in Statistics 129, Springer-Verlag, New York, 1998.
[8] G. Kerkyacharian, P. Petrushev, Heat kernel based decomposition of spaces of distributions in the framework of Dirichlet spaces, Trans. Amer. Math. Soc. 367 (2015), 121-189.
[9] G. Kerkyacharian, P. Petrushev, D. Picard, T. Willer, Needlet algorithms for estimation in inverse problems, Electron. J. Stat. 1 (2007), 30-76.
[10] G. Kerkyacharian, P. Petrushev, Y. Xu, Gaussian bounds for the weighted heat kernels on the interval, ball and simplex, Constr. Approx., to appear. arXiv:1801.07325

SLIDE 40

References (continued)

[11] G. Kerkyacharian, P. Petrushev, Y. Xu, Gaussian bounds for the heat kernels on the ball and simplex: classical approach, Studia Math., to appear. arXiv:1801.07326