INFORMATION THEORY: SOURCES, DIRICHLET SERIES, REALISTIC ANALYSIS OF DATA STRUCTURES

slide-1
SLIDE 1

INFORMATION THEORY: SOURCES, DIRICHLET SERIES, REALISTIC ANALYSIS OF DATA STRUCTURES

Mathieu Roux and Brigitte Vallée, GREYC Laboratory (CNRS and University of Caen, France). Talk also based on joint works with Viviane Baladi, Eda Cesaratto, Julien Clément, Jim Fill, Philippe Flajolet. WORDS 2011, Prague, September 2011.

slide-5
SLIDE 5

Description of a framework which:
– unifies the analyses for text algorithms and searching/sorting algorithms,
– provides a general model for sources,
– shows the importance of the Dirichlet generating functions,
– explains the importance of tameness for sources,
– defines a natural subclass of sources, the dynamical sources,
– provides sufficient conditions for tameness of dynamical sources,
– provides probabilistic analyses for data structures built on tame sources.

slide-6
SLIDE 6

Plan of the talk.
– General motivations: Dirichlet generating functions and tameness.
– An important class of sources: dynamical sources.
– Tameness in the case of dynamical sources.
– Conclusion and possible extensions.



slide-10
SLIDE 10

The classical framework for analysis of algorithms, in two main algorithmic domains: text algorithms, and sorting or searching algorithms.
– In text algorithms, algorithms deal with words.
– In sorting or searching algorithms, algorithms deal with keys.
A word and a key are both sequences of symbols... but:
– for comparing two words, the structure of the words is important;
– for comparing two keys, the structure of the keys is transparent: only their relative order plays a role.

slide-13
SLIDE 13

Text algorithms and dictionaries: the trie structure, probabilistic study.

[Figure: a trie built over the alphabet {a, b, c} on the words abc, cba, bbc, cab.]

Main parameter of a node n_w labelled with prefix w:
N_w := the number of words which begin with prefix w
     = the number of words which go through the node n_w.
The size R and the path length T of a trie equal
R = Σ_{w ∈ Σ⋆} 1[N_w ≥ 2],   T = Σ_{w ∈ Σ⋆} 1[N_w ≥ 2] · N_w.
Central role of p_w := the probability that a word begins with prefix w.
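The two parameters R and T can be computed directly from the prefix counts N_w, without building the tree. A minimal sketch in Python (the helper name `trie_parameters` is ours, not from the talk):

```python
from collections import Counter
from itertools import chain

def trie_parameters(words):
    """Return (R, T): R counts the prefixes w with N_w >= 2 (trie size),
    T sums N_w over those prefixes (trie path length)."""
    # N_w for every prefix w (including the empty prefix) of every word.
    counts = Counter(chain.from_iterable(
        (word[:k] for k in range(len(word) + 1)) for word in words))
    shared = [n for n in counts.values() if n >= 2]
    return len(shared), sum(shared)

# For the words abc, cba, bbc, cab: the empty prefix (N = 4) and the
# prefix "c" (N = 2) are shared, so R = 2 and T = 6.
print(trie_parameters(["abc", "cba", "bbc", "cab"]))  # (2, 6)
```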


slide-18
SLIDE 18

A realistic framework for sorting or searching. Keys are viewed as words and are compared [w.r.t. the lexicographic order]. The realistic unit cost is now the symbol comparison. The realistic cost of the comparison between two words A and B, with A = a1 a2 a3 ... ai ... and B = b1 b2 b3 ... bi ..., equals k + 1, where k is the length of their longest common prefix:
k := max{i; ∀j ≤ i, aj = bj} = the coincidence c(A, B).
The probabilistic study of the coincidence deals with p_w := the probability that a word begins with prefix w:
Pr[c(A, B) ≥ k] = Pr[A and B begin with the same w of length k] = Σ_{|w|=k} p_w².
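The coincidence and the resulting symbol-comparison cost are straightforward to state in code; a small sketch (the function names are ours):

```python
def coincidence(a, b):
    """c(a, b): length of the longest common prefix of the words a and b."""
    k = 0
    for x, y in zip(a, b):
        if x != y:
            break
        k += 1
    return k

def symbol_cost(a, b):
    """Realistic cost of one key comparison: c(a, b) + 1 symbol comparisons."""
    return coincidence(a, b) + 1

print(coincidence("abbbbbbb", "abbbbba"))  # 6: the words agree on "abbbbb"
print(symbol_cost("abbbbbbb", "abbbbba"))  # 7
```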


slide-25
SLIDE 25

The example of the binary search tree (BST).

Number of symbol comparisons needed for inserting F = abbbbbbb:
  7 for comparing to A [c(F, A) = 6]
+ 8 for comparing to B [c(F, B) = 7]
+ 1 for comparing to C [c(F, C) = 0]
Total = 16, to be compared to the number of key comparisons [= 3].

This defines the symbol path length of a BST, based on the coincidence. We perform a probabilistic study of this symbol path length.
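The slide's accounting can be replayed in code with hypothetical keys A, B, C chosen by us to match the stated coincidences (c(F, A) = 6, c(F, B) = 7, c(F, C) = 0); the actual keys of the slide's figure are not recoverable from the text:

```python
def coincidence(a, b):
    """Length of the longest common prefix of a and b."""
    k = 0
    for x, y in zip(a, b):
        if x != y:
            break
        k += 1
    return k

F = "abbbbbbb"
# Hypothetical keys on the search path, consistent with the slide's counts.
search_path = ["abbbbba", "abbbbbbc", "b"]   # c(F, .) = 6, 7, 0

total = sum(coincidence(F, key) + 1 for key in search_path)
print(total)  # 7 + 8 + 1 = 16 symbol comparisons, against 3 key comparisons
```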


slide-29
SLIDE 29

Now we work inside a unifying framework where searching and sorting algorithms are viewed as text algorithms. In this context, the probabilistic behaviour of algorithms heavily depends on the mechanism which produces words.

A source := a mechanism which produces symbols from the alphabet Σ, one for each time unit. When (discrete) time evolves, a source produces (infinite) words of Σ^N. For w ∈ Σ⋆, p_w := the probability that a word begins with the prefix w. The set {p_w, w ∈ Σ⋆} defines the source S.


slide-33
SLIDE 33

Fundamental role of the Dirichlet generating functions of the source:
Λ(s) := Σ_{w ∈ Σ⋆} p_w^s,   Λ_k(s) := Σ_{w ∈ Σ^k} p_w^s,   Λ = Σ_{k ≥ 0} Λ_k.
Remark: Λ_k(1) = 1 for any k, so Λ(1) = ∞.
– They encapsulate the main probabilistic properties of the source.
– They translate them into analytic properties.
For instance, the entropy h(S) and the coincidence c_S:
h(S) := lim_{k→∞} (−1/k) Σ_{w ∈ Σ^k} p_w log p_w = lim_{k→∞} (−1/k) Λ′_k(1),
Pr[c_S ≥ k] = Σ_{w ∈ Σ^k} p_w² = Λ_k(2).
– They intervene in the probabilistic analysis of algorithms and data structures.
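For a memoryless source the relations Λ_k(1) = 1 and Pr[c_S ≥ k] = Λ_k(2) can be checked numerically, and the entropy limit is attained at every k. A sketch assuming a binary memoryless source with probabilities (0.3, 0.7) (the helper name is ours):

```python
import math
from itertools import product

def word_probs(probs, k):
    """All probabilities p_w for words w of length k of a memoryless source."""
    out = []
    for w in product(range(len(probs)), repeat=k):
        pw = 1.0
        for c in w:
            pw *= probs[c]
        out.append(pw)
    return out

probs, k = [0.3, 0.7], 8
pws = word_probs(probs, k)
print(sum(pws))                                   # Lambda_k(1) = 1
print(sum(p ** 2 for p in pws))                   # Lambda_k(2) = Pr[c_S >= k]
h_k = -(1.0 / k) * sum(p * math.log(p) for p in pws)
print(h_k, -sum(p * math.log(p) for p in probs))  # the two entropy values agree
```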


slide-37
SLIDE 37

Exact average-case analysis for tries and BSTs.
S_n^(X) := the mean path length of the trie [X = T], or the mean symbol path length of the BST [X = B], when built on n words independently drawn from the same source.
For each case [X = T or X = B], an exact formula for S_n^(X):
S_n^(X) = Σ_{k=2}^{n} (−1)^k (n choose k) ϖ_X(k),
which involves a series ϖ_X at integer values k.
[Clément, Flajolet, V. (2001) for X = T; Clément, Fill, Flajolet, V. (2009) for X = B]
This series ϖ_X(s) is closely related to the Dirichlet series of the source:
ϖ_T(s) = s Λ(s),   ϖ_B(s) = 2 Λ(s) / (s(s − 1)),   where Λ(s) := Σ_{w ∈ Σ⋆} p_w^s.
Nice exact formulae, but not easy to deal with, due to the alternating signs.
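For the unbiased binary memoryless source, Λ(k) = 1/(1 − 2^(1−k)) for k ≥ 2, so the trie formula with ϖ_T(k) = k Λ(k) can be evaluated exactly in rational arithmetic; the alternating signs make floating point unreliable for large n, which is the difficulty the slide alludes to. A sketch under those assumptions:

```python
from fractions import Fraction
from math import comb

def trie_path_length(n):
    """T_n = sum_{k=2}^n (-1)^k * C(n, k) * k * Lambda(k), with
    Lambda(k) = 1/(1 - 2^(1-k)) for the unbiased binary memoryless source."""
    total = Fraction(0)
    for k in range(2, n + 1):
        lam_k = 1 / (1 - Fraction(2) ** (1 - k))   # Lambda(k), exact rational
        total += (-1) ** k * comb(n, k) * k * lam_k
    return total

print(trie_path_length(2))          # 4
print(trie_path_length(3))          # 8
print(float(trie_path_length(50)))  # exact arithmetic avoids the cancellation
```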


slide-42
SLIDE 42

Asymptotic analysis. The residue formula transforms the sum into an integral, with 1 < d < 2:
S_n = Σ_{k=2}^{n} (−1)^k (n choose k) ϖ(k) = (1/2iπ) ∫_{d−i∞}^{d+i∞} ϖ(s) · (−1)^{n+1} n! / (s(s − 1) ⋯ (s − n)) ds.
We then shift the integral to the left. Usually, the first singularities occur at ℜs = 1. Behaviour of ϖ(s) [or Λ(s)] near ℜs = 1? Where are the singularities closest to ℜs = 1? Is Λ(s) of polynomial growth on the shifted contour?

Importance of the existence of a region R
– which contains only s = 1 as a pole,
– where Λ(s) is of polynomial growth:
this is the tameness of the source.

slide-43
SLIDE 43

Main results [Clément, Flajolet, V. (2001); Clément, Fill, Flajolet, V. (2009)]

Consider n words independently drawn from the same tame source. Then:
– the mean path length T_n of the trie satisfies T_n ∼ (1/h_S) · n log n;
– the mean symbol path length B_n of the BST satisfies B_n ∼ (1/h_S) · n log² n.
Here h_S is the entropy of the source S, defined as
h_S := lim_{k→∞} (−1/k) Σ_{w ∈ Σ^k} p_w log p_w,
where p_w is the probability that a word begins with prefix w.
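A quick numeric sanity check of the trie estimate, for the unbiased binary memoryless source (where h_S = log 2), using the direct, non-alternating level-by-level expectation for T_n (the helper name is ours):

```python
import math

def trie_path_length_direct(n, depth=200):
    """Direct formula for the unbiased binary source: the level-k contribution
    is 2^k * E[N_w * 1(N_w >= 2)] with N_w ~ Binomial(n, p), p = 2^-k,
    i.e. 2^k * (n*p - n*p*(1-p)^(n-1))."""
    total = 0.0
    for k in range(depth):
        p = 2.0 ** (-k)
        total += (2.0 ** k) * n * p * (1.0 - (1.0 - p) ** (n - 1))
    return total

# h_S = log 2, so T_n / (n log2 n) slowly approaches 1 (the correction
# term is of order n, hence the logarithmically slow convergence).
for n in (10 ** 3, 10 ** 5):
    print(n, trie_path_length_direct(n) / (n * math.log2(n)))
```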

slide-44
SLIDE 44

Plan of the talk.
– General motivations: Dirichlet generating functions and tameness.
– An important class of "natural" sources: dynamical sources = sources associated to dynamical systems.
– Tameness in the case of dynamical sources.
– Conclusion and possible extensions.


slide-47
SLIDE 47

A dynamical source = a source built with a dynamical system [V. 1998].

A dynamical system (I, T) is defined by:
– a denumerable alphabet Σ (possibly infinite),
– a topological partition of I := ]0, 1[ into open intervals I_m, m ∈ Σ,
– an encoding mapping σ, equal to m on each I_m,
– a shift mapping T, where each restriction T|I_m is a bijection of class C² on I_m; the local inverse of T|I_m is denoted by h_m.

[Figure: the orbit x, Tx, T²x, T³x of a point x, with M(x) = (c, b, a, c, ...).]

This gives rise to a source: on an input x of I, it outputs the word M(x) := (σx, σTx, σT²x, ...). When an initial density is chosen on I, this induces (via M) a probabilistic model on Σ^∞: a dynamical source.


slide-49
SLIDE 49

Strong relations between the geometry of the system, the correlations between symbols, and the probabilistic properties of the source. Two geometric characteristics of the system:
– the position of the branches T(I_k) w.r.t. I_m,
– the shape of the branches, defined by the derivative of h_m.

Particular cases: simple sources and affine branches.
A memoryless source := a complete system with affine branches and uniform initial density.
A Markov chain := a Markovian system with affine branches, with an initial density which is constant on each I_m.

[Figure: two systems with affine branches, plotted on the unit square.]


slide-53
SLIDE 53

General case of interest = the Good Class, which gathers:
– complete systems: T(I_m) = I,
– with a possibly infinite denumerable alphabet,
– with expansive branches: |T′(x)| ≥ ρ > 1.
Main instance: the Euclidean source, defined with T(x) := 1/x − ⌊1/x⌋.
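On input x, the Euclidean source emits the successive continued-fraction digits of x. A minimal simulation with the standard Gauss map, using exact rationals (the function name is ours):

```python
from fractions import Fraction

def euclidean_word(x, length):
    """Emit symbols of the Euclidean source on input x in ]0, 1[:
    symbol m = floor(1/x), shift T(x) = 1/x - m (the Gauss map).
    The emitted word is the continued-fraction expansion of x."""
    word = []
    for _ in range(length):
        if x == 0:                             # rational inputs terminate
            break
        inv = 1 / Fraction(x)
        m = inv.numerator // inv.denominator   # floor(1/x)
        word.append(m)
        x = inv - m                            # T(x)
    return word

print(euclidean_word(Fraction(1, 7), 5))     # [7]
print(euclidean_word(Fraction(31, 49), 10))  # continued-fraction digits of 31/49
```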


slide-57
SLIDE 57

A main analytical object related to any source: the Dirichlet series of probabilities,
Λ(s) := Σ_{w ∈ Σ⋆} p_w^s.

Memoryless sources, with probabilities (p_i):
Λ(s) = 1 / (1 − λ(s)) with λ(s) = Σ_{i=1}^{r} p_i^s.

Markov chains, defined by the vector R of initial probabilities (r_i) and the transition matrix P := (p_{i,j}):
Λ(s) = 1 + ᵗ1 (I − P(s))^{−1} R(s) with P(s) = (p_{i,j}^s), R(s) = (r_i^s).

A general dynamical source: Λ(s) is closely related to (I − H_s)^{−1}, where H_s is the (secant) transfer operator of the dynamical system.
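The memoryless closed form is just a geometric series over word lengths, since Σ_{|w|=k} p_w^s = λ(s)^k. A quick check by truncating the sum at a finite word length (the depth and helper names are our choices):

```python
from itertools import product

def lam(probs, s):
    """lambda(s) = sum_i p_i^s for a memoryless source."""
    return sum(p ** s for p in probs)

def truncated_lambda(probs, s, depth):
    """Sum of p_w^s over all words of length <= depth (empty word included)."""
    total = 0.0
    for k in range(depth + 1):
        for w in product(range(len(probs)), repeat=k):
            pw = 1.0
            for c in w:
                pw *= probs[c]
            total += pw ** s
    return total

probs, s = [0.3, 0.7], 2.0
closed = 1.0 / (1.0 - lam(probs, s))
print(closed, truncated_lambda(probs, s, 16))  # truncation converges to the closed form
```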


slide-60
SLIDE 60

The density transformer and the transfer operators.

The operator H := Σ_{m ∈ Σ} H[m], with H[m][f](x) = |h′_m(x)| · f ∘ h_m(x),
is the density transformer of the dynamical system. It describes the evolution of the density: for a density f on [0, 1], H[f] is the density on [0, 1] after one iteration.

Transfer operator (Ruelle) [tangent version]:
H_s := Σ_{m ∈ Σ} H_{s,[m]}, with H_{s,[m]}[f](x) = |h′_m(x)|^s · f ∘ h_m(x).

Transfer operator (Vallée, 2000) [secant version]:
H_s := Σ_{m ∈ Σ} H_{s,[m]}, with H_{s,[m]}[F](x, y) = |(h_m(x) − h_m(y)) / (x − y)|^s · F(h_m(x), h_m(y)).
slide-61
SLIDE 61

Alternative expression of Λ(s) in the dynamical case.

slide-62
SLIDE 62

Alternative expression of Λ(s) in the dynamical case. The Dirichlet series Λk(s) :=

  • w∈Σk

ps

w,

Λ(s) :=

  • w∈Σ⋆

ps

w

are “generated” by the secant transfer operator Hs [V. 2000] Λk(s) = Hk

s[Ls](0, 1),

Λ(s) = (I − Hs)−1[Ls](0, 1) with L the secant of the distribution function F.

slide-63
SLIDE 63

Alternative expression of Λ(s) in the dynamical case. The Dirichlet series Λk(s) :=

  • w∈Σk

ps

w,

Λ(s) :=

  • w∈Σ⋆

ps

w

are “generated” by the secant transfer operator Hs [V. 2000] Λk(s) = Hk

s[Ls](0, 1),

Λ(s) = (I − Hs)−1[Ls](0, 1) with L the secant of the distribution function F. Singularities of s → Λ(s) are essential in the analysis. Singularities of (I − Hs)−1 are related to spectral properties of Hs.

slide-64
SLIDE 64

Alternative expression of Λ(s) in the dynamical case. The Dirichlet series
Λ_k(s) := Σ_{w ∈ Σ^k} p_w^s,   Λ(s) := Σ_{w ∈ Σ⋆} p_w^s
are "generated" by the secant transfer operator H_s [V. 2000]:
Λ_k(s) = H_s^k[L^s](0, 1),   Λ(s) = (I − H_s)^{−1}[L^s](0, 1),
with L the secant of the distribution function F.

Singularities of s ↦ Λ(s) are essential in the analysis. Singularities of (I − H_s)^{−1} are related to spectral properties of H_s. For s = 1, H_1 is an extension of H and has an eigenvalue equal to 1. For a system of the Good Class, s ↦ Λ(s) has a simple pole at s = 1.

slide-65
SLIDE 65

Plan of the talk.
– General motivations: Dirichlet generating functions and tameness.
– An important class of sources: dynamical sources.
– Tameness of dynamical sources.
– Conclusion and possible extensions.


slide-68
SLIDE 68

What happens on the left of the vertical line ℜs = 1? It is important for the analysis to deal with a region R where Λ(s) is tame: analytic there (except for s = 1) and of polynomial growth (as ℑs → ∞). Different possible regions R on the left of ℜs = 1 where Λ(s) is tame:
– Situation 1: a vertical strip, 1 − σ ≤ a;
– Situation 2: a hyperbolic region, 1 − σ ≤ t^{−α};
– Situation 3: a vertical strip with holes.

slide-69
SLIDE 69

Different possible regions on the left of ℜs = 1 where Λ(s) is tame.

Situation 1 Situation 2 Situation 3 Vertical strip Hyperbolic region Vertical strip with holes

slide-70
SLIDE 70

Different possible regions on the left of ℜs = 1 where Λ(s) is tame.

Situation 1 Situation 2 Situation 3 Vertical strip Hyperbolic region Vertical strip with holes

For which simple sources do these different situations occur?

slide-71
SLIDE 71

Different possible regions on the left of ℜs = 1 where Λ(s) is tame: Situation 1 (vertical strip), Situation 2 (hyperbolic region), Situation 3 (vertical strip with holes).
For which simple sources do these different situations occur? For memoryless sources relative to probabilities (p_1, p_2, ..., p_r):
– S1 is impossible;
– S3 occurs when all the ratios log p_i / log p_j are rational;
– S2 occurs if there exists a ratio log p_i / log p_j which is "diophantine" [badly approximable by rationals].


slide-76
SLIDE 76

Memoryless sources [r = 2]: Λ(s) = 1 / (1 − λ(s)) with λ(s) = p_1^s + p_2^s.
The tameness of Λ depends on arithmetical properties of log p_2 / log p_1, which influence Z := the set of poles of Λ on the line ℜs = 1 with s ≠ 1.
(i) Z ≠ ∅ ⟺ log p_2 / log p_1 is rational.
(ii) If Z = ∅, then the poles of Λ(s) close to ℜs = 1 are created by good rational approximations of log p_2 / log p_1.
The irrationality exponent µ(x) of a number x equals µ if, for any ν > µ, the set of pairs (a, b) ∈ Z² for which |x − a/b| ≤ 1/b^ν is finite; x is diophantine ⟺ µ(x) < ∞.
The shape of the tameness region is related to µ(log p_2 / log p_1): if µ(log p_2 / log p_1) = µ then, for any θ, ν with θ < µ < ν, the tameness region is as shown in the figure. [Flajolet-Roux-V. 2010]


slide-79
SLIDE 79

Different possible regions on the left of ℜs = 1 where Λ(s) is tame: Situation 1 (vertical strip, geometric condition), Situation 2 (hyperbolic region, arithmetic condition), Situation 3 (vertical strip with holes, periodicity condition).
For which general dynamical sources do these different situations occur?
– S1 occurs when "the branches are not too often of the same shape".
– S3 occurs only if the source is conjugated to a simple source.
– S2 occurs if an extension of the following condition holds: "there exists a ratio log p_i / log p_j which is diophantine".


slide-82
SLIDE 82

Situation 1: existence of a vertical strip where Λ(s) is tame. The condition UNI expresses that "the branches of the dynamical system are not too often of the same shape".
Theorem [Dolgopyat-Baladi-Cesaratto-V.]. For a good dynamical system which satisfies the condition UNI, there exists a vertical strip where Λ(s) is tame.
– Dolgopyat (98) proves the result for the plain transfer operator, in the case of a finite number of branches.
– Baladi and V. (03) extend the result to an infinite number of branches.
– Cesaratto and V. (09) extend the result to the secant transfer operator.

slide-83
SLIDE 83

Situation 2- Existence of a hyperbolic region where Λ(s) is tame The condition DIOP extends the arithmetic condition “There exists a ratio log pi/ log pj which is diophantine”

For a complete system, each branch h has a fixed point denoted by h⋆. The derivatives |h′(h⋆)| replace the probabilities of the memoryless case.

slide-84
SLIDE 84

Situation 2- Existence of a hyperbolic region where Λ(s) is tame The condition DIOP extends the arithmetic condition “There exists a ratio log pi/ log pj which is diophantine”

For a complete system, each branch h has a fixed point denoted by h⋆. The derivatives |h′(h⋆)| replace the probabilities of the memoryless case.

DIOP: There exists a ratio c(h, k) := log |h′(h⋆)| log |k′(k⋆)| which is diophantine.

slide-85
SLIDE 85

Situation 2- Existence of a hyperbolic region where Λ(s) is tame The condition DIOP extends the arithmetic condition “There exists a ratio log pi/ log pj which is diophantine”

For a complete system, each branch h has a fixed point denoted by h⋆. The derivatives |h′(h⋆)| replace the probabilities of the memoryless case.

DIOP: There exists a ratio c(h, k) := log |h′(h⋆)| log |k′(k⋆)| which is diophantine. Theorem [Dolgopyat-Roux-V.] For a good dynamical system which satisfies the condition DIOP, there exists an hyperbolic region where Λ(s) is tame.

slide-86
SLIDE 86

Situation 2: existence of a hyperbolic region where Λ(s) is tame. The condition DIOP extends the arithmetic condition "there exists a ratio log p_i / log p_j which is diophantine".
For a complete system, each branch h has a fixed point, denoted by h⋆. The derivatives |h′(h⋆)| replace the probabilities of the memoryless case.
DIOP: there exists a ratio c(h, k) := log |h′(h⋆)| / log |k′(k⋆)| which is diophantine.
Theorem [Dolgopyat-Roux-V.]. For a good dynamical system which satisfies the condition DIOP, there exists a hyperbolic region where Λ(s) is tame.
– Dolgopyat (98) proves the result for the plain transfer operator, in the case of a finite number of branches.
– Roux and V. (2010) extend the result to an infinite number of branches and to the secant transfer operator.
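For the Euclidean source, the branch h_m(x) = 1/(m + x) has fixed point h_m⋆ = (√(m² + 4) − m)/2 and |h′_m(h_m⋆)| = (h_m⋆)², so the DIOP ratios c(h_m, h_k) can be computed explicitly. A small illustration (the branch pairs are our choice; whether these ratios are diophantine is of course not settled by this computation):

```python
import math

def fixed_point(m):
    """Fixed point h_m* of h_m(x) = 1/(m + x): positive root of x^2 + m*x - 1 = 0."""
    return (math.sqrt(m * m + 4.0) - m) / 2.0

def log_abs_deriv(m):
    """log |h_m'(h_m*)|: since h_m'(x) = -1/(m + x)^2 and m + h_m* = 1/h_m*,
    this equals 2 * log(h_m*)."""
    return 2.0 * math.log(fixed_point(m))

# The DIOP ratios c(h_m, h_k) for the first few branches.
for m, k in ((1, 2), (1, 3), (2, 3)):
    print(m, k, log_abs_deriv(m) / log_abs_deriv(k))
```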

slide-87
SLIDE 87

Plan of the talk.
– General motivations: Dirichlet generating functions and tameness.
– An important class of sources: dynamical sources.
– Tameness of dynamical sources.
– Conclusion and possible extensions.


slide-91
SLIDE 91

Conclusions. Description of a framework which:
– unifies the analyses for text algorithms and searching/sorting algorithms,
– provides a general model for sources,
– shows the importance of the Dirichlet generating functions,
– explains the importance of tameness for sources,
– defines a natural subclass of sources, the dynamical sources,
– provides sufficient conditions for tameness of dynamical sources,
– provides probabilistic analyses for algorithms built on tame sources.


slide-93
SLIDE 93

Possible extensions and work in progress. I. Classification of sources:
– place of dynamical sources amongst general sources;
– a dynamical source = a limit of Markov chains with increasing order?
– comparing dynamical sources with Markov chains of variable length.



slide-96
SLIDE 96

Possible extensions and work in progress. II. Realistic analyses of other algorithms and other structures:
– analysis of other sorting algorithms;
– analysis of Insertion Sort: easy;
– analysis of QuickSelect: already done;
– and the selection algorithm?
– analysis of the DST structure?

[Figure: a digital search tree built on the words bbabb, abbaa, babba, aaaab, ababb, abbba.]