Tight links between normality and automata Olivier Carton IRIF - PowerPoint PPT Presentation

Tight links between normality and automata Olivier Carton IRIF Universit´ e Paris Diderot & CNRS Based on join works with V. Becher, P. Heiber and E. Orduna (Universidad de Buenos Aires & CONICET) Chennai – CAALM

Outline Normality Selection Compressibility Weighted automata and frequencies

Normal words A normal word is an infinite word such that all finite words of the same length occur in it with the same frequency. If x ∈ A ω and w ∈ A ∗ , the frequency of w in x is defined by | x [1 ..N ] | w freq( x, w ) = lim . N N →∞ where | z | w denotes the number of occurrences of w in z . A word x ∈ A ω is normal if for each w ∈ A ∗ : 1 freq( x, w ) = | A | | w | ◮ | A | is the cardinality of the alphabet A where ◮ | w | is the length of w .

Normal words (continued) Theorem (Borel, 1909) The decimal expansion of almost every real number in [0 , 1) is a normal word in the alphabet { 0 , 1 , ..., 9 } . Nevertheless, not so many examples have been proved normal. Some of them are: ◮ Champernowne 1933 (natural numbers): 12345678910111213141516171819202122232425 · · · ◮ Besicovitch 1935 (squares): 149162536496481100121144169196225256289324 · · · ◮ Copeland and Erd˝ os 1946 (primes): 235711131719232931374143475359616771737983 · · ·

Normality as randomness Normality is the poor mans’s randomness. This is the least requirement one can expect from a random sequence. This is much weaker than Martin-L¨ of randomness which implies non-computability.

Selection rules ◮ If x = a 1 a 2 a 3 · · · is a normal infinite word, then so is x ′ = a 2 a 3 a 4 · · · made of symbols at all positions but the first one. ◮ If x = a 1 a 2 a 3 · · · is normal infinite word, then so is x ′ = a 2 a 4 a 6 · · · made of symbols at even positions. ◮ What about selecting symbols at positions 2 n ? ◮ What about selecting symbols at prime positions ? ◮ What about selecting symbols following a 1 ? ◮ What about selecting symbols followed by a 1 ?

Oblivious prefix selection Let L ⊆ A ∗ be a set of finite words and x = a 1 a 2 a 3 · · · ∈ A ω . The prefix selection of x by L is the word x ↾ L = a i 1 a i 2 a i 3 · · · where { i 1 < i 2 < i 3 < · · ·} = { i : a 1 a 2 · · · a i − 1 ∈ L } . Example (Symbols following a 1) If L = (0 + 1) ∗ 1, then i 1 − 1 , i 2 − 1 , i 3 − 1 are the positions of 1 in x and x ↾ L is made of the symbols following a 1. Theorem (Agafonov 1968) Prefix selection by a rational set of finite words preserves normality. The selection can be realized by a transducer. Example (Selection of symbols following a 1) 1 | ε q 0 q 1 0 | ε 1 | 1 0 | 0

Oblivious suffix selection Let X ⊆ A ω be a set of infinite words and x = a 1 a 2 a 3 · · · ∈ A ω . The suffix selection of x by X is the word x ↿ X = a i 1 a i 2 a i 3 · · · where { i 1 < i 2 < i 3 < · · ·} = { i : a i +1 a i +2 a i +3 · · · ∈ X } . Example (Symbols followed by a 1) If L = 1(0 + 1) ω , then i 1 + 1 , i 2 + 1 , i 3 + 1 are the positions of 1 in x and x ↿ X is made of the symbols followed by a 1. Theorem Suffix selection by a rational set of infinite words preserves normality. Combining prefix and suffix does not preserve normality in general. Selecting symbols having a 1 just before and just after them does not preserve normality.

Transducers Input tape a 0 a 1 a 2 a 3 a 4 a 5 a 6 a 7 Q Output tape b 0 b 1 b 2 b 3 b 4 b 5 b 6 Transitions p a | v → q for a ∈ A , v ∈ B ∗ . − −

Example 1 | 1 q 0 q 1 0 | 0 1 | ε 0 | 0 0 1 1 0 0 1 1 1 0 q 0 0

Example 1 | 1 q 0 q 1 0 | 0 1 | ε 0 | 0 0 1 1 0 0 1 1 1 0 q 0 0 1

Example 1 | 1 q 0 q 1 0 | 0 1 | ε 0 | 0 0 1 1 0 0 1 1 1 0 q 1 0 1

Example 1 | 1 q 0 q 1 0 | 0 1 | ε 0 | 0 0 1 1 0 0 1 1 1 0 q 1 0 1 0

Example 1 | 1 q 0 q 1 0 | 0 1 | ε 0 | 0 0 1 1 0 0 1 1 1 0 q 0 0 1 0 0

Example 1 | 1 q 0 q 1 0 | 0 1 | ε 0 | 0 0 1 1 0 0 1 1 1 0 q 0 0 1 0 0 1

Example 1 | 1 q 0 q 1 0 | 0 1 | ε 0 | 0 0 1 1 0 0 1 1 1 0 q 1 0 1 0 0 1

Example 1 | 1 q 0 q 1 0 | 0 1 | ε 0 | 0 0 1 1 0 0 1 1 1 0 q 1 0 1 0 0 1 0

Characterization of normal words An infinite word x = a 1 a 2 a 3 · · · is compressible by a transducer a 1 | v 1 a 2 | v 2 a 3 | v 3 if there is an accepting run q 0 − − − → q 1 − − − → q 2 − − − → q 3 · · · satisfying | v 1 v 2 · · · v n | log | B | lim inf | a 1 a 2 · · · a n | log | A | < 1 . n →∞ Theorem (Schnorr, Stimm and others) An infinite word is normal if and only if it cannot be compressed by deterministic one-to-one transducers. Similar to the characterization of Martin-L¨ of randomness by non-compressibility by prefix Turing machines. lim inf n →∞ H ( x [1 ..n ]) − n > −∞ where H is the prefix Kolmogorov complexity.

Ingredients Shannon (1958) ◮ frequency of u different from b −| u | implies non maximum entropy ◮ non-maximum entropy implies compressibility Huffman (1952) ◮ simple greedy implementation of Shannon’s general idea ◮ implementation by a finite state tranducer

Robust characterization Transducers can be replaced by ◮ Non-deterministic but functional one-to-one transducers ◮ Transducers with one counter ◮ Two-way transducers det non-det non-rt finite-state N N N 1 counter N N N ≥ 2 counters N N T 1 stack ? C C 1 stack + 1 counter C C T where N means cannot compress normal words C means can compress some normal word T means is Turing complete and thus can compress.

Non-compressibility implies selection Compression 0 1 0 0 1 0 0 1 0 1 0 0 1 1 0 1 0 1 1 0 1 0 1 1 0 1 Selection Merge 0 1 1 0 1 0 0 1 1 0 0 1 0 1 1 1 0 0 1 0 1 1 1 0 1 1 1 1 0 1 0 1 1 0 1 1 0 0 1

Preservation of normality A functional transducer T is said to preserve normality if for every normal word x ∈ A ω , T ( x ) is also normal. Question Given a deterministic complete transducer T , does T preserve normality?

Weighted Automata A weighted automaton T is an automaton whose transitions, not only consume a symbol from an input alphabet A , but also have a transition weight in R and whose states have initial weight and final weight in R . 1:1 0:2 0:1 1:2 1 1:1 1 q 0 q 1 This weighted automaton computes the value of a binary number.

b 1 → · · · b n b 2 The weight of a run q 0 − → q 1 − − → q n in A is the product of the weights of its n transitions times the initial weight of q 0 and the final weight of q n . 1:1 0:2 0:1 1:2 1 1:1 1 q 0 q 1 1 1 0 weight A ( q 0 − → q 0 − → q 1 − → q 2 ) = 1 ∗ 1 ∗ 1 ∗ 2 ∗ 1 = 2

b 1 b 2 → · · · b n The weight of a run q 0 − → q 1 − − → q n in A is the product of the weights of its n transitions times the initial weight of q 0 and the final weight of q n . 1:1 0:2 0:1 1:2 1 1:1 1 q 0 q 1 The weight of a word w in A is given by the sum of weights of all runs labeled with w : � weight A ( w ) = weight A ( γ ) γ run on w 1 1 0 weight A (110) = weight A ( q 0 − → q 0 − → q 1 → q 1 ) + − 1 1 0 weight A ( q 0 − → q 1 − → q 1 − → q 1 ) = 2 + 4 = 6

Theorem For every strongly connected deterministic transducer T there exists a weighted automaton A such that for any finite word w and any normal word x , weight A ( w ) is exactly the frequency of w in T ( x ) . Example 1 5 1 / 6 b : 1 / 4 b :1 a :1 b | ba 1 1 a | a a | λ 1 3 a : 1 / 2 1 3 2 / 3 b | λ b : 1 / 2 a | λ b : 1 / 4 b :1 b | bb 1 / 6 1 2 4 2 1 b : 1 / 2 Transducer T Weighted Automaton A

Deciding preservation of normality Proposition Such a weighted automaton can be computed in cubic time with respect to the size of the transducer. Theorem It can decided in cubic time whether a given deterministic transducer does preserve normality (that is sends each normal word to a normal word)

Recap of the links between automata and normality ◮ Selecting with an automaton in an normal word preserves normality. ◮ Normality is characterized by non-compressibility by finite state machines. ◮ Frequencies in the output of a deterministic transducer are given by a weighted automaton. Thank you

Tight links between normality and automata Olivier Carton IRIF - PowerPoint PPT Presentation

Tight links between normality and automata Olivier Carton IRIF Universit e Paris Diderot & CNRS Based on join works with V. Becher, P. Heiber and E. Orduna (Universidad de Buenos Aires & CONICET) Chennai CAALM Outline

Links Student Web Presence Guidelines Summary 1. The Purpose of Links 2. Worst Links 3. Best

Departures from Normality Departures from Normality Many statistical test depend on our

Chapter 2 Tight-frames An Introduction 1 Outline 1. Tight-frame 1. Tight-frame 2. Matrix

CSC 473 Automata, Grammars & Languages 9/29/10 Automata, Grammars and Languages Discourse 03

LINKS AND RULES GENOME VISUALIZATION WITH CIRCOS LINKS AND RULES 1 Martin Krzywinski

How of the Conceptual Future Internet Links lead to links that link to other links. Many

Non-Normality / Non-Gaussianity and Filtering Cris%an Proistosescu, Andy Rhines,

Regression Diagnostics Procedures ASSUMPTIONS UNDERLYING REGRESSION/CORRELATION NORMALITY OF

Normality tests P RACTICIN G S TATIS TICS IN TERVIEW QUES TION S IN R Zuzanna Chmielewska

Normality and Automata Olivier Carton LIAFA Universit e Paris Diderot & CNRS Join work

Normality and preservation of measure in cellular automata Silvio Capobianco 1 1 Institute of

Automata and program analysis Thomas Colcombet FCT Bordeaux 13 September 2017 based on

Graph Automata Jan Leike July 2nd, 2012 Motivation We want an automata model that Motivation

Applied Automata Theory Roland Meyer TU Kaiserslautern Roland Meyer (TU KL) Applied Automata

Applied Automata Theory Roland Meyer TU Kaiserslautern Roland Meyer (TU KL) Applied Automata

Multiple tree automata a new model of tree automata Gwendal Collet (TU Wien), Julien David (LIPN)

LATE and the Generalized Roy Model: Some Relationships James J. Heckman University of Chicago

A Power Comparison for Testing Normality Shigekazu Nakagawa, Hiroki Hashiguchi, and Naoto Niki

Convergence results for the Bayesian inversion theory Hanna Katriina Pikkarainen Johann Radon

Normality for approach spaces and contractive realvalued maps Mark Sioen joint with Eva

Paola Inverardi Software Engineering and Architecture Group Dipartimento di Ingegneria e Scienze

Quantifying Chance Part 2: Understanding Chance INFO-1301, Quantitative Reasoning 1 University

Normal Distributions MATH 107: Finite Mathematics University of Louisville April 2, 2014 Normal

Probability and Statistics for Computer Science Can

Sambuz

Useful Links

Newsletter

Mail Us

Tight links between normality and automata Olivier Carton IRIF - PowerPoint PPT Presentation

Tight links between normality and automata Olivier Carton IRIF Universit e Paris Diderot & CNRS Based on join works with V. Becher, P. Heiber and E. Orduna (Universidad de Buenos Aires & CONICET) Chennai CAALM Outline

Links Student Web Presence Guidelines Summary 1. The Purpose of Links 2. Worst Links 3. Best

Departures from Normality Departures from Normality Many statistical test depend on our

Chapter 2 Tight-frames An Introduction 1 Outline 1. Tight-frame 1. Tight-frame 2. Matrix

CSC 473 Automata, Grammars &amp; Languages 9/29/10 Automata, Grammars and Languages Discourse 03

LINKS AND RULES GENOME VISUALIZATION WITH CIRCOS LINKS AND RULES 1 Martin Krzywinski

How of the Conceptual Future Internet Links lead to links that link to other links. Many

Non-Normality / Non-Gaussianity and Filtering Cris%an Proistosescu, Andy Rhines,

Regression Diagnostics Procedures ASSUMPTIONS UNDERLYING REGRESSION/CORRELATION NORMALITY OF

Normality tests P RACTICIN G S TATIS TICS IN TERVIEW QUES TION S IN R Zuzanna Chmielewska

Normality and Automata Olivier Carton LIAFA Universit e Paris Diderot &amp; CNRS Join work

Normality and preservation of measure in cellular automata Silvio Capobianco 1 1 Institute of

Automata and program analysis Thomas Colcombet FCT Bordeaux 13 September 2017 based on

Graph Automata Jan Leike July 2nd, 2012 Motivation We want an automata model that Motivation

Applied Automata Theory Roland Meyer TU Kaiserslautern Roland Meyer (TU KL) Applied Automata

Applied Automata Theory Roland Meyer TU Kaiserslautern Roland Meyer (TU KL) Applied Automata

Multiple tree automata a new model of tree automata Gwendal Collet (TU Wien), Julien David (LIPN)

LATE and the Generalized Roy Model: Some Relationships James J. Heckman University of Chicago

A Power Comparison for Testing Normality Shigekazu Nakagawa, Hiroki Hashiguchi, and Naoto Niki

Convergence results for the Bayesian inversion theory Hanna Katriina Pikkarainen Johann Radon

Normality for approach spaces and contractive realvalued maps Mark Sioen joint with Eva

Paola Inverardi Software Engineering and Architecture Group Dipartimento di Ingegneria e Scienze

Quantifying Chance Part 2: Understanding Chance INFO-1301, Quantitative Reasoning 1 University

Normal Distributions MATH 107: Finite Mathematics University of Louisville April 2, 2014 Normal

Probability and Statistics for Computer Science Can

Sambuz

Useful Links

Newsletter

Mail Us

CSC 473 Automata, Grammars & Languages 9/29/10 Automata, Grammars and Languages Discourse 03

Normality and Automata Olivier Carton LIAFA Universit e Paris Diderot & CNRS Join work