Marseille 2019 On the digits of primes
In memoriam
Christian MAUDUIT
Jo¨ el RIVAT Institut de Math´ ematiques de Marseille, Universit´ e d’Aix-Marseille.
1
Marseille 2019 On the digits of primes In memoriam Christian - - PowerPoint PPT Presentation
Marseille 2019 On the digits of primes In memoriam Christian MAUDUIT Jo e l RIVAT Institut de Math e matiques de Marseille, Universit e dAix-Marseille. 1 2 Digits Let q 2 be an integer. Any n N can be written j ( n
In memoriam
Jo¨ el RIVAT Institut de Math´ ematiques de Marseille, Universit´ e d’Aix-Marseille.
1
2
Digits Let q 2 be an integer. Any n ∈ N can be written n =
εj(n) qj, εj(n) ∈ {0, . . . , q − 1}. The sum of digits function s(n) =
εj(n) has been studied in many directions: ergodicity, finite automata, dynamical systems, harmonic analysis, number theory, etc. Mahler, 1927: For q = 2, the sequence
1
N
(−1)s(n) (−1)s(n+k)
N1
converges for all k ∈ N and its limit is different from zero for infinitely many k’s.
3
Histogram of the sum of binary digits of integers (binomial distribution)
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33
0.2 0.4 0.6 0.8 1 1.2 1.4 ·109 k card{n 1010, s(n) = k}
4
Gelfond’s paper Gelfond, 1968: The sum of digits in base q 2 is well distributed along arithmetic progressions. More precisely given m 2 with (m, q − 1) = 1, there exists an explicit σm > 0 such that ∀m′ ∈ N∗, ∀(n′, s) ∈ Z2,
n≡n′ mod m′ s(n)≡s mod m
1 = x mm′ + O(x1−σm). A.O. Gelfond
5
Gelfond’s problems, 1968
suitable polynomial [for example P(n) = n2].
6
Histogram of the sum of binary digits of prime numbers
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33
1 2 3 4 5 6 7 ·107 k card{p 1010, s(p) = k}
7
Sum of binary digits of prime numbers in residue classes 1 0.5 1 1.5 2 2.5 ·108 modulo 2 1 2 0.5 1 1.5 ·108 modulo 3 1 2 3 0.2 0.4 0.6 0.8 1 1.2 ·108 modulo 4 1 2 3 4 0.2 0.4 0.6 0.8 1 ·108 modulo 5 1 2 3 4 5 2 4 6 8 ·107 modulo 6 1 2 3 4 5 6 2 4 6 ·107 modulo 7
8
Partial results Fouvry–Mauduit (1996):
n=p or n=p1p2 s(n)≡a mod m
1 C(q, m) log log x
n=p or n=p1p2
1. Dartyge–Tenenbaum (2005): For r 2,
n=p1...pr s(n)≡a mod m
1 C(q, m, r) log log x log log log x
n=p1...pr
1.
9
Gelfond’s conjecture for primes Mauduit-Rivat, 2010: If (q − 1)α ∈ R \ Z, there exists Cq(α) > 0 and σq(α) > 0,
exp(2iπα s(p))
Hence
(here (pn)n1 denotes the sequence of prime numbers).
s(p)≡a mod m
1 ∼ 1 m
1 (x → +∞).
10
Histogram of local result for prime numbers
1 2 3 4 5 6 7 8 9 10 11 12 13 14 20 21 22 23 24 25 26 27 28 29 30 31 32 33
1 2 3 4 5 6 7 ·107 k card{p 1010, s(p) = k}
11
Local result for prime numbers Drmota-Mauduit-Rivat, 2009: uniformly for all integers k 0 with (k, q − 1) = 1 #{p x : s(p) = k} = q − 1 ϕ(q − 1) π(x)
q logq x
−(k − µq logq x)2
2σ2
q logq x
2+ε)
where µq = q − 1 2 , σ2
q = q2 − 1
12 and ε > 0 is arbitrary but fixed. Such a local result was considered by Erd˝
.
12
Discrete Fourier Transform For f : N → C and λ ∈ N we define a 2λ-periodic function fλ : Z → C by ∀u ∈ {0, . . . , 2λ − 1}, fλ(u) = f(u) and its Discrete Fourier Transform
2λ
fλ(u) exp
−2iπut
2λ
By orthogonality
1/2
= 1. A non-trivial upper bound for
Getting
13
The Rudin-Shapiro sequence Let f(n) = (−1)
For λ ∈ N, we have fλ(t) = 2−λ · Schapiro polynomial
, hence
1−λ 2 .
Since
2
λ−1 2
λ 2
The proof for the sum of digits function requires
2.
This is not satisfied for the Rudin-Shapiro sequence !!!
14
Histogram of“Rudin-Shapiro sums”of prime numbers
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
−0.5 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 ·107 k card
j1 εj−1(p) εj(p) = k
“Rudin-Shapiro sums”of prime numbers in residue classes 1 0.5 1 1.5 2 2.5 ·108 modulo 2 1 2 0.5 1 1.5 ·108 modulo 3 1 2 3 0.2 0.4 0.6 0.8 1 1.2 ·108 modulo 4 1 2 3 4 0.2 0.4 0.6 0.8 1 ·108 modulo 5 1 2 3 4 5 2 4 6 8 ·107 modulo 6 1 2 3 4 5 6 2 4 6 ·107 modulo 7
16
Rudin-Shapiro sequences of order δ Let δ ∈ N and βδ(n) the number of occurencies of patterns 1 ∗ · · · ∗
δ
1, i.e. of the form 1w1 (where w ∈ {0, 1}δ) in the representation of n: βδ(n) =
εk−δ−1(n) εk(n). Mauduit-Rivat, 2015: for any δ ∈ N, α ∈ R, ϑ ∈ R and x 2, there exists explicit constants C(δ) and σ(α) > 0 such that
Λ(n) e (βδ(n)α + ϑn)
11 4 x1−σ(α)
and
µ(n) e (βδ(n)α + ϑn)
11 4 x1−σ(α). 17
Rudin-Shapiro sequences of degree d Let d ∈ N with d 2 and bd(n) denote the number of occurencies of 1 · · · 1
d
i.e. blocks of d consecutive 1’s in the representation of n in base 2: bd(n) =
εk−d+1(n) · · · εk(n). Mauduit-Rivat, 2015: for any d ∈ N with d 2, α ∈ R, ϑ ∈ R and x 2 there exist an explicit constant σ(d, α) > 0 such that
Λ(n) e (bd(n)α + ϑn)
11 4 x1−σ(d,α),
µ(n) e (bd(n)α + ϑn)
11 4 x1−σ(d,α). 18
General result – Definitions Let U = {z ∈ C, |z| = 1}. Definition 1 A function f : N → U has the carry property if, uniformly for (λ, κ, ρ) ∈ N3 with ρ < λ, the number of integers 0 ℓ < qλ such that there exists (k1, k2) ∈ {0, . . . , qκ −1}2 with f(ℓqκ + k1 + k2) f(ℓqκ + k1) = fκ+ρ(ℓqκ + k1 + k2) fκ+ρ(ℓqκ + k1) is at most O(qλ−ρ) where the implied constant may depend only on q and f. Definition 2 Given a non decreasing function γ : R → R satisfying limλ→+∞ γ(λ) = +∞ and c > 0 we denote by Fγ,c the set of functions f : N → U such that for (κ, λ) ∈ N2 with κ cλ and t ∈ R:
f(uqκ) e (−ut)
19
General result Let γ : R → R be a non decreasing function satisfying limλ→+∞ γ(λ) = +∞, c 10 and f : N → U be a function satisfying Definition 1 and f ∈ Fγ,c in Definition 2. Then for any θ ∈ R we have
Λ(n)f(n) e (θn)
with explicit c1(q) and c2(q). Of course the same estimate holds if we replace the von Mangoldt function Λ by the M¨
function µ. M¨ ullner, 2018, has extended this result to all automatic sequences !
20
Primes in two bases Drmota-Mauduit-Rivat, 2019+: If f is a strongly q1-multiplicative function and g a strongly q2-multiplicative function such that (q1, q2) = 1 and f is is not of the form n → e(kn/(q1 − 1)) with k ∈ Z, then we have uniformly for ϑ ∈ R
Λ(n)f(n)g(n) e(ϑn)
log x log log x
The proof uses Schlickewei’s p-adic subspace theorem and Bakers’s theorem on linear form of logarithms. This does not permit to save a power of x.
21
Gelfond’s conjecture for squares Mauduit-Rivat, 2009: if (q − 1)α ∈ R \ Z, there exist Cq(α) > 0 and σq(α) > 0,
exp(2iπα s(n2))
Hence
s(n2)≡a mod m
1 ∼ x m (x → +∞).
22
Gelfond’s conjecture for polynomials Drmota-Mauduit-Rivat, 2011: let d 2, q q0(d), and P ∈ Z[X] of degree d such that P(N) ⊂ N for which the leading coefficient ad is co-prime to q. If (q − 1)α ∈ R \ Z then there exists c = c(q, d) > 0 with
exp(2iπα s(P(n))) ≪ x1−c(q−1)α2. Furthermore q0(d) e67d3(log d)2. With a more technical proof the assumptions that q is prime and that (ad, q) = 1 can be relaxed. The case q < q0(d), remains an open problem.
23
0110100110010110100101100110100110010110011010010110100110010110 1001011001101001011010011001011001101001100101101001011001101001 1001011001101001011010011001011001101001100101101001011001101001 0110100110010110100101100110100110010110011010010110100110010110 1001011001101001011010011001011001101001100101101001011001101001 0110100110010110100101100110100110010110011010010110100110010110 0110100110010110100101100110100110010110011010010110100110010110 1001011001101001011010011001011001101001100101101001011001101001
1001011001101001011010011001011001101001100101101001011001101001 0110100110010110100101100110100110010110011010010110100110010110 0110100110010110100101100110100110010110011010010110100110010110 1001011001101001011010011001011001101001100101101001011001101001 0110100110010110100101100110100110010110011010010110100110010110 1001011001101001011010011001011001101001100101101001011001101001 1001011001101001011010011001011001101001100101101001011001101001 0110100110010110100101100110100110010110011010010110100110010110
24
The Thue-Morse sequence The Thue-Morse sequence t = (tn)n∈N can be defined by induction: t0 = 0, t2n = tn, t2n+1 = 1 − tn. It is easy to check that tn ≡ s2(n) mod 2. It is the fixed point of the substitution t0 = 0, 0 → 01, 1 → 10 : 01 0110 01101001 0110100110010110 01101001100101101001011001101001 0110100110010110100101100110100110010110011010010110100110010110
25
Symbolic complexity Definition 3 The symbolic complexity of a sequence u ∈ {0, 1}N is the function pu defined for any integer k 1 by pu(k) = card{(b0, . . . , bk−1) ∈ {0, 1}k, ∃i / ui = b0, . . . , ui+k−1 = bk−1} (i.e. pu(k) is the number of distinct factors of length k that occur in the sequence u). Let (X(u), T) be the dynamical system where T is the shift on {0, 1}N and X(u) the closure
The topological entropy of (X(u), T) can be shown to be equal to lim
k→∞
log pu(k) k . In that sense pu constitutes a measure for the pseudorandomness of the sequence u.
26
The Thue Morse sequence is very ” simple”
The symbolic complexity of t is very low: there exist c1 > 0 and c2 > 0 such that, for all k 1, c1k pt(k) c2k. Zero topological entropy of the corresponding dynamical system: h = lim
k→∞
log pt(k) k = 0. For any fixed (a, b) ∈ N2 it is easy to check that the sequence ta,b = (tan+b)n∈N is also
It follows that its symbolic complexity is also sublinear: pta,b(k) = Oa(k) and that any sym- bolic dynamical system (X(ta,b), T) obtained by extracting a subsequence of t along arithmetic progressions still has zero topological entropy.
27
The Thue-Morse sequence along squares Picking the values at square positions 0110100110010110100101100110100110010110011010010110100, we get 0110110111110010111110110100110111111011110110100111 00011011001011111011100111111010011111011001011011110. Moshe, 2007 (conjectured by Allouche and Shallit in 2003): the subword complexity of (tn2)n0 is p(2)
k
= 2k, i.e. for k 1, every word b1 · · · bk with bj ∈ {0, 1} appears in (tn2)n0. Mauduit-Rivat, 2009: Both letters 0 and 1 have frequency 1
2.
Question: what is the frequency of a given word ?
28
The Thue-Morse sequence along squares is normal A sequence u ∈ {0, 1}N is normal if, for any k ∈ N and any (b0, . . . , bk−1) ∈ {0, 1}k: lim
N→∞
1 N card{i < N, ui = b0, . . . , ui+k−1 = bk−1} = 1 2k. Notion introduced by Borel in 1909. First explicit construction by Champernowne in 1933. Only few explicit constructions are known. Drmota-Mauduit-Rivat, 2019+: The sequence (tn2)n∈N is normal. This theorem provides a new method to construct normal numbers in a given base. The real number α =
∞
tn2 2n is normal in base 2.
29
30
Approach to digital problem
31
A variant of van der Corput’s inequality For all complex numbers z1, . . . , zL and integers k 1, R 1 we have
zℓ
L + kR − k R
L
|zℓ|2 + 2
R−1
R
L−kr
ℜ
.
(for k = 1 this is the classical van der Corput’s inequality.) Idea: Taking zℓ = exp(2iπϕ(ℓ)) for some function ϕ, since r is small we can take advantage a better control of the difference ϕ(ℓ + kr) − ϕ(ℓ) instead of the more general ϕ(ℓ′) − ϕ(ℓ). In base q this is useful when f is q-additive. With k = 1, this will permits to remove the upper digits. With k = qµ, this will permits to remove the lower digits.
32
Removing the upper digits f is q-additive if for all k 0 and all (a0, . . . , ak) ∈ {0, . . . , q − 1}k+1, f(a0 + a1q + · · · + akqk) = f(a0) + f(a1q) + · · · + f(akqk). Consider the difference f(a + b) − f(a), with b ≍ qβ much smaller that a ≍ qα. Example: a =
α
b = 396576345354568797095646467570
, In the sum a + b the digits of index β may change only by carry propagation. The proportion
can ignore these exceptional pairs and replace f(a + b) − f(a) by fβ+ρ(a + b) − fβ+ρ(a) where fβ+ρ is the truncated f function which considers only the digits of index < β + ρ: fβ+ρ(n) := f(n mod qβ+ρ) which is periodic of period qβ+ρ.
33
Removing the lower digits Now if a ≍ qα, c ≍ qγ with γ + µ α, consider the difference fβ+ρ(a + qµc) − fβ+ρ(a). Example: a =
α
qµc = 396576345354568797095646467571
000000000000
, In the sum a + qµc the digits of index < µ are not modified. We have fµ(a + qµc) = fµ(a) so fβ+ρ(a + qµc) − fβ+ρ(a) = (fβ+ρ − fµ)(a + qµc) − (fβ+ρ − fµ)(a) and fβ+ρ − fµ depends only on the digits of index ∈ {µ, . . . , β + ρ − 1}. If f is a more general digital function (not q-additive) these arguments need to be adapted.
34
Detection of digits Let rκ1,κ2(a) be the integer obtained using the digits of a of indexes κ1, . . . , κ2 − 1. We have rκ1,κ2(a) = u ⇐ ⇒ a qκ2 ∈
qκ2−κ1, u + 1 qκ2−κ1
It remains to detect which points belong to an interval modulo 1. For 0 α < 1 let χα(x) = ⌊x⌋ − ⌊x − α⌋. For any integer H 1 there exist real valued trigonometric polynomials such that for all x ∈ R,
(using Vaaler’s kernel derived from the Beurling-Selberg function) with Aα,H(x) =
aα,H(h) exp(2iπhx) Bα,H(x) =
bα,H(h) exp(2iπhx), with aα,H(0) = α,
1 π|h|
H+1.
35
Muldimensional approximation How to detect points in a small d-dimensional box (modulo 1) ? For (α1, . . . , αd) ∈ [0, 1)d and (H1, . . . , Hd) ∈ Nd with H1 1,. . . , Hd 1, we have for all (x1, . . . , xd) ∈ Rd
χαj(xj) −
d
Aαj,Hj(xj)
χαj(xj)
Bαj,Hj(xj) where Aα,H(.) and Bα,H(.) are (Vaaler’s) real valued trigonometric polynomials. This reminds to Koksma’s inequality
36
Sum over prime numbers By partial summation
f(p) − →
Λ(n) f(n) where Λ(n) is von Mangoldt’s function. Advantage: convolutions ! Λ ∗ 1 = log, i.e.
Λ(d) = log n. A classical process (Vinogradov, Vaughan, Heath-Brown) remains (using some more technical details), for some 0 < β1 < 1/3 and 1/2 < β2 < 1, to estimate uniformly the sums SI :=
f(mn)
where MN = x (which implies that the“easy”sum over n is long) and for all complex numbers am, bn with |am| 1, |bn| 1 the sums SII :=
ambn f(mn) for xβ1 < M xβ2 (type II), (which implies that both sums have a significant length).
37
Sums of type I The knowledge of the function g should permit to deal with the sum
f(mn) =
k≡0 mod m
f(k) = 1 m
f(k) e
kℓ
m
In our case f(n) is some digital function. Some arguments from Fouvry and Mauduit (1996) can be generalized. We use the large sieve inequality: If x1, . . . , xR ∈ R satisfy xr − xs δ for all r = s and a1, . . . , aK ∈ C, then
R
ak e(kxr)
δ
|ak|2 .
38
Sums of type II By Cauchy-Schwarz: |SII|2 M
bn f(mn)
. Expanding the square and exchanging the summations leads to a smooth sum over m, but also two free variables n1 and n2 with no control. Using Van der Corput’s inequality we have n1 = n + r and n2 = n so that the size of n1 − n2 = r is small. It remains to estimate
bn+r bn
f(m(n + r))f(mn). If f(k) = e(αg(k)) we have the difference g(mn + mr) − g(mn) where the upper digits might be removed.
39
After two van der Corput’s inequalities We have also removed the lower digits and are led to consider
fµ1,µ2(mn + mr + qµ1sn + qµ1rs)fµ1,µ2(mn + mr) fµ1,µ2(mn + qµ1sn)fµ1,µ2(mn), so that for mn and mn+mr we need to detect their digits of indexes between µ1 and µ2 while for sn and rs it is sufficient to look the digits of index < µ2 − µ1. This is done by Fourier analysis.
40
Sums of type II - Fourier analysis We are now working modulo qµ2−µ1. We consider Discrete Fourier Transform
1 qµ2−µ1
fµ1,µ2(qµ1u) e
qµ2−µ1
By Fourier inversion formula and exchanges of summations we must show that the quantity
fµ1,µ2(h1) fµ1,µ2(h2) fµ1,µ2(h3)
e
qµ2
The geometric sums over m and n give very good estimates except for“diagonal”terms like those for h0 + h1 + h2 + h3 = 0. Then the proof depends on the properties of fµ1,µ2.
41
Comments on the proof This approach is sufficient to study the Thue-Morse sequence, the Rudin-Shapiro sequence, and even more general sequences over primes. It can be adapted for squares. For fµ1,µ2 we need“only”that the L∞-norm is small, which is a big progress in comparison with the original method of Mauduit-Rivat, 2010. But it is not sufficient to prove the normality of the Thue-Morse sequence along squares. We need to consider more difficult Fourier transforms GI
λ(h, d) = 1
2λ
e
k−1
αℓfλ(u + ℓd + iℓ) − h2−λ
,
where I = (i0, . . . , ik−1) ∈ Nk. The L∞-norm is not small uniformly over d. We handle them using appropriate matrix norms, paths in graphs,. . .
42
Open problems For any polynomial of degree 3 taking values in N, is it true that (tP(n))n∈N is normal ? Mauduit-Rivat, 2010: the frequencies of 0 and 1 in the sequence (tpn)n∈N are equal to 1
2
(where (pn)n∈N denote the sequence of prime numbers). For any non constant polynomial taking values in N, is it true that (tP(pn))n∈N is normal ? Moreover it would be interesting to find some other almost periodic sequences u with the same property and also to understand this phenomenon from the dynamical system point of view.
43
44