SLIDE 1 Trace functions over finite fields: a study in sums of products
ETH Z¨ urich
May 29, 2014
SLIDE 2 Trace functions over finite fields: a study in sums of products
ETH Z¨ urich
May 29, 2014 [Joint works with ´
- E. Fouvry, Ph. Michel (and in part S. Ganguly,
- G. Ricotta); arXiv:1405.2293]
SLIDE 3
Trace functions
The trace functions modulo a prime p are functions K : Fp − → C which are “special” functions of algebraic nature.
SLIDE 4
Trace functions
The trace functions modulo a prime p are functions K : Fp − → C which are “special” functions of algebraic nature.
◮ Precisely, we consider trace functions of middle-extension ℓ-adic
sheaves F on the affine line, pointwise pure of weight 0, brought to C by a fixed ι : ¯ Qℓ − → C.
SLIDE 5
Trace functions
The trace functions modulo a prime p are functions K : Fp − → C which are “special” functions of algebraic nature.
◮ Precisely, we consider trace functions of middle-extension ℓ-adic
sheaves F on the affine line, pointwise pure of weight 0, brought to C by a fixed ι : ¯ Qℓ − → C.
Such a trace function has a “complexity”c(K).
SLIDE 6 Trace functions
The trace functions modulo a prime p are functions K : Fp − → C which are “special” functions of algebraic nature.
◮ Precisely, we consider trace functions of middle-extension ℓ-adic
sheaves F on the affine line, pointwise pure of weight 0, brought to C by a fixed ι : ¯ Qℓ − → C.
Such a trace function has a “complexity”c(K).
◮ We define c(K) as the minimum of c(F) over sheaves as above with
trace function K, where c(F) = rank(F) + |(sing. points)| +
Swanx(F) ≥ 1.
SLIDE 7 Trace functions
The trace functions modulo a prime p are functions K : Fp − → C which are “special” functions of algebraic nature.
◮ Precisely, we consider trace functions of middle-extension ℓ-adic
sheaves F on the affine line, pointwise pure of weight 0, brought to C by a fixed ι : ¯ Qℓ − → C.
Such a trace function has a “complexity”c(K).
◮ We define c(K) as the minimum of c(F) over sheaves as above with
trace function K, where c(F) = rank(F) + |(sing. points)| +
Swanx(F) ≥ 1.
We typically let p vary, and consider Kp modulo p with bounded conductor: c(Kp) ≤ C for all p.
SLIDE 8
Examples
SLIDE 9
Examples
◮ (Characters) K(x) = e(f (x)/p) or K(x) = χ(f (x)), where χ = 1 is
a multiplicative character, and f ∈ Fp[X] is non-constant; the conductor is bounded in terms of deg(f ) only;
SLIDE 10 Examples
◮ (Characters) K(x) = e(f (x)/p) or K(x) = χ(f (x)), where χ = 1 is
a multiplicative character, and f ∈ Fp[X] is non-constant; the conductor is bounded in terms of deg(f ) only;
◮ (Hyper)-Kloosterman sums: for r ≥ 1 integer
K(x) = Klr(x) = 1 p(r−1)/2
yi∈Fp
e y1 + · · · + yr p
the conductor is bounded in terms of r only;
SLIDE 11 Examples
◮ (Characters) K(x) = e(f (x)/p) or K(x) = χ(f (x)), where χ = 1 is
a multiplicative character, and f ∈ Fp[X] is non-constant; the conductor is bounded in terms of deg(f ) only;
◮ (Hyper)-Kloosterman sums: for r ≥ 1 integer
K(x) = Klr(x) = 1 p(r−1)/2
yi∈Fp
e y1 + · · · + yr p
the conductor is bounded in terms of r only;
◮ (Point-counting)
K(x) =
f (y)=x
1 − 1, f ∈ Fp[X] non-constant. the conductor is bounded in terms of deg(f ) only.
SLIDE 12
These functions occur in many applications in analytic number theory.
SLIDE 13 These functions occur in many applications in analytic number theory. Most often, one needs estimates for the “generalized exponential sums” of the type
K(x),
SLIDE 14 These functions occur in many applications in analytic number theory. Most often, one needs estimates for the “generalized exponential sums” of the type
K(x),
- r more naturally for inner products
- x∈Fp
K1(x)K2(x).
- f trace functions K1 and K2.
SLIDE 15 Goals
◮ Square-root cancellation:
K1(x)K2(x)
where C is under control (depends only on the complexity of K1 and K2);
SLIDE 16 Goals
◮ Square-root cancellation:
K1(x)K2(x)
where C is under control (depends only on the complexity of K1 and K2);
◮ Or understanding when this does not hold (“diagonal
situations”), e.g., K1(x) = K2(x).
SLIDE 17
These goals can often be reached, by exploiting the features of the underlying algebraic geometry:
SLIDE 18 These goals can often be reached, by exploiting the features of the underlying algebraic geometry:
◮ There is a powerful and very flexible formalism for trace
functions, including:
- 1. Stability under algebraic operations;
- 2. Stability under Fourier transform, convolution(s), etc;
- 3. The Grothendieck-Lefschetz trace formula
SLIDE 19 These goals can often be reached, by exploiting the features of the underlying algebraic geometry:
◮ There is a powerful and very flexible formalism for trace
functions, including:
- 1. Stability under algebraic operations;
- 2. Stability under Fourier transform, convolution(s), etc;
- 3. The Grothendieck-Lefschetz trace formula
◮ This formalism is compatible with the complexity: operations
- n trace functions with bounded complexity result in other
trace functions with bounded complexity;
SLIDE 20 These goals can often be reached, by exploiting the features of the underlying algebraic geometry:
◮ There is a powerful and very flexible formalism for trace
functions, including:
- 1. Stability under algebraic operations;
- 2. Stability under Fourier transform, convolution(s), etc;
- 3. The Grothendieck-Lefschetz trace formula
◮ This formalism is compatible with the complexity: operations
- n trace functions with bounded complexity result in other
trace functions with bounded complexity;
◮ And we have the general form of Deligne’s Riemann
Hypothesis over finite fields.
SLIDE 21 A version of the Riemann Hypothesis
Theorem (Quasi-orthogonality)
◮ Suppose K1 and K2 are trace functions modulo p associated
to geometrically irreducible sheaves F1, F2. Then
K1(x)K2(x)
where C depends only on c(K1), c(K2), unless F1 and F2 are geometrically isomorphic.
SLIDE 22 A version of the Riemann Hypothesis
Theorem (Quasi-orthogonality)
◮ Suppose K1 and K2 are trace functions modulo p associated
to geometrically irreducible sheaves F1, F2. Then
K1(x)K2(x)
where C depends only on c(K1), c(K2), unless F1 and F2 are geometrically isomorphic.
◮ In this “diagonal” case, there exists α with |α| = 1 such that
K1(x) = αK2(x) and
K1(x)K2(x) − ¯ αp
SLIDE 23 Examples
◮ (Weil-Deligne bounds)
|Klr(x)| = p−(r−1)/2
e y1 + · · · + yr p
SLIDE 24 Examples
◮ (Weil-Deligne bounds)
|Klr(x)| = p−(r−1)/2
e y1 + · · · + yr p
◮ (A “non-bound”) For
K(x) =
P2(Kl2(y2))e xy p
P2(X) = X 2 − 1, we have
γ·x=∞
K(x)K(γ · x)
if γ ∈ PGL2(Fp) is γ = Id, −1 1
16 1
4 −16 1 4
SLIDE 25
Philosophy
◮ The Riemann Hypothesis can be used as a black box in many
applications, using known examples of trace functions and their properties;
SLIDE 26
Philosophy
◮ The Riemann Hypothesis can be used as a black box in many
applications, using known examples of trace functions and their properties;
◮ But the more one knows, the better (for instance, to identify
geometrically irreducible trace functions);
SLIDE 27
Philosophy
◮ The Riemann Hypothesis can be used as a black box in many
applications, using known examples of trace functions and their properties;
◮ But the more one knows, the better (for instance, to identify
geometrically irreducible trace functions);
◮ This talk will attempt to explain, in a specific context, how to
make the box slightly less dark.
SLIDE 28 Sums of products
We often find in applications that we need to bound sums like
K1(x) · · · Kn(x)M(x) where Ki, 1 ≤ i ≤ n, are trace functions, as well as M, and often M(x) = 1 or M(x) = e(hx/p) for some h ∈ Fp.
SLIDE 29 Sums of products
We often find in applications that we need to bound sums like
K1(x) · · · Kn(x)M(x) where Ki, 1 ≤ i ≤ n, are trace functions, as well as M, and often M(x) = 1 or M(x) = e(hx/p) for some h ∈ Fp. In particular, one often has Ki(x) = K(aix + bi) for some other fixed trace function K and ai ∈ F×
p , bi ∈ Fp. The
(ai, bi) are not necessarily distinct.
SLIDE 30
Examples
◮ Proof of the Burgess bound: k even,
Ki(x) = χ(x + bi), M(x) = 1.
SLIDE 31
Examples
◮ Proof of the Burgess bound: k even,
Ki(x) = χ(x + bi), M(x) = 1.
◮ (Friedlander–Iwaniec, Heath-Brown, Michel, Zhang, FKM) For
d3 in arithmetic progressions, n = 2 and K1(x) = Kl3(a1x), K2(x) = Kl3(a2x), M(x) = e(hx/p).
SLIDE 32 Examples
◮ Proof of the Burgess bound: k even,
Ki(x) = χ(x + bi), M(x) = 1.
◮ (Friedlander–Iwaniec, Heath-Brown, Michel, Zhang, FKM) For
d3 in arithmetic progressions, n = 2 and K1(x) = Kl3(a1x), K2(x) = Kl3(a2x), M(x) = e(hx/p).
◮ (Fouvry-Michel-Rivat-Sark¨
- zy, FGKM, KR, Irving): n ≥ 1, and
Ki(x) = Klr(aix + bi), M(x) = 1 or e(hx/p).
SLIDE 33 Examples
◮ Proof of the Burgess bound: k even,
Ki(x) = χ(x + bi), M(x) = 1.
◮ (Friedlander–Iwaniec, Heath-Brown, Michel, Zhang, FKM) For
d3 in arithmetic progressions, n = 2 and K1(x) = Kl3(a1x), K2(x) = Kl3(a2x), M(x) = e(hx/p).
◮ (Fouvry-Michel-Rivat-Sark¨
- zy, FGKM, KR, Irving): n ≥ 1, and
Ki(x) = Klr(aix + bi), M(x) = 1 or e(hx/p).
◮ Other examples: Fouvry–Iwaniec, Bombieri–Bourgain,
Blomer–Milicevic...
SLIDE 34
Assumptions
◮ In general, the Ki are well-understood, and we assume that
they are geometrically irreducible (e.g., Klr(aix + bi)), and are “given”;
SLIDE 35
Assumptions
◮ In general, the Ki are well-understood, and we assume that
they are geometrically irreducible (e.g., Klr(aix + bi)), and are “given”;
◮ We assume also that M is geometrically irreducible, but it
might not be known very explicitly.
SLIDE 36 More precise goal
We wish to classify the “diagonal” cases: for which M does an estimate
K1(x) · · · Kn(x)M(x)
fail, with C depending only on max(c(Ki), c(M))?
SLIDE 37 More precise goal
We wish to classify the “diagonal” cases: for which M does an estimate
K1(x) · · · Kn(x)M(x)
fail, with C depending only on max(c(Ki), c(M))? Main difficulty. For n ≥ 2, K1 · · · Kn has no reason to be geometrically irreducible. Thus quasi-orthogonality can not be applied directly.
SLIDE 38 Principle of the method
◮ Each trace function Ki is a restriction to a set of Frobenius
conjugacy classes of the character of a finite-dimensional representation of some group Π1: there exist ρi : Π1 − → GL(Vi) such that Ki(x) = ι
SLIDE 39 Principle of the method
◮ Each trace function Ki is a restriction to a set of Frobenius
conjugacy classes of the character of a finite-dimensional representation of some group Π1: there exist ρi : Π1 − → GL(Vi) such that Ki(x) = ι
◮ Consider the direct sum
ρ = ρ1 ⊕ · · · ⊕ ρn : Π1 − → GL(V1 ⊕ · · · ⊕ Vn) and the “external” tensor product π : GL(V1 ⊕ · · · ⊕ Vn) − → GL(V1 ⊗ · · · ⊗ Vn).
SLIDE 40
(cont.)
◮ Then
K1(x) · · · Kn(x) = Tr((π ◦ ρ)(Frx,Fp)).
SLIDE 41 (cont.)
◮ Then
K1(x) · · · Kn(x) = Tr((π ◦ ρ)(Frx,Fp)).
◮ Intuitively, we know1 (Deligne’s Equidistribution Theorem)
that this means that the product is distributed like the trace
- f a “random” matrix in a maximal compact subgroup U of
the Zariski-closure G of the image of ρ.
1With a minor caveat
SLIDE 42 (cont.)
◮ Then
K1(x) · · · Kn(x) = Tr((π ◦ ρ)(Frx,Fp)).
◮ Intuitively, we know1 (Deligne’s Equidistribution Theorem)
that this means that the product is distributed like the trace
- f a “random” matrix in a maximal compact subgroup U of
the Zariski-closure G of the image of ρ.
◮ This means that (case M = 1) we have
1 p
K1(x) · · · Kn(x) =
Tr(x)dµHaar(x) + O(p−1/2).
1With a minor caveat
SLIDE 43 (cont.)
◮ Then
K1(x) · · · Kn(x) = Tr((π ◦ ρ)(Frx,Fp)).
◮ Intuitively, we know1 (Deligne’s Equidistribution Theorem)
that this means that the product is distributed like the trace
- f a “random” matrix in a maximal compact subgroup U of
the Zariski-closure G of the image of ρ.
◮ This means that (case M = 1) we have
1 p
K1(x) · · · Kn(x) =
Tr(x)dµHaar(x) + O(p−1/2).
◮ So square-root cancellation means exactly that the “main
term” vanishes...
1With a minor caveat
SLIDE 44
(cont.)
◮ ... which means that the trivial representation is not a
component of the “tautological” representation of U on V1 ⊕ · · · ⊕ Vn.
SLIDE 45
(cont.)
◮ ... which means that the trivial representation is not a
component of the “tautological” representation of U on V1 ⊕ · · · ⊕ Vn.
◮ A priori, G (resp. U) is a subgroup of the product of the Gi
(resp. Ui) defined similarly from ρi.
SLIDE 46 (cont.)
◮ ... which means that the trivial representation is not a
component of the “tautological” representation of U on V1 ⊕ · · · ⊕ Vn.
◮ A priori, G (resp. U) is a subgroup of the product of the Gi
(resp. Ui) defined similarly from ρi.
◮ If it is so big that U = U1 × · · · × Un, then
Tr(x)dµHaar(x) =
Tr(x)dµHaar(x).
SLIDE 47 (cont.)
◮ ... which means that the trivial representation is not a
component of the “tautological” representation of U on V1 ⊕ · · · ⊕ Vn.
◮ A priori, G (resp. U) is a subgroup of the product of the Gi
(resp. Ui) defined similarly from ρi.
◮ If it is so big that U = U1 × · · · × Un, then
Tr(x)dµHaar(x) =
Tr(x)dµHaar(x).
◮ If we know that the ρi are irreducible and non-trivial, this is
zero.
SLIDE 48
Goursat-Kolchin-Ribet, d’apr` es Katz
How/when can we get such a “big” group?
SLIDE 49
Goursat-Kolchin-Ribet, d’apr` es Katz
How/when can we get such a “big” group?
◮ Basic information: the projections U −
→ Ui are surjective for each i;
SLIDE 50
Goursat-Kolchin-Ribet, d’apr` es Katz
How/when can we get such a “big” group?
◮ Basic information: the projections U −
→ Ui are surjective for each i;
◮ A “miracle”: complicated groups are very independent from
each other!
SLIDE 51 Goursat-Kolchin-Ribet, d’apr` es Katz
How/when can we get such a “big” group?
◮ Basic information: the projections U −
→ Ui are surjective for each i;
◮ A “miracle”: complicated groups are very independent from
each other!
◮ In particular, if Ui = SUdi(C), di ≥ 2, and the representations
ρi are pairwise non-isomorphic,2 then U is the product of the Ui;
2 More precisely, pairwise unrelated up to twists.
SLIDE 52 Goursat-Kolchin-Ribet, d’apr` es Katz
How/when can we get such a “big” group?
◮ Basic information: the projections U −
→ Ui are surjective for each i;
◮ A “miracle”: complicated groups are very independent from
each other!
◮ In particular, if Ui = SUdi(C), di ≥ 2, and the representations
ρi are pairwise non-isomorphic,2 then U is the product of the Ui;
◮ The same happens with USp2gi(C), or with mixtures, or with
quite a few other groups with simple Lie algebra.
2 More precisely, pairwise unrelated up to twists.
SLIDE 53
An example
This is already enough for many applications. For instance:
Theorem (Katz)
For r even and K(x) = Klr(ax + b), we have U = USpr(C), and the underlying sheaves when (a, b) ∈ F×
p × Fp vary are pairwise
non-isomorphic (even up to twist).
SLIDE 54 An example
This is already enough for many applications. For instance:
Theorem (Katz)
For r even and K(x) = Klr(ax + b), we have U = USpr(C), and the underlying sheaves when (a, b) ∈ F×
p × Fp vary are pairwise
non-isomorphic (even up to twist). It follows:
Corollary
If r is even, n ≥ 1 is fixed, and (ai, bi)1≤i≤n are distinct pairs in F×
p × Fp, then
Klr(a1x + b1) · · · Klr(anx + bn) ≪ p1/2.
SLIDE 55 Diagonal cases
In some applications, not all Ki are distinct. So we may need to consider
K1(x)ν1 · · · Kn(x)νn where the Ki are pairwise non-isomorphic and νi ≥ 1.
SLIDE 56 Diagonal cases
In some applications, not all Ki are distinct. So we may need to consider
K1(x)ν1 · · · Kn(x)νn where the Ki are pairwise non-isomorphic and νi ≥ 1. By the previous argument, at least if (K1, . . . , Kn) satisfy the same assumptions as before (large “complicated” monodromy), we get square root cancellation if and only if
Tr(x)νidµHaar(x) = 0.
SLIDE 57 If Ui = USp2gi(C), this means that at least some multiplicity νi is
SLIDE 58 If Ui = USp2gi(C), this means that at least some multiplicity νi is
If Ui = SUdi(C), this means that at least some multiplicity νi is not divisible by di.
SLIDE 59 If Ui = USp2gi(C), this means that at least some multiplicity νi is
If Ui = SUdi(C), this means that at least some multiplicity νi is not divisible by di. For instance, if r is even, we have
Klr(a1x + b1)ν1 · · · Klr(anx + bn)νn ≪ p1/2 unless each νi is even.
SLIDE 60
A comparison
Take Ki(x) = e((aix)−1/p) (inverse modulo p) for distinct ai’s.
SLIDE 61 A comparison
Take Ki(x) = e((aix)−1/p) (inverse modulo p) for distinct ai’s. Then the sum
p
K1(x) · · · Kn(x) has no cancellation for all (a1, . . . , an) such that 1 a1 + · · · + 1 an = 0.
SLIDE 62 When M is non-trivial
Now take M any geometrically irreducible trace function and consider
K1(x)ν1 · · · Kn(x)νnM(x).
SLIDE 63 When M is non-trivial
Now take M any geometrically irreducible trace function and consider
K1(x)ν1 · · · Kn(x)νnM(x). If there is no square-root cancellation then:
SLIDE 64 When M is non-trivial
Now take M any geometrically irreducible trace function and consider
K1(x)ν1 · · · Kn(x)νnM(x). If there is no square-root cancellation then:
◮ M must correspond to a representation of Π1 that factors
through ρ;
SLIDE 65 When M is non-trivial
Now take M any geometrically irreducible trace function and consider
K1(x)ν1 · · · Kn(x)νnM(x). If there is no square-root cancellation then:
◮ M must correspond to a representation of Π1 that factors
through ρ;
◮ If U is the product of the Ui, this means that
M = L1(x) · · · Ln(x) where
- 1. Ln(x) is associated to a representation Λi ◦ ρi of Π1;
- 2. For each i, Λi is an irreducible subrepresentation of the νi-th
tensor power of the standard representations of Ui.
SLIDE 66 For example, we have
Kl2(a1x + b1)ν1 · · · Kl2(anx + bn)νnM(x) ≪ p1/2 for M geometrically irreducible if and only M is not of the form M(x) =
P2mi(Kl2(aix + bi)) where Pd is a Chebychev polynomial.
SLIDE 67 An application (FGKM, KR)
Let k ≥ 2 be an integer, p a prime, (a, p) = 1, a real number X ≥ 2, and w a test function w : [0, +∞[− → [0, 1] with w(x) ≥ 0, w = 0. Let E(X; p, a) =
n≡a (mod p)
dk(n) − 1 p − 1
dk(n).
SLIDE 68
Theorem
If X = pk/Φ(p) with Φ(x) ↑ +∞, Φ(x) ≪ xε, then a → E(X; p, a) is approximately normally distributed, if:
SLIDE 69
Theorem
If X = pk/Φ(p) with Φ(x) ↑ +∞, Φ(x) ≪ xε, then a → E(X; p, a) is approximately normally distributed, if:
◮ k = 2, a ∈ F× p (FGKM)
SLIDE 70
Theorem
If X = pk/Φ(p) with Φ(x) ↑ +∞, Φ(x) ≪ xε, then a → E(X; p, a) is approximately normally distributed, if:
◮ k = 2, a ∈ F× p (FGKM) ◮ k ≥ 3, a ∈ F× p (KR)
SLIDE 71 Theorem
If X = pk/Φ(p) with Φ(x) ↑ +∞, Φ(x) ≪ xε, then a → E(X; p, a) is approximately normally distributed, if:
◮ k = 2, a ∈ F× p (FGKM) ◮ k ≥ 3, a ∈ F× p (KR) ◮ k ≥ 3, and either
- 1. a ∈ I, I interval of length p1/2+δ, δ > 0
- 2. a ∈ f (Fp) where f ∈ Z[X] is a fixed non-constant polynomial
(KR).
SLIDE 72 Link with trace functions
Assume one considers a ∈ Xp where Xp ⊂ Fp. Computing the n-th moment using the Voronoi summation formula, one ends up dealing with sums S(a1, . . . , an) =
Klk(a1x) · · · Klk(anx)M(x) for (a1, . . . , an) ∈ F×
p , M one of the trace functions arising in a
decomposition 1Xp(x) =
αjMj(x), M1(x) = |Xp| p .
SLIDE 73
The key point is that M can not be “diagonal” for too many tuples (a1, . . . , an):
SLIDE 74
The key point is that M can not be “diagonal” for too many tuples (a1, . . . , an): if M is not geometrically trivial, then the number of a ∈ (F×
p )n for which there is no square-root cancellation is
≪ p(n−1)/2 where the implied constant depends only on r.
SLIDE 75
The key point is that M can not be “diagonal” for too many tuples (a1, . . . , an): if M is not geometrically trivial, then the number of a ∈ (F×
p )n for which there is no square-root cancellation is
≪ p(n−1)/2 where the implied constant depends only on r. In contrast, for M = 1 and n even, all (p − 1)n/2 tuples (a1, a1, . . . , an/2, an/2) contribute to the main term.
SLIDE 76
What if the Ki are not pairwise distinct?
One needs some fun algebraic facts. For instance: let H ⊂ G be a subgroup of a group G, ξ ∈ G. Then we have ξHξ ⊂ H if and only if ξ ∈ NG(H) and ξ2 ∈ H.