SLIDE 1
The WZ method and zeta function identities Ira M. Gessel Department - - PowerPoint PPT Presentation
The WZ method and zeta function identities Ira M. Gessel Department - - PowerPoint PPT Presentation
The WZ method and zeta function identities Ira M. Gessel Department of Mathematics Brandeis University American Mathematical Society 2010 Fall Eastern Sectional Meeting Syracuse University October 3, 2010 There are several interesting
SLIDE 2
SLIDE 3
There are several interesting formulas that give quickly converging series for generating functions for the zeta function, due to Almkvist, Borwein, Bradley, Granville, Koecher, Leshchiner, and Rivoal. For example, Koecher’s formula is
∞
- k=0
ζ(2k+3)x2k =
∞
- n=1
(−1)n+1 n32n
n
- 1
2 + 2 1 − x2/n2 n−1
- k=1
- 1 − x2
k2
- .
SLIDE 4
Khodabakhsh and Tatiana Hessami Pilehrood used the WZ method of Wilf and Zeilberger to prove some of these formulas.
SLIDE 5
Khodabakhsh and Tatiana Hessami Pilehrood used the WZ method of Wilf and Zeilberger to prove some of these formulas. I will give a brief explanation of the WZ method and I will explain how the WZ proofs of these zeta function identities are closely related to classical hypergeometric series identities, and how this connection allows us to generalize them.
SLIDE 6
The WZ method
We consider a grid graph with a weight function defined on every directed edge with the following property: ◮ For all vertices A and B, all paths from A to B have the same
- weight. (Path-invariance)
2 2 2 2 2 1 1 1 1 1 1 1 1 1 2 4
SLIDE 7
This property is equivalent to the existence of a potential function defined on the vertices: the weight of an edge is the difference in the potential of its endpoints:
4 5 7 6 8 3 1 5 6 3 2
2 2 2 2 2 1 1 1 1 1 1 1 1 1 2 4
SLIDE 8
Some observations
◮ To check that a weighted grid graph has the path invariance property, it is sufficient to check it on each rectangle:
a c b d a + b = c + d
SLIDE 9
Some observations
◮ The weights of a directed edge and its reversal are negatives
- f each other.
SLIDE 10
Some observations
◮ We can add arbitrary other edges to the graph, and they will have uniquely determined weights that satisfy the path- invariance property.
2 2 2 2 2 1 1 1 1 1 1 1 1 1 2 4
SLIDE 11
In particular, from one path-invariant weighted grid graph, we can get many more by taking a different grid on the same set of points:
SLIDE 12
In particular, from one path-invariant weighted grid graph, we can get many more by taking a different grid on the same set of points:
SLIDE 13
In particular, from one path-invariant weighted grid graph, we can get many more by taking a different grid on the same set of points:
SLIDE 14
In particular, from one path-invariant weighted grid graph, we can get many more by taking a different grid on the same set of points:
SLIDE 15
The WZ method
Suppose that we have a a path-invariant weighted grid graph, where the vertices are in Z × Z. Let f(i, j) be the weight on the edge from (i, j) to (i + 1, j) and let g(i, j) be the weight on the edge from (i, j) to (i, j + 1):
f(i, j + 1) f(i, j) g(i, j) g(i + 1, j)
(i, j)
SLIDE 16
The WZ method
Suppose that we have a a path-invariant weighted grid graph, where the vertices are in Z × Z. Let f(i, j) be the weight on the edge from (i, j) to (i + 1, j) and let g(i, j) be the weight on the edge from (i, j) to (i, j + 1):
f(i, j + 1) f(i, j) g(i, j) g(i + 1, j)
(i, j)
The path invariance property is f(i, j) + g(i + 1, j) = g(i, j) + f(i, j + 1)
SLIDE 17
The WZ method
Suppose that we have a a path-invariant weighted grid graph, where the vertices are in Z × Z. Let f(i, j) be the weight on the edge from (i, j) to (i + 1, j) and let g(i, j) be the weight on the edge from (i, j) to (i, j + 1):
f(i, j + 1) f(i, j) g(i, j) g(i + 1, j)
(i, j)
The path invariance property is f(i, j) + g(i + 1, j) = g(i, j) + f(i, j + 1) Note: Traditionally the order of the parameters is switched.
SLIDE 18
A pair of functions (f, g) satisfying this identity is called a WZ-pair. For our purposes, we want to assume that f and g have a particular form: we want each to be of the form Γ(a1i + b1j + u1) · · · Γ(aki + bkj + uk) Γ(c1i + d1j + v1) · · · Γ(cli + dlj + vl) ziwj where the an, bn, cn, and dn are integers, and the un, vn, z, and w are complex numbers.
SLIDE 19
A pair of functions (f, g) satisfying this identity is called a WZ-pair. For our purposes, we want to assume that f and g have a particular form: we want each to be of the form Γ(a1i + b1j + u1) · · · Γ(aki + bkj + uk) Γ(c1i + d1j + v1) · · · Γ(cli + dlj + vl) ziwj where the an, bn, cn, and dn are integers, and the un, vn, z, and w are complex numbers. This implies that f(i + 1, j)/f(i, j), f(i, j + 1)/f(i, j), and f(i, j)/g(i, j) are rational functions.
SLIDE 20
A pair of functions (f, g) satisfying this identity is called a WZ-pair. For our purposes, we want to assume that f and g have a particular form: we want each to be of the form Γ(a1i + b1j + u1) · · · Γ(aki + bkj + uk) Γ(c1i + d1j + v1) · · · Γ(cli + dlj + vl) ziwj where the an, bn, cn, and dn are integers, and the un, vn, z, and w are complex numbers. This implies that f(i + 1, j)/f(i, j), f(i, j + 1)/f(i, j), and f(i, j)/g(i, j) are rational functions. Note: We will want to allow i and j to be complex numbers.
SLIDE 21
We illustrate our approach with the simplest interesting WZ pair, which is associated with the binomial theorem. One form
- f this WZ pair is
f(i, j) = Γ(i + j) Γ(i + 1)Γ(j)xi(1 − x)j = i + j − 1 i
- xi(1 − x)j
g(i, j) = − Γ(i + j) Γ(i)Γ(j + 1)xi(1 − x)j = − i + j − 1 i − 1
- xi(1 − x)j
Let’s first see the connection between this WZ pair and the binomial theorem.
SLIDE 22
We sum along the two paths P1 and P2:
(m, 0) (0, n) (m, n) (0, 0) P1 P2
Summing along P1 gives
m−1
- i=0
f(i, 0) +
n−1
- j=0
g(m, j) = 1 −
n−1
- j=0
m + j − 1 i − 1
- xm(1 − x)j.
and summing along P2 gives
n−1
- j=0
g(0, j) +
m−1
- i=0
f(i, n) = 0 +
m−1
- i=0
i + n − 1 i
- xi(1 − x)n.
SLIDE 23
So we have the identity 1 −
n−1
- j=0
m + j − 1 i − 1
- xm(1 − x)j =
m−1
- i=0
i + n − 1 i
- xi(1 − x)n.
Taking the limit as m → ∞ we get the binomial theorem in the form 1 =
∞
- i=0
i + n − 1 i
- xi(1 − x)n.
SLIDE 24
So we have the identity 1 −
n−1
- j=0
m + j − 1 i − 1
- xm(1 − x)j =
m−1
- i=0
i + n − 1 i
- xi(1 − x)n.
Taking the limit as m → ∞ we get the binomial theorem in the form 1 =
∞
- i=0
i + n − 1 i
- xi(1 − x)n.
Conversely, starting with the binomial theorem in this form, it’s easy to find f and g, using Gosper’s algorithm.
SLIDE 25
We can use this WZ pair to obtain identities for the sum L(x, α) =
∞
- i=0
α α + i xi, which can be expressed in terms of the Lerch transcendent as α Φ(x, 1, α).
SLIDE 26
We can use this WZ pair to obtain identities for the sum L(x, α) =
∞
- i=0
α α + i xi, which can be expressed in terms of the Lerch transcendent as α Φ(x, 1, α). Note that L(x, 1) = −x−1 log(1 − x) and L(x, 1/2) = x−1/2 tanh−1 √ x.
SLIDE 27
The sum L(x, α) is a generating function for the polygorithm Lis(x) =
∞
- i=1
xi is which reduces to the zeta function for x = 1.
SLIDE 28
The sum L(x, α) is a generating function for the polygorithm Lis(x) =
∞
- i=1
xi is which reduces to the zeta function for x = 1. We have L(x, α) =
∞
- i=0
α α + i xi = 1 + α
∞
- i=1
1 i xi 1 + α/i = 1 +
∞
- m=0
(−1)mαm+1
∞
- i=1
xi im+1 = 1 + 1 x
∞
- m=1
(−1)m−1αm Lim(x).
SLIDE 29
Let’s look at f(i, j) = Γ(i + j) Γ(i + 1)Γ(j)xi(1 − x)j. We want to modify f and g so that ∞
i=0 f(i, 0) becomes
something like ∞
i=0 αxi/(α + i).
SLIDE 30
Let’s look at f(i, j) = Γ(i + j) Γ(i + 1)Γ(j)xi(1 − x)j. We want to modify f and g so that ∞
i=0 f(i, 0) becomes
something like ∞
i=0 αxi/(α + i). If we replace i with α + i in
f(i, j), then we get f(α + i, j) = Γ(α + i + j) Γ(α + i + 1)xi · xα(1 − x)j Γ(j)
SLIDE 31
Let’s look at f(i, j) = Γ(i + j) Γ(i + 1)Γ(j)xi(1 − x)j. We want to modify f and g so that ∞
i=0 f(i, 0) becomes
something like ∞
i=0 αxi/(α + i). If we replace i with α + i in
f(i, j), then we get f(α + i, j) = Γ(α + i + j) Γ(α + i + 1)xi · xα(1 − x)j Γ(j) Now if we set j = 0, we get f(α + i, 0) = α α + i xi · xα α Γ(0) which is just what we want except for a constant factor
SLIDE 32
Let’s look at f(i, j) = Γ(i + j) Γ(i + 1)Γ(j)xi(1 − x)j. We want to modify f and g so that ∞
i=0 f(i, 0) becomes
something like ∞
i=0 αxi/(α + i). If we replace i with α + i in
f(i, j), then we get f(α + i, j) = Γ(α + i + j) Γ(α + i + 1)xi · xα(1 − x)j Γ(j) Now if we set j = 0, we get f(α + i, 0) = α α + i xi · xα α Γ(0) which is just what we want except for a constant factor which is unfortunately 0.
SLIDE 33
Clearly we can multiply a WZ pair by a constant.
SLIDE 34
Clearly we can multiply a WZ pair by a constant. Can we replace i with α + i? What about the Γ(0)?
SLIDE 35
Clearly we can multiply a WZ pair by a constant. Can we replace i with α + i? What about the Γ(0)? The first question is easy:
- Lemma. If
- f(i, j), g(i, j)
- is a WZ pair then so is
- f(α + i, β + j), g(α + i, β + j)
- for any α and β.
SLIDE 36
To deal with the Γ(0) we need to do a little more. We observe that
- f(i, j), g(i, j)
- is a WZ pair, then so is
- p(j)f(i, j), p(j)g(i, j)
- ,
where p(j) is a function that is periodic with period 1. We take p(j) to be (−1)jΓ(j)Γ(1 − j) = (−1)j π sin πj .
SLIDE 37
To deal with the Γ(0) we need to do a little more. We observe that
- f(i, j), g(i, j)
- is a WZ pair, then so is
- p(j)f(i, j), p(j)g(i, j)
- ,
where p(j) is a function that is periodic with period 1. We take p(j) to be (−1)jΓ(j)Γ(1 − j) = (−1)j π sin πj . This is what Wilf and Zeilberger call shadowing.
SLIDE 38
So (after multiplying f and g by αx−α) we get a modified WZ pair f(i, j) = αΓ(α + i + j)Γ(1 − j) Γ(α + i + 1) xi(x − 1)j g(i, j) = αΓ(α + i + j)Γ(−j) Γ(α + i) xi(x − 1)j and we have
∞
- i=0
f(i, 0) =
∞
- i=0
α α + i xi.
SLIDE 39
To transform this sum, we consider three paths:
(m, 0) (0, 0)
The sum along the black path is equal to the sum along the red path plus the sum along the green path. We take the limit as m → ∞ and we hope that the green path goes to 0 and the red path gives us something interesting.
SLIDE 40
More precisely, we pick positive integers s and t and define h(k) to be the weight of a path from (sk, −tk) to (s(k + 1), −t(k + 1)).
(0, 0) (3s, −3t) (2s, −2t) (s, −t) h(0) h(1) h(2)
SLIDE 41
More precisely, we pick positive integers s and t and define h(k) to be the weight of a path from (sk, −tk) to (s(k + 1), −t(k + 1)).
(0, 0) (3s, −3t) (2s, −2t) (s, −t) h(0) h(1) h(2)
It’s not hard to see that as m → ∞ the sum along the green path goes to 0, so we have the identity
∞
- i=0
α α + i xi =
∞
- k=0
h(k)
SLIDE 42
What is h(k)? It’s the weight of any path from (sk, tk) to (s(k + 1), t(k + 1)). So if we take the path that goes down first and then to the right
(sk, −tk) (sk, −t(k + 1)) (s(k + 1), −t(k + 1))
we get h(k) = −
−tk−1
- j=−t(k+1)
g(sk, j) +
s(k+1)−1
- i=sk
f(i, −t(k + 1))
SLIDE 43
It’s not hard to see that h(k) will be a rational function of α, x, and k times (tk)! (α)(s−t)k (α + 1)sk (−1)txs (1 − x)t k , where (u)m is the rising factorial, (u)m = u(u + 1) · · · (u + m − 1) if m ≥ 0 (−1)m (1 − u)m if m < 0.
SLIDE 44
The simplest cases are
∞
- i=0
α α + i xi =
∞
- k=0
k! (α + 1)k (−x)k (1 − x)k+1 (s = 1, t = 1) =
∞
- k=0
(1 + k − α + xk + xα) × (2k)! (α + 1)k(1 − α)k+1 (−x)k (1 − x)2k+2 (s = 1, t = 2) =
∞
- k=0
(1 + 2k + α − x − xk) × k! (α)k (α + 1)2k+1 (−x2)k (1 − x)k+1 (s = 2, t = 1)
SLIDE 45
The simplest cases are
∞
- i=0
α α + i xi =
∞
- k=0
k! (α + 1)k (−x)k (1 − x)k+1 (s = 1, t = 1) =
∞
- k=0
(1 + k − α + xk + xα) × (2k)! (α + 1)k(1 − α)k+1 (−x)k (1 − x)2k+2 (s = 1, t = 2) =
∞
- k=0
(1 + 2k + α − x − xk) × k! (α)k (α + 1)2k+1 (−x2)k (1 − x)k+1 (s = 2, t = 1) Note: The first formula is a 2F1 linear transformation.
SLIDE 46
For α = 1 the first formula gives − log(1 − x) =
∞
- k=1
1 k
- x
1 − x k = log
- 1 +
x 1 − x
SLIDE 47
For α = 1 the first formula gives − log(1 − x) =
∞
- k=1
1 k
- x
1 − x k = log
- 1 +
x 1 − x
- In the second formula, we can’t directly set α = 1 because of
the 1 − α in the denominator. But by taking an appropriate limit, we can get − log(1 − x) =
∞
- k=1
- k2x + k2 − 3k + 1)(2k − 2)!
k!2
- −
x (1 − x)2 k .
SLIDE 48
The third formula gives − log(1 − x) = (x − 2)
∞
- k=1
(k − 1)! k! (2k)! (−1)k x2k−1 (1 − x)k .
SLIDE 49
The third formula gives − log(1 − x) = (x − 2)
∞
- k=1
(k − 1)! k! (2k)! (−1)k x2k−1 (1 − x)k . This is equivalent to the known formula sin−1 x √ 1 − x2 = x
∞
- k=0
k!2 (2k + 1)!(2x)2k.
SLIDE 50
For α = 1/2 the first formula gives tanh−1 x = 2x 1 − x
∞
- k=0
k! (k + 1)! (2k + 2)!
- − 4x2
1 − x2 k , which is also equivalent to sin−1 x √ 1 − x2 = x
∞
- k=0
k!2 (2k + 1)!(2x)2k.
SLIDE 51
For α = 1/2 the first formula gives tanh−1 x = 2x 1 − x
∞
- k=0
k! (k + 1)! (2k + 2)!
- − 4x2
1 − x2 k , which is also equivalent to sin−1 x √ 1 − x2 = x
∞
- k=0
k!2 (2k + 1)!(2x)2k. The second formula gives another formula equivalent to this.
SLIDE 52
The third formula gives tanh−1 x = 4x 1 − x2
∞
- k=0
(3 + 4k − 2(1 + k)x2) × (2k)! (2k + 2)! (4k + 4)!
- − 4x4
1 − x2 k .
SLIDE 53
The simplest interesting polylogarithm formula is the dilogarithm formula obtained from the second transformation formula Li2(x) =
∞
- i=1
xi i2 = x 1 − x +
∞
- k=1
(−1)k (1 + 2k − (k + 1)x)(H2k+1 − Hk−1) − 1
- × (k − 1)! k!
(2k + 1)! x2k (1 − x)k+1 where Hn is the harmonic number 1 + 1/2 + · · · + 1/n.
SLIDE 54
The simplest interesting polylogarithm formula is the dilogarithm formula obtained from the second transformation formula Li2(x) =
∞
- i=1
xi i2 = x 1 − x +
∞
- k=1
(−1)k (1 + 2k − (k + 1)x)(H2k+1 − Hk−1) − 1
- × (k − 1)! k!
(2k + 1)! x2k (1 − x)k+1 where Hn is the harmonic number 1 + 1/2 + · · · + 1/n. We can’t set x = 1, but we can set x = −1, to get a formula for ζ(2).
SLIDE 55
Li2(−1) = −1
2ζ(2) = −π2
12 = −1 2 +
∞
- k=1
(−1)k (2 + 3k)(H2k+1 − Hk−1) − 1
- × (k − 1)! k!
(2k + 1)! 1 2k+1
SLIDE 56
What do we do with a more complicated identity? Let’s look at the Bailey-Borwein-Bradley formula
∞
- n=0
ζ(2n + 1)x2n =
∞
- k=1
1 k2 − x2
SLIDE 57
What do we do with a more complicated identity? Let’s look at the Bailey-Borwein-Bradley formula
∞
- n=0
ζ(2n + 1)x2n =
∞
- k=1
1 k2 − x2 = 3
∞
- k=1
1 2k
k
- (k2 − x2)
k−1
- m=1
4x2 − m2 x2 − m2
SLIDE 58
What do we do with a more complicated identity? Let’s look at the Bailey-Borwein-Bradley formula
∞
- n=0
ζ(2n + 1)x2n =
∞
- k=1
1 k2 − x2 = 3
∞
- k=1
1 2k
k
- (k2 − x2)
k−1
- m=1
4x2 − m2 x2 − m2 = 3/2 1 − x2
∞
- k=0
(2)k(1 − 2x)k(1 + 2x)k (3
2)k(2 − x)k(2 + x)k
1 4 k .
SLIDE 59
What do we do with a more complicated identity? Let’s look at the Bailey-Borwein-Bradley formula
∞
- n=0
ζ(2n + 1)x2n =
∞
- k=1
1 k2 − x2 = 3
∞
- k=1
1 2k
k
- (k2 − x2)
k−1
- m=1
4x2 − m2 x2 − m2 = 3/2 1 − x2
∞
- k=0
(2)k(1 − 2x)k(1 + 2x)k (3
2)k(2 − x)k(2 + x)k
1 4 k . This identity is associated with a WZ pair that corresponds to Gauss’s theorem (the nonterminating form of Vandermonde’s theorem).
SLIDE 60
Gauss’s theorem is
∞
- k=0
(a)k(b)k k! (c)k = Γ(c)Γ(c − a − b) Γ(c − a)Γ(c − b). We want to transform the summand of Gauss’s theorem into a constant times 1 (k + 1)2 − x2 = (1 + x)k(1 − x)k (1 − x2)(2 + x)k(2 − x)k . If we replace k by 1 + x + k in (a)k(b)k/k! (c)k and then set a = 0, b = −2x, and c = 1 − 2x, we get − 2x Γ(0)(1 − x2) · (1 + x)k(1 − x)k (2 + x)k(2 − x)k
SLIDE 61
Gauss’s theorem is
∞
- k=0
(a)k(b)k k! (c)k = Γ(c)Γ(c − a − b) Γ(c − a)Γ(c − b). We want to transform the summand of Gauss’s theorem into a constant times 1 (k + 1)2 − x2 = (1 + x)k(1 − x)k (1 − x2)(2 + x)k(2 − x)k . If we replace k by 1 + x + k in (a)k(b)k/k! (c)k and then set a = 0, b = −2x, and c = 1 − 2x, we get − 2x Γ(0)(1 − x2) · (1 + x)k(1 − x)k (2 + x)k(2 − x)k By shadowing as before we can get rid of the Γ(0) in the denominator.
SLIDE 62
But in trying to find a WZ pair we have a problem: Gauss’s theorem has four parameters (the summation index and three
- thers) but to get a WZ pair all we need is the summation index
and one parameter. How do we know which one to use?
SLIDE 63
But in trying to find a WZ pair we have a problem: Gauss’s theorem has four parameters (the summation index and three
- thers) but to get a WZ pair all we need is the summation index
and one parameter. How do we know which one to use? We use all of them, and instead of a WZ pair we get a WZ 4-tuple (or WZ form). This gives a path-invariant grid graph in Z4 rather than Z2. Everything works as before, except that we have more choices.
SLIDE 64
The WZ 4-tuple f =
- f1(i, j, k, l), . . . , f4(i, j, k, l)
- corresponding
to Gauss’s theorem is f = G i , G j , G k , G l
- ,
where G = (−1)k Γ(i + k) Γ(i + l) Γ(j + k) Γ(j + l) Γ(1 − k) Γ(i) Γ(j) Γ(l) Γ(i + j + k + l) . What this means is that if we put a weight of ft(i, j, k, l) on the edge from (i, j, k, l) to the adjacent point (i, j, k, l) + et, where et = (0, . . . , 1
t , . . . , 0), then we get a path-invariant grid graph.
SLIDE 65
The WZ 4-tuple f =
- f1(i, j, k, l), . . . , f4(i, j, k, l)
- corresponding
to Gauss’s theorem is f = G i , G j , G k , G l
- ,
where G = (−1)k Γ(i + k) Γ(i + l) Γ(j + k) Γ(j + l) Γ(1 − k) Γ(i) Γ(j) Γ(l) Γ(i + j + k + l) . What this means is that if we put a weight of ft(i, j, k, l) on the edge from (i, j, k, l) to the adjacent point (i, j, k, l) + et, where et = (0, . . . , 1
t , . . . , 0), then we get a path-invariant grid graph.
Note that i, j, k, and l don’t need to be integers.
SLIDE 66
Now if we sum f in the direction (1, 0, 0, 0) (i.e., parallel to the x1-axis) starting at the point (1 + x, 1, 0, −2x) we get −2x
∞
- k=0
1 (1 + k)2 − x2 = −2x
∞
- k=1
1 k2 − x2 .
SLIDE 67
Now if we sum f in the direction (1, 0, 0, 0) (i.e., parallel to the x1-axis) starting at the point (1 + x, 1, 0, −2x) we get −2x
∞
- k=0
1 (1 + k)2 − x2 = −2x
∞
- k=1
1 k2 − x2 . To transform the sum, we compute the value of the kth step starting at the same point and going in another direction. We look for something interesting and we have to check that the sum along the “connecting path" goes to 0.
SLIDE 68
Now if we sum f in the direction (1, 0, 0, 0) (i.e., parallel to the x1-axis) starting at the point (1 + x, 1, 0, −2x) we get −2x
∞
- k=0
1 (1 + k)2 − x2 = −2x
∞
- k=1
1 k2 − x2 . To transform the sum, we compute the value of the kth step starting at the same point and going in another direction. We look for something interesting and we have to check that the sum along the “connecting path" goes to 0. We find that the direction (1, 2, −1, −1) gives us what we want: the sum starting at (1 + x, 1, 0, −2x) gives − 3x 1 − x2
∞
- n=0
(2)n(1 − 2x)n(1 + 2x)n (3
2)n(2 − x)n(2 + x)n
1 4 n .
SLIDE 69
An advantage of this approach is that it gives us something more; if we start at the point (x, 1, 0, y − x) we get the more general formula
∞
- n=0
1 (n + x)(n + y) = 1 2xy
∞
- n=0