[PPT] - The WZ method and zeta function identities Ira M. Gessel Department PowerPoint Presentation

SLIDE 1

The WZ method and zeta function identities

Ira M. Gessel

Department of Mathematics Brandeis University

American Mathematical Society 2010 Fall Eastern Sectional Meeting Syracuse University October 3, 2010

SLIDE 2

There are several interesting formulas that give quickly converging series for generating functions for the zeta function, due to Almkvist, Borwein, Bradley, Granville, Koecher, Leshchiner, and Rivoal.

SLIDE 3

There are several interesting formulas that give quickly converging series for generating functions for the zeta function, due to Almkvist, Borwein, Bradley, Granville, Koecher, Leshchiner, and Rivoal. For example, Koecher’s formula is

∞

k=0

ζ(2k+3)x2k =

∞

n=1

(−1)n+1 n32n

n

1

2 + 2 1 − x2/n2 n−1

k=1
1 − x2

k2

.

SLIDE 4

Khodabakhsh and Tatiana Hessami Pilehrood used the WZ method of Wilf and Zeilberger to prove some of these formulas.

SLIDE 5

Khodabakhsh and Tatiana Hessami Pilehrood used the WZ method of Wilf and Zeilberger to prove some of these formulas. I will give a brief explanation of the WZ method and I will explain how the WZ proofs of these zeta function identities are closely related to classical hypergeometric series identities, and how this connection allows us to generalize them.

SLIDE 6

The WZ method

We consider a grid graph with a weight function defined on every directed edge with the following property: ◮ For all vertices A and B, all paths from A to B have the same

weight. (Path-invariance)

2 2 2 2 2 1 1 1 1 1 1 1 1 1 2 4

SLIDE 7

This property is equivalent to the existence of a potential function defined on the vertices: the weight of an edge is the difference in the potential of its endpoints:

4 5 7 6 8 3 1 5 6 3 2

2 2 2 2 2 1 1 1 1 1 1 1 1 1 2 4

SLIDE 8

Some observations

◮ To check that a weighted grid graph has the path invariance property, it is sufficient to check it on each rectangle:

a c b d a + b = c + d

SLIDE 9

Some observations

◮ The weights of a directed edge and its reversal are negatives

f each other.

SLIDE 10

Some observations

◮ We can add arbitrary other edges to the graph, and they will have uniquely determined weights that satisfy the path- invariance property.

2 2 2 2 2 1 1 1 1 1 1 1 1 1 2 4

SLIDE 11

In particular, from one path-invariant weighted grid graph, we can get many more by taking a different grid on the same set of points:

SLIDE 12

In particular, from one path-invariant weighted grid graph, we can get many more by taking a different grid on the same set of points:

SLIDE 13

In particular, from one path-invariant weighted grid graph, we can get many more by taking a different grid on the same set of points:

SLIDE 14

In particular, from one path-invariant weighted grid graph, we can get many more by taking a different grid on the same set of points:

SLIDE 15

The WZ method

Suppose that we have a a path-invariant weighted grid graph, where the vertices are in Z × Z. Let f(i, j) be the weight on the edge from (i, j) to (i + 1, j) and let g(i, j) be the weight on the edge from (i, j) to (i, j + 1):

f(i, j + 1) f(i, j) g(i, j) g(i + 1, j)

(i, j)

SLIDE 16

The WZ method

Suppose that we have a a path-invariant weighted grid graph, where the vertices are in Z × Z. Let f(i, j) be the weight on the edge from (i, j) to (i + 1, j) and let g(i, j) be the weight on the edge from (i, j) to (i, j + 1):

f(i, j + 1) f(i, j) g(i, j) g(i + 1, j)

(i, j)

The path invariance property is f(i, j) + g(i + 1, j) = g(i, j) + f(i, j + 1)

SLIDE 17

The WZ method

Suppose that we have a a path-invariant weighted grid graph, where the vertices are in Z × Z. Let f(i, j) be the weight on the edge from (i, j) to (i + 1, j) and let g(i, j) be the weight on the edge from (i, j) to (i, j + 1):

f(i, j + 1) f(i, j) g(i, j) g(i + 1, j)

(i, j)

The path invariance property is f(i, j) + g(i + 1, j) = g(i, j) + f(i, j + 1) Note: Traditionally the order of the parameters is switched.

SLIDE 18

A pair of functions (f, g) satisfying this identity is called a WZ-pair. For our purposes, we want to assume that f and g have a particular form: we want each to be of the form Γ(a1i + b1j + u1) · · · Γ(aki + bkj + uk) Γ(c1i + d1j + v1) · · · Γ(cli + dlj + vl) ziwj where the an, bn, cn, and dn are integers, and the un, vn, z, and w are complex numbers.

SLIDE 19

A pair of functions (f, g) satisfying this identity is called a WZ-pair. For our purposes, we want to assume that f and g have a particular form: we want each to be of the form Γ(a1i + b1j + u1) · · · Γ(aki + bkj + uk) Γ(c1i + d1j + v1) · · · Γ(cli + dlj + vl) ziwj where the an, bn, cn, and dn are integers, and the un, vn, z, and w are complex numbers. This implies that f(i + 1, j)/f(i, j), f(i, j + 1)/f(i, j), and f(i, j)/g(i, j) are rational functions.

SLIDE 20

A pair of functions (f, g) satisfying this identity is called a WZ-pair. For our purposes, we want to assume that f and g have a particular form: we want each to be of the form Γ(a1i + b1j + u1) · · · Γ(aki + bkj + uk) Γ(c1i + d1j + v1) · · · Γ(cli + dlj + vl) ziwj where the an, bn, cn, and dn are integers, and the un, vn, z, and w are complex numbers. This implies that f(i + 1, j)/f(i, j), f(i, j + 1)/f(i, j), and f(i, j)/g(i, j) are rational functions. Note: We will want to allow i and j to be complex numbers.

SLIDE 21

We illustrate our approach with the simplest interesting WZ pair, which is associated with the binomial theorem. One form

f this WZ pair is

f(i, j) = Γ(i + j) Γ(i + 1)Γ(j)xi(1 − x)j = i + j − 1 i

xi(1 − x)j

g(i, j) = − Γ(i + j) Γ(i)Γ(j + 1)xi(1 − x)j = − i + j − 1 i − 1

xi(1 − x)j

Let’s first see the connection between this WZ pair and the binomial theorem.

SLIDE 22

We sum along the two paths P1 and P2:

(m, 0) (0, n) (m, n) (0, 0) P1 P2

Summing along P1 gives

m−1

i=0

f(i, 0) +

n−1

j=0

g(m, j) = 1 −

n−1

j=0

m + j − 1 i − 1

xm(1 − x)j.

and summing along P2 gives

n−1

j=0

g(0, j) +

m−1

i=0

f(i, n) = 0 +

m−1

i=0

i + n − 1 i

xi(1 − x)n.

SLIDE 23

So we have the identity 1 −

n−1

j=0

m + j − 1 i − 1

xm(1 − x)j =

m−1

i=0

i + n − 1 i

xi(1 − x)n.

Taking the limit as m → ∞ we get the binomial theorem in the form 1 =

∞

i=0

i + n − 1 i

xi(1 − x)n.

SLIDE 24

So we have the identity 1 −

n−1

j=0

m + j − 1 i − 1

xm(1 − x)j =

m−1

i=0

i + n − 1 i

xi(1 − x)n.

Taking the limit as m → ∞ we get the binomial theorem in the form 1 =

∞

i=0

i + n − 1 i

xi(1 − x)n.

Conversely, starting with the binomial theorem in this form, it’s easy to find f and g, using Gosper’s algorithm.

SLIDE 25

We can use this WZ pair to obtain identities for the sum L(x, α) =

∞

i=0

α α + i xi, which can be expressed in terms of the Lerch transcendent as α Φ(x, 1, α).

SLIDE 26

We can use this WZ pair to obtain identities for the sum L(x, α) =

∞

i=0

α α + i xi, which can be expressed in terms of the Lerch transcendent as α Φ(x, 1, α). Note that L(x, 1) = −x−1 log(1 − x) and L(x, 1/2) = x−1/2 tanh−1 √ x.

SLIDE 27

The sum L(x, α) is a generating function for the polygorithm Lis(x) =

∞

i=1

xi is which reduces to the zeta function for x = 1.

SLIDE 28

The sum L(x, α) is a generating function for the polygorithm Lis(x) =

∞

i=1

xi is which reduces to the zeta function for x = 1. We have L(x, α) =

∞

i=0

α α + i xi = 1 + α

∞

i=1

1 i xi 1 + α/i = 1 +

∞

m=0

(−1)mαm+1

∞

i=1

xi im+1 = 1 + 1 x

∞

m=1

(−1)m−1αm Lim(x).

SLIDE 29

Let’s look at f(i, j) = Γ(i + j) Γ(i + 1)Γ(j)xi(1 − x)j. We want to modify f and g so that ∞

i=0 f(i, 0) becomes

something like ∞

i=0 αxi/(α + i).

SLIDE 30

Let’s look at f(i, j) = Γ(i + j) Γ(i + 1)Γ(j)xi(1 − x)j. We want to modify f and g so that ∞

i=0 f(i, 0) becomes

something like ∞

i=0 αxi/(α + i). If we replace i with α + i in

f(i, j), then we get f(α + i, j) = Γ(α + i + j) Γ(α + i + 1)xi · xα(1 − x)j Γ(j)

SLIDE 31

Let’s look at f(i, j) = Γ(i + j) Γ(i + 1)Γ(j)xi(1 − x)j. We want to modify f and g so that ∞

i=0 f(i, 0) becomes

something like ∞

i=0 αxi/(α + i). If we replace i with α + i in

f(i, j), then we get f(α + i, j) = Γ(α + i + j) Γ(α + i + 1)xi · xα(1 − x)j Γ(j) Now if we set j = 0, we get f(α + i, 0) = α α + i xi · xα α Γ(0) which is just what we want except for a constant factor

SLIDE 32

Let’s look at f(i, j) = Γ(i + j) Γ(i + 1)Γ(j)xi(1 − x)j. We want to modify f and g so that ∞

i=0 f(i, 0) becomes

something like ∞

i=0 αxi/(α + i). If we replace i with α + i in

f(i, j), then we get f(α + i, j) = Γ(α + i + j) Γ(α + i + 1)xi · xα(1 − x)j Γ(j) Now if we set j = 0, we get f(α + i, 0) = α α + i xi · xα α Γ(0) which is just what we want except for a constant factor which is unfortunately 0.

SLIDE 33

Clearly we can multiply a WZ pair by a constant.

SLIDE 34

Clearly we can multiply a WZ pair by a constant. Can we replace i with α + i? What about the Γ(0)?

SLIDE 35

Clearly we can multiply a WZ pair by a constant. Can we replace i with α + i? What about the Γ(0)? The first question is easy:

Lemma. If
f(i, j), g(i, j)
is a WZ pair then so is
f(α + i, β + j), g(α + i, β + j)
for any α and β.

SLIDE 36

To deal with the Γ(0) we need to do a little more. We observe that

f(i, j), g(i, j)
is a WZ pair, then so is
p(j)f(i, j), p(j)g(i, j)
,

where p(j) is a function that is periodic with period 1. We take p(j) to be (−1)jΓ(j)Γ(1 − j) = (−1)j π sin πj .

SLIDE 37

To deal with the Γ(0) we need to do a little more. We observe that

f(i, j), g(i, j)
is a WZ pair, then so is
p(j)f(i, j), p(j)g(i, j)
,

where p(j) is a function that is periodic with period 1. We take p(j) to be (−1)jΓ(j)Γ(1 − j) = (−1)j π sin πj . This is what Wilf and Zeilberger call shadowing.

SLIDE 38

So (after multiplying f and g by αx−α) we get a modified WZ pair f(i, j) = αΓ(α + i + j)Γ(1 − j) Γ(α + i + 1) xi(x − 1)j g(i, j) = αΓ(α + i + j)Γ(−j) Γ(α + i) xi(x − 1)j and we have

∞

i=0

f(i, 0) =

∞

i=0

α α + i xi.

SLIDE 39

To transform this sum, we consider three paths:

(m, 0) (0, 0)

The sum along the black path is equal to the sum along the red path plus the sum along the green path. We take the limit as m → ∞ and we hope that the green path goes to 0 and the red path gives us something interesting.

SLIDE 40

More precisely, we pick positive integers s and t and define h(k) to be the weight of a path from (sk, −tk) to (s(k + 1), −t(k + 1)).

(0, 0) (3s, −3t) (2s, −2t) (s, −t) h(0) h(1) h(2)

SLIDE 41

More precisely, we pick positive integers s and t and define h(k) to be the weight of a path from (sk, −tk) to (s(k + 1), −t(k + 1)).

(0, 0) (3s, −3t) (2s, −2t) (s, −t) h(0) h(1) h(2)

It’s not hard to see that as m → ∞ the sum along the green path goes to 0, so we have the identity

∞

i=0

α α + i xi =

∞

k=0

h(k)

SLIDE 42

What is h(k)? It’s the weight of any path from (sk, tk) to (s(k + 1), t(k + 1)). So if we take the path that goes down first and then to the right

(sk, −tk) (sk, −t(k + 1)) (s(k + 1), −t(k + 1))

we get h(k) = −

−tk−1

j=−t(k+1)

g(sk, j) +

s(k+1)−1

i=sk

f(i, −t(k + 1))

SLIDE 43

It’s not hard to see that h(k) will be a rational function of α, x, and k times (tk)! (α)(s−t)k (α + 1)sk (−1)txs (1 − x)t k , where (u)m is the rising factorial, (u)m =      u(u + 1) · · · (u + m − 1) if m ≥ 0 (−1)m (1 − u)m if m < 0.

SLIDE 44

The simplest cases are

∞

i=0

α α + i xi =

∞

k=0

k! (α + 1)k (−x)k (1 − x)k+1 (s = 1, t = 1) =

∞

k=0

(1 + k − α + xk + xα) × (2k)! (α + 1)k(1 − α)k+1 (−x)k (1 − x)2k+2 (s = 1, t = 2) =

∞

k=0

(1 + 2k + α − x − xk) × k! (α)k (α + 1)2k+1 (−x2)k (1 − x)k+1 (s = 2, t = 1)

SLIDE 45

The simplest cases are

∞

i=0

α α + i xi =

∞

k=0

k! (α + 1)k (−x)k (1 − x)k+1 (s = 1, t = 1) =

∞

k=0

(1 + k − α + xk + xα) × (2k)! (α + 1)k(1 − α)k+1 (−x)k (1 − x)2k+2 (s = 1, t = 2) =

∞

k=0

(1 + 2k + α − x − xk) × k! (α)k (α + 1)2k+1 (−x2)k (1 − x)k+1 (s = 2, t = 1) Note: The first formula is a 2F1 linear transformation.

SLIDE 46

For α = 1 the first formula gives − log(1 − x) =

∞

k=1

1 k

x

1 − x k = log

1 +

x 1 − x

SLIDE 47

For α = 1 the first formula gives − log(1 − x) =

∞

k=1

1 k

x

1 − x k = log

1 +

x 1 − x

In the second formula, we can’t directly set α = 1 because of

the 1 − α in the denominator. But by taking an appropriate limit, we can get − log(1 − x) =

∞

k=1
k2x + k2 − 3k + 1)(2k − 2)!

k!2

−

x (1 − x)2 k .

SLIDE 48

The third formula gives − log(1 − x) = (x − 2)

∞

k=1

(k − 1)! k! (2k)! (−1)k x2k−1 (1 − x)k .

SLIDE 49

The third formula gives − log(1 − x) = (x − 2)

∞

k=1

(k − 1)! k! (2k)! (−1)k x2k−1 (1 − x)k . This is equivalent to the known formula sin−1 x √ 1 − x2 = x

∞

k=0

k!2 (2k + 1)!(2x)2k.

SLIDE 50

For α = 1/2 the first formula gives tanh−1 x = 2x 1 − x

∞

k=0

k! (k + 1)! (2k + 2)!

− 4x2

1 − x2 k , which is also equivalent to sin−1 x √ 1 − x2 = x

∞

k=0

k!2 (2k + 1)!(2x)2k.

SLIDE 51

For α = 1/2 the first formula gives tanh−1 x = 2x 1 − x

∞

k=0

k! (k + 1)! (2k + 2)!

− 4x2

1 − x2 k , which is also equivalent to sin−1 x √ 1 − x2 = x

∞

k=0

k!2 (2k + 1)!(2x)2k. The second formula gives another formula equivalent to this.

SLIDE 52

The third formula gives tanh−1 x = 4x 1 − x2

∞

k=0

(3 + 4k − 2(1 + k)x2) × (2k)! (2k + 2)! (4k + 4)!

− 4x4

1 − x2 k .

SLIDE 53

The simplest interesting polylogarithm formula is the dilogarithm formula obtained from the second transformation formula Li2(x) =

∞

i=1

xi i2 = x 1 − x +

∞

k=1

(−1)k (1 + 2k − (k + 1)x)(H2k+1 − Hk−1) − 1

× (k − 1)! k!

(2k + 1)! x2k (1 − x)k+1 where Hn is the harmonic number 1 + 1/2 + · · · + 1/n.

SLIDE 54

The simplest interesting polylogarithm formula is the dilogarithm formula obtained from the second transformation formula Li2(x) =

∞

i=1

xi i2 = x 1 − x +

∞

k=1

(−1)k (1 + 2k − (k + 1)x)(H2k+1 − Hk−1) − 1

× (k − 1)! k!

(2k + 1)! x2k (1 − x)k+1 where Hn is the harmonic number 1 + 1/2 + · · · + 1/n. We can’t set x = 1, but we can set x = −1, to get a formula for ζ(2).

SLIDE 55

Li2(−1) = −1

2ζ(2) = −π2

12 = −1 2 +

∞

k=1

(−1)k (2 + 3k)(H2k+1 − Hk−1) − 1

× (k − 1)! k!

(2k + 1)! 1 2k+1

SLIDE 56

What do we do with a more complicated identity? Let’s look at the Bailey-Borwein-Bradley formula

∞

n=0

ζ(2n + 1)x2n =

∞

k=1

1 k2 − x2

SLIDE 57

What do we do with a more complicated identity? Let’s look at the Bailey-Borwein-Bradley formula

∞

n=0

ζ(2n + 1)x2n =

∞

k=1

1 k2 − x2 = 3

∞

k=1

1 2k

k

(k2 − x2)

k−1

m=1

4x2 − m2 x2 − m2

SLIDE 58

What do we do with a more complicated identity? Let’s look at the Bailey-Borwein-Bradley formula

∞

n=0

ζ(2n + 1)x2n =

∞

k=1

1 k2 − x2 = 3

∞

k=1

1 2k

k

(k2 − x2)

k−1

m=1

4x2 − m2 x2 − m2 = 3/2 1 − x2

∞

k=0

(2)k(1 − 2x)k(1 + 2x)k (3

2)k(2 − x)k(2 + x)k

1 4 k .

SLIDE 59

What do we do with a more complicated identity? Let’s look at the Bailey-Borwein-Bradley formula

∞

n=0

ζ(2n + 1)x2n =

∞

k=1

1 k2 − x2 = 3

∞

k=1

1 2k

k

(k2 − x2)

k−1

m=1

4x2 − m2 x2 − m2 = 3/2 1 − x2

∞

k=0

(2)k(1 − 2x)k(1 + 2x)k (3

2)k(2 − x)k(2 + x)k

1 4 k . This identity is associated with a WZ pair that corresponds to Gauss’s theorem (the nonterminating form of Vandermonde’s theorem).

SLIDE 60

Gauss’s theorem is

∞

k=0

(a)k(b)k k! (c)k = Γ(c)Γ(c − a − b) Γ(c − a)Γ(c − b). We want to transform the summand of Gauss’s theorem into a constant times 1 (k + 1)2 − x2 = (1 + x)k(1 − x)k (1 − x2)(2 + x)k(2 − x)k . If we replace k by 1 + x + k in (a)k(b)k/k! (c)k and then set a = 0, b = −2x, and c = 1 − 2x, we get − 2x Γ(0)(1 − x2) · (1 + x)k(1 − x)k (2 + x)k(2 − x)k

SLIDE 61

Gauss’s theorem is

∞

k=0

(a)k(b)k k! (c)k = Γ(c)Γ(c − a − b) Γ(c − a)Γ(c − b). We want to transform the summand of Gauss’s theorem into a constant times 1 (k + 1)2 − x2 = (1 + x)k(1 − x)k (1 − x2)(2 + x)k(2 − x)k . If we replace k by 1 + x + k in (a)k(b)k/k! (c)k and then set a = 0, b = −2x, and c = 1 − 2x, we get − 2x Γ(0)(1 − x2) · (1 + x)k(1 − x)k (2 + x)k(2 − x)k By shadowing as before we can get rid of the Γ(0) in the denominator.

SLIDE 62

But in trying to find a WZ pair we have a problem: Gauss’s theorem has four parameters (the summation index and three

thers) but to get a WZ pair all we need is the summation index

and one parameter. How do we know which one to use?

SLIDE 63

But in trying to find a WZ pair we have a problem: Gauss’s theorem has four parameters (the summation index and three

thers) but to get a WZ pair all we need is the summation index

and one parameter. How do we know which one to use? We use all of them, and instead of a WZ pair we get a WZ 4-tuple (or WZ form). This gives a path-invariant grid graph in Z4 rather than Z2. Everything works as before, except that we have more choices.

SLIDE 64

The WZ 4-tuple f =

f1(i, j, k, l), . . . , f4(i, j, k, l)
corresponding

to Gauss’s theorem is f = G i , G j , G k , G l

,

where G = (−1)k Γ(i + k) Γ(i + l) Γ(j + k) Γ(j + l) Γ(1 − k) Γ(i) Γ(j) Γ(l) Γ(i + j + k + l) . What this means is that if we put a weight of ft(i, j, k, l) on the edge from (i, j, k, l) to the adjacent point (i, j, k, l) + et, where et = (0, . . . , 1

t , . . . , 0), then we get a path-invariant grid graph.

SLIDE 65

The WZ 4-tuple f =

f1(i, j, k, l), . . . , f4(i, j, k, l)
corresponding

to Gauss’s theorem is f = G i , G j , G k , G l

,

where G = (−1)k Γ(i + k) Γ(i + l) Γ(j + k) Γ(j + l) Γ(1 − k) Γ(i) Γ(j) Γ(l) Γ(i + j + k + l) . What this means is that if we put a weight of ft(i, j, k, l) on the edge from (i, j, k, l) to the adjacent point (i, j, k, l) + et, where et = (0, . . . , 1

t , . . . , 0), then we get a path-invariant grid graph.

Note that i, j, k, and l don’t need to be integers.

SLIDE 66

Now if we sum f in the direction (1, 0, 0, 0) (i.e., parallel to the x1-axis) starting at the point (1 + x, 1, 0, −2x) we get −2x

∞

k=0

1 (1 + k)2 − x2 = −2x

∞

k=1

1 k2 − x2 .

SLIDE 67

Now if we sum f in the direction (1, 0, 0, 0) (i.e., parallel to the x1-axis) starting at the point (1 + x, 1, 0, −2x) we get −2x

∞

k=0

1 (1 + k)2 − x2 = −2x

∞

k=1

1 k2 − x2 . To transform the sum, we compute the value of the kth step starting at the same point and going in another direction. We look for something interesting and we have to check that the sum along the “connecting path" goes to 0.

SLIDE 68

Now if we sum f in the direction (1, 0, 0, 0) (i.e., parallel to the x1-axis) starting at the point (1 + x, 1, 0, −2x) we get −2x

∞

k=0

1 (1 + k)2 − x2 = −2x

∞

k=1

1 k2 − x2 . To transform the sum, we compute the value of the kth step starting at the same point and going in another direction. We look for something interesting and we have to check that the sum along the “connecting path" goes to 0. We find that the direction (1, 2, −1, −1) gives us what we want: the sum starting at (1 + x, 1, 0, −2x) gives − 3x 1 − x2

∞

n=0

(2)n(1 − 2x)n(1 + 2x)n (3

2)n(2 − x)n(2 + x)n

1 4 n .

SLIDE 69