Theory of Computer Games: Concluding Remarks
Tsan-sheng Hsu
tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu
1
Theory of Computer Games: Concluding Remarks Tsan-sheng Hsu - - PowerPoint PPT Presentation
Theory of Computer Games: Concluding Remarks Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Abstract Introducing practical issues. The open book. The graph history interaction (GHI) problem. Smart
tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu
1
⊲ time during searching ⊲ memory ⊲ coding efforts ⊲ debugging efforts
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ Use off-line computation to find out the value of a position for a given depth that cannot be computed online during a game due to resource constraints.
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ win: the number of games reaching this position and then wins. ⊲ loss: the number of games reaching this position and then loss. ⊲ draw: the number of games reaching this position and then draw.
⊲ Dark chess: adjacent attack of the opponent’s Cannon. [Chen and Hsu 2013]
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ It can be win. loss or draw for Chinese chess. ⊲ It can only be draw for Western chess. ⊲ It can only be loss for Go.
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
A B C D E F G H loss win I J
⊲ Memorized H as a loss position.
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ More time is spent in the beginning when the game just starts. ⊲ Stop searching a path further when you think the position is stable.
⊲ Use the time when your opponent is thinking. ⊲ Guessing and then pondering.
⊲ A large number of positions are not visited too often.
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ “better” means closer to the real value f(p)
⊲ In a MAX node, use f1. ⊲ In a MIN node, use f2
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ The value of a node is a distribution, not a fixed value.
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
i=1 score(i) ∗ Pr(x = i).
i=1{score(i) | Pr(x = i) > 0}.
i=1{score(i) | Pr(x = i) > 0}.
⊲ Using symmetry, we can reduce it to 7*8.
⊲ N = 14. ⊲ Assume x = 1 means a black King is revealed and x = 8 means a red King is revealed. ⊲ Then score(1) = score(8). ⊲ P r(x = 1) = P r(x = 8) = 1/14.
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ m0 = vmin ⊲ M0 = vmax
⊲ m1 ≥ score(1) ∗ P r(x = 1) + vmin ∗ (1 − P r(x = 1)), and ⊲ M1 ≤ score(1) ∗ P r(x = 1) + vmax ∗ (1 − P r(x = 1)).
⊲ mi∗ ≥ i∗
i=1 score(i) ∗ P r(x = i) + vmin ∗ (1 − i∗ i=1 P r(x = i)), and
⊲ Mi∗ ≤ i∗
i=1 score(i) ∗ P r(x = i) + vmax ∗ (1 − i∗ i=1 P r(x = i)). TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ m0 = −10. ⊲ M0 = 10.
⊲ m1 = −2 ∗ 1/7 + −10 ∗ 6/7 = −62/7 ≃ −8.86. ⊲ M1 = −2 ∗ 1/7 + 10 ∗ 6/7 = 58/7 ≃ 8.26.
⊲ m1 = 3 ∗ 1/7 + −10 ∗ 6/7 = −57/7 ≃ −8.14. ⊲ M1 = 3 ∗ 1/7 + 10 ∗ 6/7 = 63/7 = 9.
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ Knowing some value v′
1 of a subtree for x = 1 gives an upper bound,
i.e., score(1) ≥ v′
1.
⊲ Knowing some value v′
2 of a subtree for x = 2 gives an upper bound,
i.e., score(2) ≥ v′
2.
⊲ These bounds can be used to make the search window further narrower.
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ Main search algorithm ⊲ Enhancements ⊲ Evaluating function: knowledge
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
i ) = x
i=1 Xi.
⊲ The mean of X[n], E(X[n]), is 0. ⊲ The standard deviation of X[n], σn, is x√n
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ Data source: 63,548 games played among masters recorded at www.dpxq.com. ⊲ This means the first player has a better chance of winning.
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
P r(|X[n]| ≤ s) s = 0 s = 1 s = 2 s = 3 s = 4 s = 5 s = 6
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
P r(|X[n]| ≤ s) s = 7 s = 8 s = 9 s = 10 s = 11 s = 12 s = 13
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
P r(|X[n]| ≤ s) s = 14 s = 15 s = 16 s = 17 s = 18 s = 19 s = 20
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ Then this result is meaningful, that is a program is better than the
⊲ Then this result is not very meaningful, because it happens with a high probability of 0.95.
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
⊲ You can only solve a game once!
⊲ You can learn lots of knowledge.
⊲ Fun!
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c
TCG: Concluding remarks, 20150108, Tsan-sheng Hsu c