Survey of Artificial Intelligence for Card Games and Its Application - - PowerPoint PPT Presentation

survey of artificial intelligence for card games and its
SMART_READER_LITE
LIVE PREVIEW

Survey of Artificial Intelligence for Card Games and Its Application - - PowerPoint PPT Presentation

Survey of Artificial Intelligence for Card Games and Its Application to the Swiss Game Jass J. Niklaus *1 , M. Alberti *1 , V. Pondenkandath 1 , R. Ingold 1 , M. Liwicki 12 *Equal contribution 1 DIVA Group, University of Fribourg, Switzerland 2


slide-1
SLIDE 1

Survey of Artificial Intelligence for Card Games and Its Application to the Swiss Game Jass

  • J. Niklaus*1, M. Alberti*1, V. Pondenkandath1, R. Ingold1, M. Liwicki12

*Equal contribution

1DIVA Group, University of Fribourg, Switzerland 2EISLAB Machine Learning, Luleå University of Technology, Sweden

slide-2
SLIDE 2

Perfect Information Games

Great Success of Artificial Intelligence in Games in last decades AlphaGO (GO), AlphaZero (Chess), Maluuba (Pacman), …

2

slide-3
SLIDE 3

Why Care About Hidden Information Games?

Hidden information makes games hard Are very close to real-world application energy optimization in datacenters, surgical operations, business, physics, …

3

slide-4
SLIDE 4

AI in Hidden Information Games

Recently several milestones have been reached OpenAI Five (Dota II), Libratus (Poker), AlphaStar (StarCraft II), …

4

slide-5
SLIDE 5

The Next Challenge: Jass

Trick-taking traditional Swiss card game Hidden information Sequential Non-cooperative Finite Constant-sum Schieber variant with 4 players

5

slide-6
SLIDE 6

Coordination Game Within Jass

Activity within team → Coordination Game Can convey meaning by playing cards according to common protocol Agreements like “discarding policy” Example: Top-Down Player 3 plays low Diamond Card to signal strong Diamond Suit (probably Ace or at least King) to player 1

6

Player 1 Player 3 Player 2

slide-7
SLIDE 7

Jass vs. Hanabi

Competitive and Cooperative Cooperation is key to success on high level Multiplayer Game Hidden Information → Suitable testbed “New Frontier of AI Research” (Deepmind) Purely cooperative Multiplayer Game Hidden Information

7

slide-8
SLIDE 8

Our Contribution

Survey of existing methods for card games Rule Based, Evolutionary, RL, MCTS Starting point for research in hidden information or card games Discussion of methods on use case Jass

8

slide-9
SLIDE 9

Rule Based Systems

Leverage human knowledge Simple Used as baselines Hanabi is solved only by RB systems so far Can be seen as man-made decision trees

9

slide-10
SLIDE 10

Evolutionary Algorithms

Inspired by evolutionary theory Survival of the fittest Example: Population: Blackjack strategy Fitness function: money after playing the game for N iterations Example: Population: Hearthstone deck Fitness function: score after playing versus established human-designed decks

10

slide-11
SLIDE 11

Reinforcement Learning in a Nutshell

11

slide-12
SLIDE 12

Counterfactual Regret Minimization (CFR)

12

slide-13
SLIDE 13

Various Reinforcement Learning Methods

Temporal Difference Learning Policy Gradient Counterfactual Regret Minimization (CFR) CFR+ Deep CFR Discounted CFR Neural Fictitious Self-Play First Order Methods

slide-14
SLIDE 14

Monte Carlo Simulation in a Nutshell

Problem analytically very hard or impossible to solve Stochastic solution: big number of random experiments Example: Approximate π, 4*probability that random point in square is within the circle

14

slide-15
SLIDE 15

Monte Carlo Tree Search (MCTS)

15

slide-16
SLIDE 16

Variations of Monte Carlo Methods

Monte Carlo Simulation Flat Monte Carlo Monte Carlo Tree Search Upper Confidence Bound for Trees Determinization Information Set Monte Carlo Tree Search Monte Carlo Sampling for Regret Minimization

slide-17
SLIDE 17

Application to the use-case

  • f Jass

MCTS and CFR most successfully applied to card games CFR only used in Poker so far Approaches NE Bad at exploiting opponents MCTS applied to plethora of complex card games No guarantees for approaching NE Good at finding good solution fast Choice depends on the goal AI that never loses, costs don’t matter → CFR AI that performs well, can exploit opponents → MCTS

slide-18
SLIDE 18

Preliminary Results

Jass Server: http://jass.joeli.to Play against MCTS bot! Suggestion engine based on MCTS bot Beats the average human player (par-human) Introduction Video: bit.ly/jass_intro Experiment: bit.ly/jass_form

18

slide-19
SLIDE 19

Conclusion

19

Rule-based is replaced by most popular methods MCTS and CFR MCTS finds good strategies fast, but is exploitable by very good opponents CFR is not exploitable but cannot exploit others Jass is a hard game and hence suitable for testbed of new technologies and methods in the field. We propose using MCTS and our preliminary results suggest it’s a good idea Our bot performs at par-human level