evaluating state space abstractions in extensive form
play

Evaluating State-Space Abstractions in Extensive-Form Games Michael - PowerPoint PPT Presentation

Evaluating State-Space Abstractions in Extensive-Form Games Michael Johanson, Neil Burch, Richard Valenzano and Michael Bowling University of Alberta, Canada Outline Using CFR-BR to evaluate abstractions Using imperfect recall in


  1. Evaluating State-Space Abstractions in Extensive-Form Games Michael Johanson, Neil Burch, Richard Valenzano and Michael Bowling University of Alberta, Canada

  2. Outline ● Using CFR-BR to evaluate abstractions ● Using imperfect recall in abstractions ● New abstraction features – Read our paper!

  3. Extensive-Form Games Rock Paper Scissors Limit Texas Hold'em RTS Games 9 states ~10 18 states many states TOO BIG!

  4. Abstraction ● Combine strategically similar situations to create a smaller (hopefully) strategically similar game

  5. Evaluating an Abstraction ● Gilpin and Sandholm (AAAI '08) listed three methods for evaluating abstractions – One on one comparison – Play versus real-game equilibrium – Play versus best-response

  6. Evaluating an Abstraction ● One on one comparison – Not transitive: cycles of winners – Depends on the particular abstract solutions Real Game Abstract Abstraction A Strategy a Solution a Expected value Abstract Real Game Abstraction B Solution b Strategy b

  7. Evaluating an Abstraction ● Play versus real-game equilibrium – Generally intractable – Depends on the particular abstract solutions Real Game Abstract Abstraction A Strategy a Solution a Expected value Real Game Solution

  8. Evaluating an Abstraction ● Play versus best-response – Depends on the particular abstract solutions – Does not match observed one-on-on performance Real Game Abstract Abstraction A Strategy a Solution a Best Exploitability Response

  9. CFR-BR [Johanson et al. 2012] Real game strategies Abstract game strategies Real game solutions Abstract solutions CFR-BR finds the least exploitable abstract strategy

  10. Evaluation using CFR-BR ● CFR-BR (Johanson et al. AAAI '12) can be used to find an abstract strategy with lowest real-game exploitability Real Game CFR-BR Abstraction A Strategy a Solution a Best Exploitability Response

  11. Imperfect Recall Imperfect Recall Perfect Recall 1 1 N M 1 1 ... ... N M 1 1 ... ... ... ... ... ... N Depth information sets K information sets

  12. Imperfect Recall Chance Texas Limit Round 1 Player Actions Hold'em Chance Round 2 Player Actions Chance Round 3 Player Action Chance Round 4 Player Actions Abstraction # Information Sets 10/10/10/10 perfect recall 57,330,780 10/100/1000/10000 imperfect recall 57,330,780 169/9000/9000/9000 imperfect recall 57,331,352

  13. Evaluating Imperfect Recall Abstractions Should we use imperfect recall in an abstraction? Yes! Abstraction One-on-One vs. Best CFR-BR vs. Performance Response Best Response 10/10/10/10 PR -24.8 -282.856 -84.039 169/9000/9000/9000 IR 24.8 -282.395 ≥ -64.820 Comparison of perfect and imperfect recall abstraction of limit Texas Hold'em All values are big blinds per thousand hands

  14. Summary ● Use CFR-BR to evaluate abstractions – Transitive measure – Tracks one-on-one performance well – Not dependent on a particular strategy ● Use imperfect recall in abstractions – More flexibility in abstraction choices – Demonstrable improvement in abstraction quality

  15. Thank you! Mihai Ciucu, Eric Jackson, Mengliao Wang, UofA Computer Poker Research ● Group NSERC, Alberta Innovates Technology Futures, WestGrid, Réseau ● Québécois de Calcul de Haute Performance, Compute/Calcul Canada Pictures from freepokerphotosite.com and Wikipedia ●

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend