An Introduction to Monte Carlo Techniques in Artificial Intelligence - Part I
Todd W. Neller, Gettysburg College
Monte Carlo (MC) Techniques in AI
- General: Monte Carlo simulation for probabilistic estimation
- Machine Learning: Monte Carlo reinforcement learning
- Uncertain Reasoning: Bayesian network reasoning with the Markov Chain Monte Carlo method
- Robotics: Monte Carlo localization
- Search: Monte Carlo tree search
- Game Theory: Monte Carlo regret-based techniques
Monte Carlo Simulation
- Repeated sampling of stochastic simulations to estimate system properties
- Recommended Readings:
– Wikipedia article on Monte Carlo Methods
– Paul J. Nahin’s Digital Dice: Computational Solutions to Practical Probability Problems is a great source of MC simulation exercises.
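As a minimal illustration of the idea (an example of ours, not from the slides), the following Python sketch estimates a simple dice probability by repeated sampling of a stochastic simulation:

```python
import random

# Estimate P(sum of two d6 equals 7) by repeated sampling.
# The analytic answer is 6/36 = 1/6 ≈ 0.1667.
random.seed(0)
trials = 200_000
hits = sum(
    1 for _ in range(trials)
    if random.randint(1, 6) + random.randint(1, 6) == 7
)
estimate = hits / trials
print(f"P(sum == 7) ≈ {estimate:.4f}")
```

With enough trials, the estimate converges toward the analytic value; the standard error shrinks proportionally to one over the square root of the number of trials.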
Why MC Simulation?
- Nahin’s motivational philosophical theme:
– 1. No matter how smart you are, there will always be probabilistic problems that are too hard for you to solve analytically.
– 2. Despite (1), if you know a good scientific programming language that incorporates a random number generator (and if it is good, it will), you may still be able to get numerical answers to those “too hard” problems.
Problem Solving Approach
- 1. Program a single simulation with enough printed output to convince you of the correctness of your model.
- 2. Add your statistical measure of interest and test its correctness as well.
- 3. Remove printing from the code.
- 4. Wrap the code in a loop of many iterations.
- 5. Add printing to summarize the analysis of the collected statistical data.
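Applied to the “hold at 20” Pig turn exercise that appears later in the deck, the workflow might look like the following Python sketch (step numbers in comments; the rules assumed are standard Pig: rolling a 1 ends the turn with 0 points, otherwise the roll is added to the turn total):

```python
import random

# Steps 1-3: a single, tested simulation (debug printing removed).
def pig_turn_hold_at_20():
    """One Pig turn under the 'hold at 20' policy: roll a d6 until a 1
    ends the turn scoring 0, or the turn total reaches 20 and we hold."""
    total = 0
    while total < 20:
        roll = random.randint(1, 6)
        if roll == 1:
            return 0       # rolled a 1: the turn scores nothing
        total += roll
    return total           # hold with a turn total of 20..25

# Step 4: wrap the simulation in a loop of many iterations.
random.seed(1)
trials = 100_000
outcomes = [pig_turn_hold_at_20() for _ in range(trials)]

# Step 5: print a summary of the collected statistics.
p_zero = outcomes.count(0) / trials
mean_score = sum(outcomes) / trials
print(f"P(turn scores 0) ≈ {p_zero:.3f}, mean turn score ≈ {mean_score:.2f}")
```

Note how each numbered step of the approach maps onto a labeled part of the code.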
Game AI Exercises
- Yahtzee
– probability of getting a Yahtzee (5 of a kind) in 3 rolls of 5 dice
- Pig
– probability of turn outcomes of the “hold at 20” policy
– expected number of turns in solitaire play
– first-player advantage assuming the “hold at 20” policy
- Risk
– attack rollouts with varying numbers of attackers and defenders
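As one worked instance of these exercises, the Yahtzee probability can be estimated with the sketch below. It assumes the natural greedy rerolling strategy (keep the most common face, reroll the rest), which is our choice of policy rather than one specified in the slides:

```python
import random
from collections import Counter

def yahtzee_in_three() -> bool:
    """One trial: roll 5 dice, greedily keep the most common face,
    reroll the rest, up to 3 rolls total."""
    dice = [random.randint(1, 6) for _ in range(5)]
    for _ in range(2):                      # two rerolls after the first roll
        face, count = Counter(dice).most_common(1)[0]
        if count == 5:
            return True
        dice = [face] * count + [random.randint(1, 6) for _ in range(5 - count)]
    return len(set(dice)) == 1

random.seed(2)
trials = 100_000
p = sum(yahtzee_in_three() for _ in range(trials)) / trials
print(f"P(Yahtzee in 3 rolls) ≈ {p:.4f}")
```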
Limitations of MC Simulation
- probability of rolling all 1s for n dice: as n grows, the event becomes too rare to observe in a feasible number of samples.
MC Reinforcement Learning
- Learn essential Reinforcement Learning (RL) terminology from a variety of sources:
– Sutton, R.S. and Barto, A.G., Reinforcement Learning: An Introduction, Chapter 3
– Kaelbling, L.P., Littman, M.L., and Moore, A.W., Reinforcement Learning: A Survey, sections 1 and 3.1
– Russell, S. and Norvig, P., Artificial Intelligence: A Modern Approach, 3rd ed., section 17.1
- Read specifically about MC RL:
– Sutton, R.S. and Barto, A.G., Reinforcement Learning: An Introduction, Chapter 5
Approach N
- Since learning is best through experience, we suggest implementing Sutton and Barto’s MC RL algorithms with a single running problem: Approach N.
- Approach N
– Originally designed as the simplest “jeopardy approach game” [Neller & Presser 2005] prototype: 2 players and a single standard 6-sided die (d6).
– Goal: approach a total of n without exceeding it.
– The 1st player rolls the die repeatedly until they either (1) “hold” with a roll sum <= n, or (2) exceed n and lose.
– If the 1st player holds at exactly n, it is an immediate win. Otherwise, the 2nd player rolls to exceed the 1st player’s total without exceeding n, winning or losing accordingly.
- Only the 1st player has a choice of play policy.
- For n >= 10, the game is nearly fair.
- Sample solution output is given for n = 10, but students may be assigned a different n.
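A minimal Python sketch of these game dynamics, under our reading of the rules above (`hold_at` parameterizes the 1st player’s policy: roll until the sum reaches that value, then hold):

```python
import random

def approach_n(n: int, hold_at: int) -> bool:
    """One game of Approach n; returns True iff the 1st player wins.
    The 1st player rolls until the sum reaches hold_at, losing past n."""
    p1 = 0
    while p1 < hold_at:
        p1 += random.randint(1, 6)
    if p1 > n:
        return False               # 1st player exceeded n and loses
    p2 = 0                         # 2nd player must beat p1 without exceeding n
    while p2 <= p1:
        p2 += random.randint(1, 6)
        if p2 > n:
            return True            # 2nd player busts; 1st player wins
    return False                   # 2nd player beat the total within n

# Estimate the 1st player's win probability for each hold point s.
random.seed(3)
n, trials = 10, 50_000
win_rate = {
    s: sum(approach_n(n, s) for _ in range(trials)) / trials
    for s in range(n - 5, n + 1)
}
for s, rate in win_rate.items():
    print(f"hold at {s}: P(1st player wins) ≈ {rate:.3f}")
```

Note that holding at exactly n is handled implicitly: the 2nd player can then never exceed the 1st player’s total without exceeding n, so they always bust, which matches the immediate-win rule.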
MC RL Approach N Exercises
- Comparative MC Simulation
– Simulate games with the 1st player holding at sum s, for s in [n – 5, n]. Which s maximizes 1st-player wins?
- First-visit MC method for policy evaluation
- MC control with exploring starts (MCES)
- Epsilon-soft on-policy MC control
- Off-policy MC control
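For the policy-evaluation exercise, a first-visit MC prediction sketch is shown below. The representation is our assumption: the state is the 1st player’s current sum before rolling, the fixed policy is “roll until the sum reaches `hold_at`,” and the return is 1 for a win and 0 for a loss. The game dynamics are re-implemented inline so the block is self-contained:

```python
import random
from collections import defaultdict

def play_episode(n, hold_at):
    """Play one Approach n game; return the 1st player's visited sums and
    the terminal reward (1 for a win, 0 for a loss)."""
    states, total = [], 0
    while total < hold_at:
        states.append(total)       # record each pre-roll state
        total += random.randint(1, 6)
    if total > n:
        return states, 0           # 1st player exceeded n and loses
    p2 = 0
    while p2 <= total:
        p2 += random.randint(1, 6)
        if p2 > n:
            return states, 1       # 2nd player busts; 1st player wins
    return states, 0               # 2nd player beat the total within n

def first_visit_mc(n, hold_at, episodes=50_000):
    """First-visit MC prediction: V(s) estimates the probability that the
    1st player wins from sum s under the fixed hold-at policy."""
    returns = defaultdict(list)
    for _ in range(episodes):
        states, g = play_episode(n, hold_at)
        for s in set(states):      # first visit only: one return per state
            returns[s].append(g)
    return {s: sum(gs) / len(gs) for s, gs in returns.items()}

random.seed(4)
V = first_visit_mc(10, 8)
print({s: round(v, 3) for s, v in sorted(V.items())})
```

Because the 1st player’s sum strictly increases, each state occurs at most once per episode here, so first-visit and every-visit MC coincide for this problem; the `set(states)` guard still makes the first-visit rule explicit.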
Further MC RL Game AI Exercises
- Hog Solitaire
– Each turn, roll some chosen number of dice. Score only rolls with no 1s. How many dice should be rolled so as to minimize the expected number of turns to reach a goal score?
- Pig Solitaire
– As above, but with individual die rolls and the option to hold and score at any time.
- Yahtzee or Chance
– Assuming an option to score a Yahtzee (5-of-a-kind, 50 pts.) or Chance (sum of dice) in 3 rolls, which dice should be rerolled in any given situation?
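The Hog Solitaire exercise can be attacked directly with comparative MC simulation before any RL is applied. The sketch below (goal score of 100 is our choice of parameter) estimates the expected number of turns for a fixed dice count:

```python
import random

def hog_turns_to_goal(num_dice, goal=100, trials=10_000):
    """Estimate the expected number of turns to reach `goal` in Hog solitaire
    when rolling `num_dice` dice each turn (a roll containing a 1 scores 0)."""
    total_turns = 0
    for _ in range(trials):
        score = turns = 0
        while score < goal:
            rolls = [random.randint(1, 6) for _ in range(num_dice)]
            if 1 not in rolls:        # score only rolls with no 1s
                score += sum(rolls)
            turns += 1
        total_turns += turns
    return total_turns / trials

random.seed(5)
for k in (1, 3, 5, 7):
    print(f"{k} dice: ≈ {hog_turns_to_goal(k):.1f} turns to 100")
```

As a sanity check, the analytic per-turn expected score is (5/6)^k · 4k, which ties exactly at k = 5 and k = 6, so the simulated turn counts should bottom out around five or six dice.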
Conclusion
- Deep knowledge comes best from playful experience.
– “One must learn by doing the thing; for though you think you know it, you have no certainty, until you try.” – Sophocles
– “Play is our brain’s favorite way of learning.” – Diane Ackerman
- We have provided novel, fun Game AI exercises that support such playful learning of MC techniques.