CS 473: Artificial Intelligence, Conclusion (Dan Weld)

CS 473: Artificial Intelligence
Conclusion

Dan Weld – University of Washington

[Many of these slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials are available at http://ai.berkeley.edu.]

Exam Topics
§ Search
  § Problem spaces
  § BFS, DFS, UCS, A* (tree and graph), local search
  § Completeness and optimality
  § Heuristics: admissibility and consistency; pattern DBs
§ CSPs
  § Constraint graphs, backtracking search
  § Forward checking, AC3 constraint propagation, ordering heuristics
§ Games
  § Minimax, alpha-beta pruning
  § Expectimax
  § Evaluation functions
§ MDPs
  § Bellman equations
  § Value iteration, policy iteration
§ Reinforcement Learning
  § Exploration vs. exploitation
  § Model-based vs. model-free
  § Q-learning
  § Linear value function approx.
§ Hidden Markov Models
  § Markov chains, DBNs
  § Forward algorithm
  § Particle filters
§ Bayesian Networks
  § Basic definition, independence (d-sep)
  § Variable elimination
  § Sampling (rejection, importance)
§ Learning
  § BN parameters with complete data
  § Search thru space of BN structures
  § Expectation maximization

What is intelligence?
§ (Bounded) rationality
  § Agent has a performance measure to optimize
  § Given its state of knowledge
  § Choose optimal action
  § With limited computational resources
§ Human-like intelligence/behavior

Search in Discrete State Spaces
§ Every discrete problem can be cast as a search problem.
  § states, actions, transitions, cost, goal-test
§ Types
  § Uninformed systematic: often slow
    § DFS, BFS, uniform-cost, iterative deepening
  § Heuristic-guided: better
    § Greedy best first, A*
    § Relaxation leads to heuristics
  § Local: fast, fewer guarantees; often local optimal
    § Hill climbing and variations
    § Simulated annealing: global optimal
    § (Local) beam search
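To make the "relaxation leads to heuristics" point concrete, here is a minimal sketch of A* graph search on a small grid. It assumes a 4-connected grid with unit step costs and '#' marking walls; the names `astar`, `manhattan`, and `GRID` are illustrative and do not come from the slides. Manhattan distance is the exact cost of the relaxed problem in which walls are ignored, so it never overestimates the true cost.

```python
import heapq

def manhattan(a, b):
    """Cost of the relaxed problem (no walls): |dx| + |dy|."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])

def astar(grid, start, goal):
    """A* graph search on a 4-connected grid with unit step costs.

    grid is a list of equal-length strings; '#' cells are walls.
    Returns the cost of a cheapest path, or None if the goal is unreachable.
    """
    rows, cols = len(grid), len(grid[0])
    frontier = [(manhattan(start, goal), 0, start)]  # entries are (f, g, state)
    best_g = {start: 0}
    while frontier:
        f, g, state = heapq.heappop(frontier)
        if state == goal:
            return g
        if g > best_g.get(state, float("inf")):
            continue  # stale queue entry; a cheaper path was found earlier
        r, c = state
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (r + dr, c + dc)
            in_bounds = 0 <= nxt[0] < rows and 0 <= nxt[1] < cols
            if in_bounds and grid[nxt[0]][nxt[1]] != "#":
                ng = g + 1
                if ng < best_g.get(nxt, float("inf")):
                    best_g[nxt] = ng
                    heapq.heappush(frontier, (ng + manhattan(nxt, goal), ng, nxt))
    return None

# Hypothetical 3x3 maze: start in the top-left corner, goal in the bottom-right.
GRID = ["S..",
        ".#.",
        "..G"]
print(astar(GRID, (0, 0), (2, 2)))
```

Because the heuristic is consistent on this kind of grid, the first time the goal is popped from the queue its cost is optimal.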

Which Algorithm?
§ A*, Manhattan heuristic

Adversarial Search
§ AND/OR search space (max, min)
§ Minimax objective function
§ Minimax algorithm (~DFS)
§ Alpha-beta pruning
§ Utility function for partial search
  § Learning utility functions by playing with itself
  § Openings/endgame databases

Knowledge Representation and Reasoning
§ Representing: what agent knows
  § Propositional logic
  § Constraint networks
  § HMMs
  § Bayesian networks
  § …
§ Reasoning: what agent can infer
  § Search
  § Dynamic programming
  § Preprocessing to simplify
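The minimax and alpha-beta bullets can be sketched in a few lines. This is an illustration under stated assumptions, not the course's reference code: the game tree is given as nested Python lists with numeric leaves, and the names `alphabeta` and `TREE` are hypothetical.

```python
import math

def alphabeta(node, maximizing=True, alpha=-math.inf, beta=math.inf):
    """Depth-first minimax with alpha-beta pruning.

    node is either a number (leaf utility) or a list of child nodes.
    alpha: best value the maximizer can already guarantee on this path.
    beta:  best value the minimizer can already guarantee on this path.
    """
    if not isinstance(node, list):
        return node
    if maximizing:
        value = -math.inf
        for child in node:
            value = max(value, alphabeta(child, False, alpha, beta))
            alpha = max(alpha, value)
            if alpha >= beta:
                break  # the minimizer above will never allow this branch
        return value
    value = math.inf
    for child in node:
        value = min(value, alphabeta(child, True, alpha, beta))
        beta = min(beta, value)
        if alpha >= beta:
            break  # the maximizer above already has something at least as good
    return value

# Hypothetical two-ply tree: a max root over three min nodes.
TREE = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
print(alphabeta(TREE))
```

Pruning changes only how many nodes are visited, never the value returned at the root. For partial search, the leaves would hold evaluation-function estimates rather than true utilities.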

Search+KR&R Example: CSP
§ Representation
  § Variables, Domains, Constraints
§ Reasoning:
  § Arc Consistency (k-Consistency)
§ Solving
  § Backtracking search: partial var assignments
  § Heuristics: min remaining values, min conflicts
  § Local search: complete var assignments

Trapped
§ Pacman is trapped! He is surrounded by mysterious corridors, each of which leads to either a pit (P), a ghost (G), or an exit (E). In order to escape, he needs to figure out which corridors, if any, lead to an exit and freedom, rather than the certain doom of a pit or a ghost. The one sign of what lies behind the corridors is the wind: a pit produces a strong breeze (S) and an exit produces a weak breeze (W), while a ghost doesn’t produce any breeze at all. Unfortunately, Pacman cannot measure the strength of the breeze at a specific corridor. Instead, he can stand between two adjacent corridors and feel the max of the two breezes. For example, if he stands between a pit and an exit he will sense a strong (S) breeze, while if he stands between an exit and a ghost, he will sense a weak (W) breeze. The measurements for all intersections are shown in the figure below. Also, while the total number of exits might be zero, one, or more, Pacman knows that two neighboring squares will not both be exits.

[Figure: the corridors around Pacman with the breeze measured at each intersection]

Variables?

Trapped
§ A pit produces a strong breeze (S) and an exit produces a weak breeze (W), while a ghost doesn’t produce any breeze at all. Pacman feels the max of the two breezes.
§ The total number of exits might be zero, one, or more.
§ Two neighboring squares will not both be exits.

Variables? X_1, …, X_6
Domains? {P, G, E}
Constraints?
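One way to answer the "Constraints?" prompt is to write each breeze measurement as a binary constraint between two adjacent corridors and then run backtracking search over partial assignments. The sketch below rests on several assumptions that are not in the slides: the six corridors are taken to form a ring, the variables are 0-indexed, the label "N" (no breeze) is added for completeness, and the readings in `HYPOTHETICAL_READINGS` are invented because the figure with the real measurements is not reproduced in this text.

```python
STRENGTH = {"P": 2, "E": 1, "G": 0}  # pit: strong, exit: weak, ghost: none
READING = {"S": 2, "W": 1, "N": 0}   # "N" (no breeze) is an assumed extra label

def pair_ok(a, b, reading):
    """Binary constraint on two adjacent corridors."""
    if a == "E" and b == "E":
        return False  # two neighboring corridors are never both exits
    return max(STRENGTH[a], STRENGTH[b]) == READING[reading]

def solve(readings):
    """Backtracking search over corridors 0..n-1 arranged in a ring.

    readings[i] is the breeze felt between corridor i and corridor (i + 1) % n.
    Returns every consistent assignment as a string such as "PGEGPP".
    """
    n = len(readings)
    solutions = []

    def consistent(assignment):
        i = len(assignment) - 1  # index of the most recently assigned variable
        if i >= 1 and not pair_ok(assignment[i - 1], assignment[i], readings[i - 1]):
            return False
        if i == n - 1 and not pair_ok(assignment[i], assignment[0], readings[i]):
            return False  # close the ring between the last and first corridor
        return True

    def backtrack(assignment):
        if len(assignment) == n:
            solutions.append("".join(assignment))
            return
        for value in "PGE":
            assignment.append(value)
            if consistent(assignment):
                backtrack(assignment)
            assignment.pop()

    backtrack([])
    return solutions

# Invented measurements, NOT the ones from the lecture figure.
HYPOTHETICAL_READINGS = ["S", "W", "W", "S", "S", "S"]
for solution in solve(HYPOTHETICAL_READINGS):
    print(solution)
```

Checking each constraint as soon as both of its variables are assigned prunes most of the 3^6 complete assignments early. Arc consistency or a min-remaining-values ordering would prune further, but neither is needed at this size.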
