ARTIFICIAL INTELLIGENCE
Lecturer: Silja Renooij
Planning under uncertainty: POMDPs
Utrecht University The Netherlands
These slides are part of the INFOB2KI Course Notes available from www.cs.uu.nl/docs/vakken/b2ki/schema.html
ARTIFICIAL INTELLIGENCE Planning under uncertainty: POMDPs - - PowerPoint PPT Presentation
Utrecht University INFOB2KI 2019-2020 The Netherlands ARTIFICIAL INTELLIGENCE Planning under uncertainty: POMDPs Lecturer: Silja Renooij These slides are part of the INFOB2KI Course Notes available from
Utrecht University The Netherlands
These slides are part of the INFOB2KI Course Notes available from www.cs.uu.nl/docs/vakken/b2ki/schema.html
Prediction Planning Fully observable
(Markov decision process)
Partially observable
(Partially observable Markov decision process)
S3 S1 S2 … S1 S2 S3 O1 O2 O3 …
2
3
* ' *
s a
4
Environment
Goal
5
7
8
O n
1 2
n
dimensionality memory
9
Γ represents a vector of values for possible states (true state unknown)
10
∈
11
b 𝑐, 𝑏, 𝑐′ 𝑄 𝑐 𝑐, 𝑏 ∑
∈
∈ ∈ ∈
b 𝑐, 𝑏 ∑
∈
𝑈: BA → 𝐶 𝑆: BA → 𝑺
12
zero or one
P(s0) = 0 P(s0) = 1
13
P(b|b,a,o) Current Belief State (Register) Policy
Update belief state after action and observation Policy maps belief state to action Policy is found by solving the belief‐state MDP
14
15
16
17