Foundations of Artificial Intelligence
- 13. Acting under Uncertainty
Foundations of Artificial Intelligence 13. Acting under Uncertainty - - PowerPoint PPT Presentation
Foundations of Artificial Intelligence 13. Acting under Uncertainty Maximizing Expected Utility Joschka Boedecker and Wolfram Burgard and Bernhard Nebel Albert-Ludwigs-Universit at Freiburg Contents Introduction to Utility Theory 1
(University of Freiburg) Foundations of AI 2 / 32
(University of Freiburg) Foundations of AI 3 / 32
(University of Freiburg) Foundations of AI 4 / 32
(University of Freiburg) Foundations of AI 5 / 32
(University of Freiburg) Foundations of AI 6 / 32
(University of Freiburg) Foundations of AI 7 / 32
(University of Freiburg) Foundations of AI 8 / 32
(University of Freiburg) Foundations of AI 9 / 32
(University of Freiburg) Foundations of AI 10 / 32
150,000 800,000
(University of Freiburg) Foundations of AI 11 / 32
1 2 3 1 2 3 − 1 + 1 4
START
(University of Freiburg) Foundations of AI 12 / 32
(University of Freiburg) Foundations of AI 13 / 32
(University of Freiburg) Foundations of AI 14 / 32
–1 +1 1 2 3 1 2 3 4
(University of Freiburg) Foundations of AI 15 / 32
(University of Freiburg) Foundations of AI 16 / 32
(University of Freiburg) Foundations of AI 17 / 32
(University of Freiburg) Foundations of AI 18 / 32
(University of Freiburg) Foundations of AI 19 / 32
(University of Freiburg) Foundations of AI 20 / 32
(University of Freiburg) Foundations of AI 21 / 32
(University of Freiburg) Foundations of AI 22 / 32
1 2 3 1 2 3 –1 + 1 4 0.611 0.812 0.655 0.762 0.918 0.705 0.660 0.868 0.388
(University of Freiburg) Foundations of AI 23 / 32
(University of Freiburg) Foundations of AI 24 / 32
(University of Freiburg) Foundations of AI 25 / 32
(University of Freiburg) Foundations of AI 26 / 32
a ∈ A(s)
s′
(University of Freiburg) Foundations of AI 27 / 32
(University of Freiburg) Foundations of AI 28 / 32
0.2 0.4 0.6 0.8 1 5 10 15 20 25 30 Utility estimates Number of iterations (4,3) (3,3) (1,1) (3,1) (4,1) 0.2 0.4 0.6 0.8 1 2 4 6 8 10 12 14 Max error/Policy loss Number of iterations Max error Policy loss
(University of Freiburg) Foundations of AI 29 / 32
(University of Freiburg) Foundations of AI 30 / 32
a ∈ A(s)
s′
s′
a ∈ A(s)
s′
(University of Freiburg) Foundations of AI 31 / 32
(University of Freiburg) Foundations of AI 32 / 32