CS475/CS675 Lecture 24: July 21, 2016
Open problems
CS475/CS675 (c) 2016 P. Poupart 1
CS475/CS675 Lecture 24: July 21, 2016 Open problems CS475/CS675 - - PowerPoint PPT Presentation
CS475/CS675 Lecture 24: July 21, 2016 Open problems CS475/CS675 (c) 2016 P. Poupart 1 Two Open Problems Kernel methods: how to solve linear systems of equation in less than cubic time Markov decision processes: how to evaluate
CS475/CS675 (c) 2016 P. Poupart 1
2
CS475/CS675 (c) 2016 P. Poupart
CS475/CS675 (c) 2016 P. Poupart 3
CS475/CS675 (c) 2016 P. Poupart 4
5
,
CS475/CS675 (c) 2016 P. Poupart 6
CS475/CS675 (c) 2016 P. Poupart 7
CS475/CS675 (c) 2016 P. Poupart 8
CS475/CS675 (c) 2016 P. Poupart 9
State Reward Action s0 s1 s2 r0 a0 a1 r1 r2 a2 …
CS475/CS675 (c) 2016 P. Poupart 10
(
(
CS475/CS675 (c) 2016 P. Poupart 11
R s ∑ Pr ,
Pr , ∑ Pr ,
Pr , ∑ Pr ,
Pr ,
CS475/CS675 (c) 2016 P. Poupart 12
R s Pr ,
CS475/CS675 (c) 2016 P. Poupart 13
is
which is prohibitive for large state
CS475/CS675 (c) 2016 P. Poupart 14
states
which is exponential in the number of
CS475/CS675 (c) 2016 P. Poupart 15
16
sum to 1
is 1
is factored and is additive
CS475/CS675 (c) 2016 P. Poupart 17