SLIDE 14 8/3/12
5. Polyvalent Uncertainty
In Reinforcement Learning, there is only uncertainty about the relation between S(t) and Y(t).
For Bayesians, there is uncertainty about X(t), about Y(t) given X(t), and even about whether the relation (S(t), X(t)) changes…
[Figure: two diagrams, one with nodes X(t), Y(t), S(t) and one with nodes S(t), Y(t)]
Uncertainty…
Irreducible uncertainty or risk: Decision Maker (DM) knows that the chance of heads on a fair coin is 0.5; DM doesn’t know whether the next toss will be heads or tails. (Concerns the relation between X(t) and Y(t))
Estimation uncertainty or ambiguity: DM is given a new coin and doesn’t know whether it is fair; DM needs to learn the probability of heads. (Concerns how sure one is of X(t))
Unexpected uncertainty or jump risk (or “volatility”): Unknown to DM, the coin is replaced with another (possibly unfair) coin. (Concerns whether X(t) has changed)
Model or “Knightian” uncertainty: Is the coin being replaced regularly or are coin tosses correlated? (Concerns the nature of X(t))
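
The first three kinds of uncertainty in the coin example can be put into numbers with a small sketch. It assumes, purely for illustration, that X(t) is the coin's probability of heads, that Y(t) is the toss outcome, and that the DM keeps a Beta posterior over X(t) starting from a uniform prior. The function names, the switch point, and the two coin biases are hypothetical choices, not taken from the slides. Model uncertainty is not covered here.

```python
import random


def risk(p):
    """Irreducible uncertainty: variance of one toss when P(heads) = p is known."""
    return p * (1.0 - p)


def beta_posterior(heads, tails, a=1.0, b=1.0):
    """Estimation uncertainty: mean and variance of the Beta(a + heads, b + tails)
    posterior over P(heads), with a uniform Beta(1, 1) prior by default."""
    a_n, b_n = a + heads, b + tails
    n = a_n + b_n
    mean = a_n / n
    var = a_n * b_n / (n * n * (n + 1.0))
    return mean, var


def simulate(p_before=0.5, p_after=0.9, switch_at=200, n_tosses=400, seed=0):
    """Unexpected uncertainty: the coin is swapped at `switch_at` (assumed values),
    but the DM keeps updating one stationary posterior as if nothing changed."""
    rng = random.Random(seed)
    heads = tails = 0
    history = []
    for t in range(n_tosses):
        p = p_before if t < switch_at else p_after
        if rng.random() < p:
            heads += 1
        else:
            tails += 1
        history.append(beta_posterior(heads, tails))
    return history


if __name__ == "__main__":
    print("risk of a fair coin:", risk(0.5))
    history = simulate()
    for t in (0, 199, 399):
        mean, var = history[t]
        print(f"toss {t + 1}: posterior mean={mean:.3f}, variance={var:.5f}")
```

Under these assumptions the risk of a fair coin stays at 0.25 however much the DM learns, while the posterior variance shrinks with every toss. The posterior variance keeps shrinking after the swap as well, so a DM who assumes a fixed coin grows more confident exactly when the estimate has become stale. That gap is what the unexpected-uncertainty category is meant to capture.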