Reinforcement Learning by Narayan Hegde ME, CSA@IISc Google

SLIDE 1

Reinforcement Learning

by Narayan Hegde ME, CSA@IISc Google

SLIDE 2

Summer School 2013

SLIDE 3

SLIDE 4

What is it?

By what mechanisms do we learn to act over time, based on the feedback the environment gives us on our actions?
Learning by trial and error to perform sequential decision making
Markov Decision Process
Trial-and-error search and delayed reward are the two most important distinguishing features
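To make trial-and-error search and delayed reward concrete, here is a minimal sketch in Python. It is not taken from the slides: the five-state chain environment, the `step` function, and all parameter values are hypothetical, and tabular Q-learning is used only as one common illustration of learning in a Markov Decision Process. The agent starts at state 0 and receives a reward only when it finally reaches state 4.

```python
import random

N_STATES = 5          # hypothetical chain: states 0..4, state 4 is the goal
ACTIONS = (0, 1)      # 0 = step left, 1 = step right


def step(state, action):
    """Hypothetical environment: reward arrives only on reaching the goal."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def q_learning(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Trial and error: explore with probability epsilon (or on a tie),
            # otherwise take the action that currently looks best.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.choice(ACTIONS)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action)
            # Delayed reward: bootstrap from the best value of the next state,
            # so the goal reward propagates back to earlier states over time.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


q_table = q_learning()
# Greedy action in each non-terminal state after learning.
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a])
                 for s in range(N_STATES - 1)]
print(greedy_policy)
```

States closer to the goal end up with higher action values than states farther away, which is how a reward received only at the end still shapes the earlier decisions.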

SLIDE 5

Reinforcement Learning

Artificial Intelligence
Machine Learning
Control Theory
Game Theory
Psychology
Operations Research

SLIDE 6

Where do you see this?

• Current and Next-Generation Automated Robots
• Driving Vehicles
• Personal Assistant
• Traffic-light control
• Games like chess
• Why don't we hard-code them? What are the benefits?
• Where can you apply it?
• What is the greatest example of reinforcement learning?

SLIDE 7

The Video of Goal Keeper

SLIDE 8

Take Away

Get Motivated
Model in Real Life
Try Building a Mathematical Model

SLIDE 9

SLIDE 10

SLIDE 11

SLIDE 12

SLIDE 13

SLIDE 14

Example of Robot playing Football

SLIDE 15

SLIDE 16

SLIDE 17

SLIDE 18

SLIDE 19

What is the algorithm paradigm?

SLIDE 20

SLIDE 21

SLIDE 22

SLIDE 23

SLIDE 24

Thank you and Questions

SLIDE 25

SLIDE 26

SLIDE 27

SLIDE 28

SLIDE 29

SLIDE 30

SLIDE 31

SLIDE 32

SLIDE 33

SLIDE 34

SLIDE 35
