CS344M Autonomous Multiagent Systems Patrick MacAlpine Department - - PowerPoint PPT Presentation

▶

Jan 05, 2024 350 likes •650 views

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department or Computer Science The University of Texas at Austin Good Afternoon, Colleagues Are there any questions? Patrick MacAlpine Logistics How to read a research paper Patrick

SLIDE 1

CS344M Autonomous Multiagent Systems

Patrick MacAlpine Department or Computer Science The University of Texas at Austin

SLIDE 2

Good Afternoon, Colleagues

Are there any questions?

Patrick MacAlpine

SLIDE 3

Logistics

How to read a research paper

Patrick MacAlpine

SLIDE 4

Logistics

How to read a research paper

– Some have too few details...

Patrick MacAlpine

SLIDE 5

Logistics

How to read a research paper

– Some have too few details... – Others have too many.

Patrick MacAlpine

SLIDE 6

Logistics

How to read a research paper

– Some have too few details... – Others have too many.

Next week’s readings posted

Patrick MacAlpine

SLIDE 7

Logistics

How to read a research paper

– Some have too few details... – Others have too many.

Next week’s readings posted

Patrick MacAlpine

SLIDE 8

Overview of the Readings

Darwin: genetic programming approach

Patrick MacAlpine

SLIDE 9

Overview of the Readings

Darwin: genetic programming approach Stone and McAllester: Architecture for action selection

Patrick MacAlpine

SLIDE 10

Overview of the Readings

Darwin: genetic programming approach Stone and McAllester: Architecture for action selection Riley et al: Coach competition, extracting models

Patrick MacAlpine

SLIDE 11

Overview of the Readings

Darwin: genetic programming approach Stone and McAllester: Architecture for action selection Riley et al: Coach competition, extracting models Kuhlmann et al: Learning for coaching

Patrick MacAlpine

SLIDE 12

Overview of the Readings

Darwin: genetic programming approach Stone and McAllester: Architecture for action selection Riley et al: Coach competition, extracting models Kuhlmann et al: Learning for coaching Wihthop and Reidmiller: Reinforcement learning

Patrick MacAlpine

SLIDE 13

Overview of the Readings

Darwin: genetic programming approach Stone and McAllester: Architecture for action selection Riley et al: Coach competition, extracting models Kuhlmann et al: Learning for coaching Wihthop and Reidmiller: Reinforcement learning MacAlpine, Price, and Stone: Role assignment

Patrick MacAlpine

SLIDE 14

Overview of the Readings

Darwin: genetic programming approach Stone and McAllester: Architecture for action selection Riley et al: Coach competition, extracting models Kuhlmann et al: Learning for coaching Wihthop and Reidmiller: Reinforcement learning MacAlpine, Price, and Stone: Role assignment MacAlpine, Depinet, and Stone: Overlapping layered learning

Patrick MacAlpine

SLIDE 15

Evolutionary Computation

Motivated by biological evolution: GA, GP

Patrick MacAlpine

SLIDE 16

Evolutionary Computation

Motivated by biological evolution: GA, GP
Search through a space

Patrick MacAlpine

SLIDE 17

Evolutionary Computation

Motivated by biological evolution: GA, GP
Search through a space

− Need a representation, fitness function − Probabilistically apply search operators to set of points in search space

Patrick MacAlpine

SLIDE 18

Evolutionary Computation

Motivated by biological evolution: GA, GP
Search through a space

− Need a representation, fitness function − Probabilistically apply search operators to set of points in search space

Randomized, parallel hill-climbing through space

Patrick MacAlpine

SLIDE 19

Evolutionary Computation

Motivated by biological evolution: GA, GP
Search through a space

− Need a representation, fitness function − Probabilistically apply search operators to set of points in search space

Randomized, parallel hill-climbing through space
Learning is an optimization problem (fitness)

Patrick MacAlpine

SLIDE 20

Evolutionary Computation

Motivated by biological evolution: GA, GP
Search through a space

− Need a representation, fitness function − Probabilistically apply search operators to set of points in search space

Randomized, parallel hill-climbing through space
Learning is an optimization problem (fitness)

Some slides from Machine Learning [Mitchell, 1997]

Patrick MacAlpine

SLIDE 21

Darwin United

More ambitious follow-up to Luke, 97 (made 2nd round)

Patrick MacAlpine

SLIDE 22

Darwin United

More ambitious follow-up to Luke, 97 (made 2nd round)
Motivated in part by Peter’s detailed team construction

Patrick MacAlpine

SLIDE 23

Darwin United

More ambitious follow-up to Luke, 97 (made 2nd round)
Motivated in part by Peter’s detailed team construction
Evolves whole teams — lexicographic fitness function

Patrick MacAlpine

SLIDE 24

Darwin United

More ambitious follow-up to Luke, 97 (made 2nd round)
Motivated in part by Peter’s detailed team construction
Evolves whole teams — lexicographic fitness function
Lots of spinning, but figured out dribbling, offsides

Patrick MacAlpine

SLIDE 25

Darwin United

More ambitious follow-up to Luke, 97 (made 2nd round)
Motivated in part by Peter’s detailed team construction
Evolves whole teams — lexicographic fitness function
Lots of spinning, but figured out dribbling, offsides
1-1-1 record. Tied a good team, but didn’t advance

Patrick MacAlpine

SLIDE 26

Darwin United

More ambitious follow-up to Luke, 97 (made 2nd round)
Motivated in part by Peter’s detailed team construction
Evolves whole teams — lexicographic fitness function
Lots of spinning, but figured out dribbling, offsides
1-1-1 record. Tied a good team, but didn’t advance
Success of the method, but not pursued

Patrick MacAlpine

SLIDE 27

Overlapping Layered Learning

Machine learning paradigms (not algorithms)

Patrick MacAlpine

SLIDE 28

Overlapping Layered Learning

Machine learning paradigms (not algorithms)
Useful for learning complex skills that work well together

Patrick MacAlpine

SLIDE 29

Overlapping Layered Learning

Machine learning paradigms (not algorithms)
Useful for learning complex skills that work well together
(slides)

Patrick MacAlpine