Evolving Societies of Learning Autonomous Systems (ESLAS)
Franz Rammig, Bernd Kleinjohann, Alexander Jungmann University of Paderborn
Evolving Societies of Learning Autonomous Systems (ESLAS) Franz - - PowerPoint PPT Presentation
Organic Computing Status Colloquium / Sept 2010 Evolving Societies of Learning Autonomous Systems (ESLAS) Franz Rammig, Bernd Kleinjohann, Alexander Jungmann University of Paderborn The ESLAS project phase III Goal: Organic coordination
Franz Rammig, Bernd Kleinjohann, Alexander Jungmann University of Paderborn
September 21-22, 2009 DFG 1183 ORGANIC COMPUTING
2
→ dynamic goal priorization based on Multi-SMDPs
dependent on motivation system
→ use behavior-recognition (imitation) to model teammates → use those models to “meta-learn” team strategies (cartesian product of the state spaces) dependent on vicinity
controller controller
input
DEC
decision
BC
behavior construction
LTM
long term memory
EXPL
exploration
ACT
action capabilities
EV
evaluation
EM
episode memory
September 2010 DFG 1183 ORGANIC COMPUTING
3 controller
input
DEC
decision
BC
behavior construction
LTM
long term memory
EXPL
exploration
ACT
action capabilities
EV
evaluation
EM
episode memory
– Intrinsic high-level state of the robot
– Drive examples
– Generation of motivation
dynamic motivation
– i.e. greedy goal selection – Problem: Does not pay attention to the dynamics of the drive state
September 21-22, 2009 DFG 1183 ORGANIC COMPUTING
4
– While pursuing one goal, try to fulfill other goals as well that are „on your way“ – Ex: If battery level is low, but the robot can transport an object to its base while driving to the battery fuel station „with minor detour“, then it shall do it
– Keeps track of all the state spaces for the different drives„ strategies – Maintains a priorization of those dependent on their motivation – Switches between the different strategies at runtime
September 2010 DFG 1183 ORGANIC COMPUTING
5
controller
SMDP SMDP SMDP
input
DEC
decision
BC
behavior construction
LTM
long term memory
EXPL
exploration
ACT
action capabilities
EV
evaluation
EM
episode memory
COORD
goal coordination
September 2010 DFG 1183 ORGANIC COMPUTING
6
September 2010 DFG 1183 ORGANIC COMPUTING
7 Drive a Drive c Drive b
SMDPa SMDPb SMDPc
A 78 65
65
81
65
B 81 65 67
81 67 67 81 78
September 2010 DFG 1183 ORGANIC COMPUTING
8
September 2010 DFG 1183 ORGANIC COMPUTING
9
Priority (higher= lower timely effort) Refined state value
Drive a Drive c Drive b
= 3,4 ma = 2,6 mb = 2,6 mc
100 A 90 81 90 81 73 81 73 66 73 66 59 66 59 53 59 66 59 66 73 66 73 81 73 81 90 81 90 100 B 53 SMDPa SMDPb
p1 p1
September 2010 DFG 1183 ORGANIC COMPUTING
10 10
September 2010 DFG 1183 ORGANIC COMPUTING
11 11
September 2010 DFG 1183 ORGANIC COMPUTING
12 12
trial. 1 2 3 4 5 6 7 8 9 10 150 100 200 39% 27% 39% 39%
EG-PS FIXED EG MV ØΣDegree of „unsatisfaction“
September 2010 DFG 1183 ORGANIC COMPUTING
13 13
1 2 3 4 5 6 7 8 9 10 22% 22% 22% 11%
EG-PS FIXED EG MV 12 10 14 11 13 trial. ØΣDegree of „unsatisfaction“
September 2010 DFG 1183 ORGANIC COMPUTING
14 14
prior to execution
highest expected utility
experience of repeated interactions
teammates‟ brains”
September 21-22, 2009 DFG 1183 ORGANIC COMPUTING
15 15
MRS Taxonomy (Farinelli et al, 2004)
September 2010 DFG 1183 ORGANIC COMPUTING
16 16
September 21-22, 2009 DFG 1183 ORGANIC COMPUTING
17 17
September 2010 DFG 1183 ORGANIC COMPUTING
18 18
September 2010 DFG 1183 ORGANIC COMPUTING
19 19
No predefined communication needed
Change of goal/behavior will be reflected in a change of the teammate model
(possible with minor algorithm changes)
September 21-22, 2009 DFG 1183 ORGANIC COMPUTING
20 20
September 21-22, 2009 DFG 1183 ORGANIC COMPUTING
21 21
Robot„s local video (live) Drag to move robot Live video stream stitched from 8 GigE video cameras
September 2010 DFG 1183 ORGANIC COMPUTING
22 22