- C. G
. G. Cassand . Cassandras as
Division of Systems Engineering and Dept. of Electrical and Computer Engineering and Center for Information and Systems Engineering Boston University
Christos G. Cassandras
CODES Lab. - Boston University
MODELS AND M MODELS AND METHODS ETHODS FOR FOR CYBER-PHY CYBER - - PowerPoint PPT Presentation
DISCRETE DISCRETE EVENT AN EVENT AND HYBRID SY HYBRID SYSTEM STEM MODELS AND M MODELS AND METHODS ETHODS FOR FOR CYBER-PHY CYBER PHYSICAL SICAL SY SYSTEMS STEMS C. G . G. Cassand . Cassandras as Division of Systems Engineering
Division of Systems Engineering and Dept. of Electrical and Computer Engineering and Center for Information and Systems Engineering Boston University
Christos G. Cassandras
CODES Lab. - Boston University
Christos G. Cassandras CODES Lab. - Boston University
Christos G. Cassandras CODES Lab. - Boston University
STATES s1 s2 s3 s4 TIME t
TIME-DRIVEN SYSTEM
STATES TIME t
STATE SPACE:
X
DYNAMICS:
, x f x t
EVENT-DRIVEN SYSTEM
STATE SPACE:
X s s s s
1 2 3 4
, , ,
DYNAMICS:
e x f x , '
t2 e2 x(t) t3 t4 t5 e3 e4 e5 EVENTS x(t) t1 e1
Christos G. Cassandras CODES Lab. - Boston University
Christos G. Cassandras CODES Lab. - Boston University
REFERENCE
PLANT CONTROLLER
INPUT
SENSOR
MEASURED OUTPUT OUTPUT ERROR REFERENCE
PLANT CONTROLLER
INPUT
SENSOR
MEASURED OUTPUT OUTPUT ERROR
EVENT:
g(STATE) ≤ 0 EVENT-DRIVEN CONTROL: Act only when needed (or on TIMEOUT) - not based on a clock
Christos G. Cassandras CODES Lab. - Boston University
(e.g., Internet) → all state transitions are event-driven
→ some state transitions are event-driven
→ components interact asynchronously (through events)
Christos G. Cassandras CODES Lab. - Boston University
→ actions needed in response to random events
computation and estimation quality
components than to time-driven components
→ time-driven communication consumes significant energy UNNECESSARILY!
AUTOMATON: (E, X, G, f, x0) E : Event Set X : State Space
e e
1 2
, ,
x x
1 2
, ,
f x e x , '
G(x) : Set of feasible or enabled events at state x
x X
0
x0 : Initial State,
f X E X :
e x G
f : State Transition Function (undefined for events )
Christos G. Cassandras CODES Lab. - Boston University
Add a Clock Structure V to the automaton: (E, X, G, f, x0 , V) where: and vi is a Clock or Lifetime sequence:
V v
i
i E :
, ,
2 1 i i i
v v v
Need an internal mechanism to determine NEXT EVENT e´ and hence NEXT STATE
x f x e ' , '
Christos G. Cassandras CODES Lab. - Boston University
x x
1 2
, ,
f x e x , ' '
v1 vN
NEXT EVENT
Associate a CLOCK VALUE/RESIDUAL LIFETIME yi with each feasible event
i x G
xX with feasible event set G(x) e that caused transition into x t associated with e
Christos G. Cassandras CODES Lab. - Boston University
i x i
y e
G
min arg '
t t y y y
i x i
' * min
where: *
G
Christos G. Cassandras CODES Lab. - Boston University
Determine new CLOCK VALUES for every event
i x G
i v
e x x i v e i x i x i y y y
ij ij i i
event for lifetime new : where , , * G G G G
Christos G. Cassandras CODES Lab. - Boston University
x x
1 2
, ,
} { min arg ' , ' , '
) ( i x i
y e e x f x
G
V x, , ' y g y
v1 vN
2 1 2 1 d d d a a a
Christos G. Cassandras CODES Lab. - Boston University
1 a d 2 a d a d n a d n+1 a d
t0 x0 = 0
a
e1 = a
x0 = 1 t1
a d
e2 = a
x0 = 2 t2
a d
e3 = a
x0 = 3 t3
a d
x0 = 2
e4 = d
t4
Christos G. Cassandras CODES Lab. - Boston University
which vi is generated
In a simulator, vi is generated through a pseudorandom number generator
Christos G. Cassandras CODES Lab. - Boston University
Christos G. Cassandras CODES Lab. - Boston University
Q: set of discrete states (modes) X: set of continuous states (normally Rn) E: set of events U: set of admissible controls
) , , , , , , , , , , (
0 x
q guard Inv f U E X Q Gh
f : vector field, : discrete state transition function,
X U X Q f : Q E X Q :
Inv: set defining an invariant condition (domain), guard: set defining a guard condition, : reset function, q0: initial discrete state x0: initial continuous state
X Q Inv X Q Q guard X E X Q Q :
Christos G. Cassandras CODES Lab. - Boston University
Guard condition: Subset of X in which a transition from q to q' is enabled, defined through Invariant condition: Subset of X to which x must belong in order (domain) to remain in q. If this condition no longer holds, a transition to some q' must occur, defined through Transition MAY occur Transition MUST occur Reset condition: New value x' at q' when transition occurs from (x,q)
Christos G. Cassandras CODES Lab. - Boston University
Unreliable machine with timeouts
T x 1 1 u x ] [ T
a
2 x
b,
K x
g
' ' x ' ' x
IDLE BUSY DOWN
x(t) : physical state of part in machine (t): clock a : START, b : STOP, g : REPAIR Invariant Guard Reset
Christos G. Cassandras CODES Lab. - Boston University
T x 1 1 u x ] [ T
a
2 x
b,
K x
g
' ' x ' ' x
IDLE BUSY DOWN
Invariant Guard Reset
1 ) ; , ; ( a e e x
1 , 2 ) ; , ; 1 ( b e K x T e x
2 ) ; , ; 2 ( g e e x
www.mathworks.com/products/simevents/
Cassandras, Clune and Mostreman, 2006
Christos G. Cassandras CODES Lab. - Boston University
Timed Automata and Timed Petri Nets – Alur, R., and D.L. Dill, “A Theory of Timed Automata,” Theoretical Computer Science, No. 126, pp. 183-235, 1994. – Cassandras, C.G, and S. Lafortune, “Introduction to Discrete Event Systems,” Springer, 2008. – Wang, J., Timed Petri Nets - Theory and Application, Kluwer Academic Pub- lishers, Boston, 1998. Hybrid Systems – Bemporad, A. and M. Morari, “Control of Systems Integrating Logic Dynamics and Constraints,” Automatica, Vol. 35, No. 3, pp.407-427, 1999. – Branicky, M.S., V.S. Borkar, and S.K. Mitter, “A Unified Framework for Hy- brid Control: Model and Optimal Control Theory,”IEEE Trans. on Automatic Control, Vol. 43, No. 1, pp. 31-45, 1998. – Cassandras, C.G., and J. Lygeros, “Stochastic Hybrid Systems,” Taylor and Francis, 2007. – Grossman, R., A. Nerode, A. Ravn, and H. Rischel, (Eds), Hybrid Systems, Springer, New York, 1993. – Hristu-Varsakelis, D. and W.S. Levine, Handbook of Networked and Embedded Control Systems, Birkhauser, Boston, 2005.
Christos G. Cassandras CODES Lab. - Boston University
dx x R x P H ) ( ) , ( ) ( max s s
s
N i F si , , 1 ,
x
i
a
a1
a2
a3
O1 O2
s=[s1, … , sN ]
Christos G. Cassandras CODES Lab. - Boston University
Ω GOAL: Find the best state vector s=[s1, … , sN ] so that agents achieve a maximal reward from interacting with the mission space
T t
dt dx x R t u x P J
) (
) ( ))) ( ( , ( max s
u
N i F t si , , 1 , ) (
x
i
a
a1
a2
a3
O1 O2
Christos G. Cassandras CODES Lab. - Boston University
Ω GOAL: Find the best state trajectories si(t), 0 ≤ t ≤ T so that agents achieve a maximal reward from interacting with the mission space
N i t u s f s
i i i i
, , 1 ), , , (
May also have dynamics
dx x R x P H ) ( ) , ( ) ( max s s
s
N i F si , , 1 ,
Christos G. Cassandras CODES Lab. - Boston University
N i i i
s x p x P
1
) , ( ˆ 1 1 ) , ( s
) ( ) , ( ) , ( ˆ
i i i i i
s V x s x p s x p
i k a k s s b B
i k i k i i
, , , 1 , 2 :
NEIGHBORHOOD:
i
si sk
) ( i s V x
sj
) (
j
s V x
Christos G. Cassandras CODES Lab. - Boston University
THEOREM: If P(x,s) = P(p1,…,pN) is a function of local reward functions pi, then H(s) can be expressed as: for any i = 1,…,N, where and ), ( ) ( ) (
2 1 i L i
s H H H s s
× × ,si-1,si+1,× × × ,sN]
] , , [
1 a i i
b b i L i
s s s s
State of i and its neighbors only State of all agents except i
1 i L i i
1 1 k i L i k k i k i
Christos G. Cassandras CODES Lab. - Boston University
i i i
Christos G. Cassandras CODES Lab. - Boston University
5 10 2 4 6 8 10 10 20 30 40 50
R(x) (Hz/ m2)
? ? ? ?? ? ? ? ?
N i i i
1
s
Christos G. Cassandras CODES Lab. - Boston University
Event sensing probability
CONTINUED
N i i N
1 1
Christos G. Cassandras CODES Lab. - Boston University
k k k k N k i i i k
, 1
k i k k i k i
1
Desired displacement = V·Dt
Cassandras and Li, EJC, 2005 Zhong and Cassandras, IEEE TAC, 2011
CONTINUED
CONTINUED
Christos G. Cassandras CODES Lab. - Boston University
k k k k N k i i i k
, 1
k i k k i k i
1
Christos G. Cassandras CODES Lab. - Boston University
Christos G. Cassandras CODES Lab. - Boston University
i N s s
N
1 , ,
1
1 1
1
N s
N N s
s t s s s H
N
s constraint . . ) , , ( min
1
(processors, agents, vehicles, nodes),
Christos G. Cassandras CODES Lab. - Boston University
i Controllable state si, i = 1,…,ni
)) ( ( ) ( ) 1 ( k d k s k s
i i i i
s a
Step Size Update Direction, usually
)) ( ( )) ( ( k H k d
i i
s s
i N s
s t s s s H
i
s constraint . . ) , , ( min
1
i requires knowledge of all s1,…,sN Inter-node communication
Christos G. Cassandras CODES Lab. - Boston University
1 2 3
COMMUNICATE + UPDATE
Christos G. Cassandras CODES Lab. - Boston University
1 2 3
Bertsekas and Tsitsiklis, 1997
Update frequency for each node is bounded + technical conditions
)) ( ( ) ( ) 1 ( k d k s k s
i i i i
s a
converges
Christos G. Cassandras CODES Lab. - Boston University
2 3
UPDATE COMMUNICATE
locally determined, arbitrary (possibly periodic)
1
Christos G. Cassandras CODES Lab. - Boston University
Node state at any time t : xi(t) Node state at tk : si(k) si(k) = xi(tk)
j i
tk
) (k
j
Estimate examples: )) ( ( ) ( k x k s
j j i j
Most recent value
)) ( ( ) ( )) ( ( ) ( k x d k t k x k s
j j j i j j k j j i j
a D Linear prediction : node j state estimated by node i ) (k si
j
AT UPDATE TIME tk :
Christos G. Cassandras CODES Lab. - Boston University
AT ANY TIME t :
) (t x j
i
) (t x j
i
… and evaluates an ERROR FUNCTION
) ( ), ( t x t x g
j i i
Error Function examples:
2 1
) ( ) ( , ) ( ) ( t x t x t x t x
j i i j i i
) (t x j
i
Christos G. Cassandras CODES Lab. - Boston University
i i j
) (t xi
i
Node i communicates its state to node j only when it detects that its true state xi(t) deviates from j’ estimate of it so that ) (t x j
i
i j i i
t x t x g ) ( ), (
) ( ), ( t x t x g
j i i
Compare ERROR FUNCTION to THRESHOLD i
Christos G. Cassandras CODES Lab. - Boston University
) 1 ( if ) ( ( ) ( k C k k d K k
i i i i i
s ) ( ( ) (
i i i
d K s
K
Update Direction, usually
)) ( ( )) ( ( k H k d
i i i i
s s
Intuition: near convergence (small ), better estimates are needed
)) ( ( k d
i i s
Christos G. Cassandras CODES Lab. - Boston University
i i i i
Estimates of other nodes, evaluated by node i
) 1 ( update sends if ) ( ( ) ( k k k d K k
i i i i
s
Christos G. Cassandras CODES Lab. - Boston University
CONTINUED
ASSUMPTION 1: There exists a positive integer B such that for all i = 1,…,N and k ≥ 0 at least one of the elements
INTERPRETATION: Each node updates its state at least once during a period in which B state update events take place (no time bound)
ASSUMPTION 2: The objective function H(s), satisfies: (a) H(s) ≥ 0, for all (b) H(·) continuously differentiable and Lipschitz continuous, i.e., there exists K1 such that for all
N i i m
n m
1
, s
m
s ) ( H
m
y x, y x y x
1
) ( ) ( K H H
Christos G. Cassandras CODES Lab. - Boston University
CONTINUED
ASSUMPTION 3: There exist positive constants K2, K3 such that for all i = 1,…,N and
i
C k ) ( )) ( ( (b) / ) ( )) ( ( ) ( (a)
2 3 2
k d k H K K k d k H k d
i i i i i i i
s s
NOTE: Very mild condition, immediately satisfied with K2 = K3 = 1 when we use the usual update direction given by
)) ( ( ) ( k H k d
i i i
s
ASSUMPTION 4: There exists a positive constant K4 such that The ERROR FUNCTION satisfies
)) ( ) ( ( ) ( ) (
4
t x t x g K t x t x
j i i j i i
NOTE: Very mild condition, immediately satisfied with K4 = 1 when we use the common choice
) ( ) ( )) ( ) ( ( t x t x t x t x g
j i i j i i
Christos G. Cassandras CODES Lab. - Boston University
CONTINUED
THEOREM: Under A1-A4, there exist positive constants α and Kδ such that
)) ( ( lim
k H
k
s
Zhong and Cassandras, IEEE TAC, 2010
INTERPRETATION:
reduced communication requirements energy savings
Christos G. Cassandras CODES Lab. - Boston University
CONTINUED
THEOREM: Under A1-A4, there exist positive constants α and Kδ such that
)) ( ( lim
k H
k
s
BYPRODUCT OF PROOF:
smallest possible number of communication events:
3 1 3 1 4
/ 2 2 ) 1 ( 1 K K K K m K B K a a
State dim. ~ network dim. Comm. frequency
Christos G. Cassandras CODES Lab. - Boston University
Red curve: Black curve:
ij
t
k
i
j i i x
x g ~ ,
j i i x
x g ,
ij 3
ij 1
ij 2
ij 1
ij 2
ij 3
ij 4
ij 4
j i i x
x g ,
k
i
t
ij 3
ij 2
ij 1
ij
Error function trajectory with NO DELAY
DELAY
Christos G. Cassandras CODES Lab. - Boston University
ASSUMPTION 5: There exists a non-negative integer D such that if a message is sent before tk-D from node i to node j, it will be received before tk.
INTERPRETATION: at most D state update events can occur between a node sending a message and all destination nodes receiving this message.
Add a boundedness assumption: THEOREM: Under A1-A5, there exist positive constants α and Kδ such that
)) ( ( lim
k H
k
s
NOTE: The requirements on α and Kδ depend on D and they are tighter.
Zhong and Cassandras, IEEE TAC, 2010
SYNCHRONOUS v ASYNCHRONOUS OPTIMAL COVERAGE PERFORMANCE
Christos G. Cassandras CODES Lab. - Boston University
SYNCHRONOUS v ASYNCHRONOUS:
for a deployment problem with obstacles SYNCHRONOUS v ASYNCHRONOUS: Achieving optimality in a problem with obstacles Energy savings + Extended lifetime
Christos G. Cassandras CODES Lab. - Boston University Zhong and Cassandras, 2008
http://www.bu.edu/codes/research/distributed-control/
DEMO: OPTIMAL DISTRIBUTED DEPLOYMENT WITH OBSTACLES – SIMULATED AND REAL
Christos G. Cassandras CODES Lab. - Boston University
Christos G. Cassandras CODES Lab. - Boston University
CONTROL/DECISION (Parameterized by q) SYSTEM
PERFORMANCE
NOISE
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
)] ( [ q L E )] ( [ max q
q
L E
DIFFICULTIES: - E[L(q)] NOT available in closed form
GRADIENT ESTIMATOR
) (
1 n n n n
L q
q
L(q) ∆ REAL-TIME DATA
t u t x c e E
t U t u
)) , ( ), , ( ( max
) , (
q q
MDP:
t u t x c e E
t
)) , ( ), , ( ( max q q
CONTROL/DECISION (Parameterized by q) Discrete Event System (DES)
PERFORMANCE
L(q) IPA
NOISE ) (
1 n n n n
L q
q
For many (but NOT all) DES:
)] ( [ q L E
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
x(t)
Model
x(t) L(q) ∆
CONTROL/DECISION (Parameterized by q) HYBRID SYSTEM
PERFORMANCE
L(q) IPA
) (
1 n n n n
L q
q
Sample path Christos G. Cassandras CISE SE - CODES Lab. - Boston University
)] ( [ q L E
NOISE
x(t) L(q) ∆
Performance metric (objective function):
q
q d d t x t x
k k
,
NOTATION:
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
x L E T x J ), , ( ; ), , ( ; q q q q
q q d T x dJ ), , ( ;
q q
q d dL
n n n n
) (
1
q d dL
, normally
1. Continuity at events: Take d/dq : ) ( ) (
k
x x
k k k k k k
f f x x ' )] ( ) ( [ ) ( ' ) ( '
1
q
x q q d x
k
) , , , , ( ) ( '
CISE SE - CODES Lab. - Boston University
System dynamics over (k(q), k+1(q)]:
) , , ( t x f x
k
q
q
q
k
t x t x , ,
NOTATION:
Solve
q
( ) ( ' ) ( ) ( ' t f t x x t f dt t dx
k k
k du x u f k du x u f
k v k k t k k
x dv e v f e t x
( ) ( ) (
) ( ) (
initial condition from 1 above
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
) , , ( t x f x
k
q
( ) ( ' ) ( ) ( ' t f t x x t f dt t dx
k k
NOTE: If there are no events (pure time-driven system), IPA reduces to this equation
depending on the event type:
k
), , ( (
k k x
g
( ) (
1 k k k k
x x g g f x g
) ( ) (
1
k k k k
y t y
CISE SE - CODES Lab. - Boston University
Ignoring resets and induced events:
k k k k k k k
f f x x ' )] ( ) ( [ ) ( ' ) ( '
1
k du x u f k du x u f
k v k k t k k
x dv e v f e t x
( ' ) ( ) (
) ( ) (
( ) (
1 k k k k
x x g g f x g
1. 2. 3.
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
2 1 3
) ( '
x
q
q
k
t x t x ,
Recall:
Cassandras et al, Europ. J. Control, 2010
Back to performance metric:
k k
k k
dt t x L L
1
) , , (
q
q q
x L t x L
k k
, , , ,
NOTATION: Then:
k k k k k k k k
k k
dt t x L L L d dL
1 1
1
) , , ( ) ( ) (
q What happens at event times What happens between event times
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
THEOREM 1: If either 1,2 holds, then dL(q)/dq depends only on information available at event times k:
k k k
f dt d x f dt d x L dt d
IMPLICATION: - Performance sensitivities can be obtained from information limited to event times, which is easily observed
k k k k k k k k
k k
dt t x L L L d dL
1 1
1
) , , ( ) ( ) (
q
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
EXAMPLE WHERE THEOREM 1 APPLIES (simple tracking problem):
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
k k k k k k k
d du f a x f q q
N k t w u t x a x dt g t x E
k k k k k k T
, , 1 ) ( ) ( ) ( s.t. )] ( ) ( [ min
,
1
L
NOTE: THEOREM 1 provides sufficient conditions only. IPA still depends on info. limited to event times if for “nice” functions uk(qk,t), e.g., bkqt N k t w t u t x a x
k k k k k k
, , 1 ) ( ) , ( ) (
EVENTS
Evaluating requires full knowledge of w and f values (obvious)
) ; ( q t x
However, may be independent of w and f values (NOT obvious)
q q d t dx ) ; (
It often depends only on: - event times k
) (
1
f ) ; , , , ( q t w u x f x
k+1
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
THEOREM 2: Suppose an endogenous event occurs at k with switching function g(x,q). If , then is independent of fk−1. If, in addition, then IMPLICATION: Performance sensitivities are often reset to 0 sample path can be conveniently decomposed
) (
k
f
d dg ) (
x ) (
x
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
IPA scales with the EVENT SET, not the STATE SPACE !
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
As a complex system grows with the addition of more states, the number of EVENTS often remains unchanged or increases at a much lower rate. EXAMPLE: A queueing network may become very large, but the basic events used by IPA are still “arrival” and “departure” at different nodes. IPA estimators are EVENT-DRIVEN
in between events
can be adequate to perform sensitivity analysis and optimization, as long as event times are accurately observed and local system behavior at these event times can also be measured.
Common performance metrics (e.g., workload) satisfy THEOREM 1 In many cases:
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
TOO CLOSE…
too much undesirable detail
TOO FAR…
model not detailed enough
JUST RIGHT…
good model
CREDIT: W.B. Gong
Christos G. Cassandras CISE SE - CODES Lab. - Boston University
Christos G. Cassandras CODES Lab. - Boston University
A basic binary switching control (GREEN – RED) problem with a long history…
Christos G. Cassandras CODES Lab. - Boston University
Use a Hybrid System Model: Stochastic Flow Model (SFM)
SFM DES
[Geng and Cassandras, 2012]
Single Intersection
Vehicle queue
Aggregate states into modes and keep only events causing mode transitions
Christos G. Cassandras CISE - CODES Lab. - Boston University
Traffic light control:
] , , , [
4 3 2 1
q q q q q
GREEN light cycle at queue n = 1,2,3,4 OBLECTIVE: Determine q to minimize total weighted vehicle queues
1
) , ( 1 ) ( min
n T n n T
dt t x w E T J q q
q
1 2 3 4
Christos G. Cassandras CISE - CODES Lab. - Boston University
( 1 ) , ( 1 ) ( min
4 1
q q q
q T n T n n T
L E T dt t x w E T J
through
q q d dJT
q q
q d dL
n T n n n
) (
1
q d dLT
Christos G. Cassandras CISE - CODES Lab. - Boston University
GREEN n n n
n
q
n
GREEN n ( ) ( )
n n n
z t if z t q
( ) ( ) ( )
n n n n n
if z t
z t
q q
Control: GREEN light cycle
Christos G. Cassandras CISE - CODES Lab. - Boston University
( ) ( )
n n n
z t if z t q
( ) ( ) ( )
n n n n n
if z t
z t
q q
( ) ( ) ( )
n n n n n
if z t
t G t
q q
Define: ( ) ( , ) ( ) ( ) ( ) ( ) ( ) ( )
n n n n n n n n
t if G z x t if x t and t t t t
Vehicle departure rate process Vehicle arrival rate process
[RESOURCE DYNAMICS] [USER DYNAMICS]
Christos G. Cassandras CISE - CODES Lab. - Boston University
Event E Non-Empty-Period (NEP) ends endogenous Event R2G RED light switches to GREEN endogenous Event G2R GREEN light switches to RED endogenous Event S Non-Empty-Period (NEP) starts endogenous or exogenous
Christos G. Cassandras CISE - CODES Lab. - Boston University
FOR EXAMPLE: Endogenous event with
) , ( ) ), , ( (
x x g
n k k
q q
) ( ) ( ) (
' , ' , k n k n k i n i k
x
( ) ( ) ( ) ( ) ( ) ( ) (
' , ' , ' ,
n k n k i n k n k n k i n k i n
x x x
RESET to 0 when NEP ends
k
( ) (
1 k k k k
x x g g f x g
k k k k k k
f f x x ' )] ( ) ( [ ) ( ' ) ( '
1
Christos G. Cassandras CISE - CODES Lab. - Boston University
( ) ( ,
, ,
) , (
q
m n m n
dt t x L
n m n
NOTES: - Need only TIMERS, COUNTERS and state derivatives
Christos G. Cassandras CISE - CODES Lab. - Boston University
9-fold cost reduction
Traffic pattern changes
Adaptivity
t
dt dx x R t u x P J
) (
) ( ))) ( ( , ( max s
u
N i F t si , , 1 , ) (
i
a
a1
a2
a3
O1 O2
Christos G. Cassandras CODES Lab. - Boston University
Ω GOAL: Find the best state trajectories si(t), 0 ≤ t ≤ T so that agents achieve a maximal reward from interacting with the mission space
N i t u s f s
i i i i
, , 1 ), , , (
Christos G. Cassandras CODES Lab. - Boston University
Need three model elements:
(how agents interact with environment)
N i t u s f s
i i i i
, , 1 ), , , (
t
dt dx x R t u x P J
) (
) ( ))) ( ( , ( max s
u
GOAL: Find the best state trajectories si(t), 0 ≤ t ≤ T so that agents achieve a maximal reward from interacting with the mission space
Christos G. Cassandras CODES Lab. - Boston University
Start with 1-dimensional mission space = [0,L] AGENT DYNAMICS:
1 ) ( ,
u u s
j j j
1 ) ( , ) (
u bu s g s
j j j j j
Christos G. Cassandras CODES Lab. - Boston University
s(t) x SENSING MODEL: p(x,s) Probability agent at s senses point x
Christos G. Cassandras CODES Lab. - Boston University
s(t) x ENVIRONMENT MODEL: Associate to x Uncertainty Function R(x,t)
x x
)) ( , ( ) ( )) ( , ( ) ( , ) , ( if ) , ( t s x Bp x A t s x Bp x A t x R t x R
Christos G. Cassandras CODES Lab. - Boston University
Partition mission space = [0,L] into M intervals:
1 M
For each interval i = 1,…,M define Uncertainty Function Ri(t):
)) ( ( )) ( ( , ) ( if ) (
i
t BP A t BP A t R t R
i i i i i
s s
j j i i
1
where Pi(s) = joint prob. i is sensed by agents located at s = [s1,…,sN]
j i j j i
Christos G. Cassandras CODES Lab. - Boston University
1 , ,
1
i i u u
N
b t s a t u u s
j j j j
( , 1 ) ( ,
)) ( ( )) ( ( , ) ( if ) (
i
t BP A t BP A t R t R
i i i i i
s s
Agent Network (time-varying) Agent-Target Interaction Network (time-varying)
Christos G. Cassandras CODES Lab. - Boston University
Hard to decentralize a controller that involves time-varying agent-environment interactions
Christos G. Cassandras CODES Lab. - Boston University
(conventional)
𝐵5 𝐵2 𝐵3 𝐵1 𝐵4
𝑈2 𝑈
1
𝑈3 𝑈5 𝑈
4
Christos G. Cassandras CODES Lab. - Boston University
N j x t s x
M j
, , 1 ) (
* 1
Under certain conditions:
] , [ for ) ( and ) (
2 1 * *
t t t t u x t s
j k j
) ( ) (
1 t
s t s
j j
Zhou et al, IEEE CDC, 2016
Christos G. Cassandras CODES Lab. - Boston University
Optimal trajectory is fully characterized by TWO parameter vectors:
j
jS j j
, , 1 ,
1
q q
j w w w
jS j j
, , 1 ,
1
Waiting times at switching points, wjk ≥ 0
k M i i
k k
) , ( ) , ( 1
1
w θ w θ
Under optimal control, this is a HYBRID SYSTEM
Christos G. Cassandras CODES Lab. - Boston University
Type 1: switches in 𝑆𝑗(𝑢) Type 2: switches in agent sensing Type 3: switches in 𝑡
𝑘(𝑢)
Type 4: changes in neighbor sets
Christos G. Cassandras CODES Lab. - Boston University
A simple example: 1 agent 1 target
𝑆𝑗 = 𝐵𝑗 − 𝐶𝑗𝑄𝑗(𝐭(𝑢))
𝑡𝑘 = −1
𝑆𝑗 = 𝐵𝑗 − 𝐶𝑗𝑄𝑗(𝐭(𝑢))
𝑡𝑘 = 1
𝑆𝑗 = 𝐵𝑗 − 𝐶𝑗𝑄𝑗(𝐭(𝑢))
𝑡𝑘 = 0 𝑆𝑗 = 0 𝑡𝑘 = −1 𝑆𝑗 = 0 𝑡𝑘 = 0 𝑆𝑗 = 0 𝑡𝑘 = 1
)) ( ( )) ( ( , ) ( if ) (
i
t BP A t BP A t R t R
i i i i i
s s
𝑆 ↓= 0 𝑆 ↑= 0
1 ) ( ,
u u s
j j j
𝑣 = −1 1
Christos G. Cassandras CODES Lab. - Boston University
i K k i
k k
1 ) , ( ) , (
1
w θ w θ
T i i i
t R t R t R
θ ) ( ) ( ) (
) (t Ri
is obtained using the IPA Calculus
k k k k k k k
f f x x ' )] ( ) ( [ ) ( ' ) ( '
1
k du x u f k du x u f
k v k k t k k
x dv e v f e t x
( ' ) ( ) (
) ( ) (
( ) (
1 k k k k
x x g g f x g
1. 2. 3.
is updated on an EVENT-DRIVEN basis k : kth event time
) (t Ri
Christos G. Cassandras CODES Lab. - Boston University
AGENT Event Set ℰ𝐵 TARGET Event Set ℰ𝑈
Christos G. Cassandras CODES Lab. - Boston University
ℰ𝑘
𝐵: Subset of ℰ𝐵 that contains only events related to agent j
ℰ𝑘
𝑈: Subset of ℰ𝑈 that contains only events related to agent j
Christos G. Cassandras CODES Lab. - Boston University
DECENTRALIZATION: Each agent should be able to evaluate …based only on LOCAL events (i.e., events it can observe)
i K k i
dt t R T J
k k
1 ) , ( ) , (
) ( 1 ) , (
1
w θ w θ
w θ
local events in ℰ𝑘 (t) ?
Christos G. Cassandras CODES Lab. - Boston University
THEOREM: Any centralized solution of the trajectory optimization problem can be recovered through In which each agent j optimizes its trajectory under the following conditions: 1. Initial trajectory parameters
) , (
j j w
θ
Christos G. Cassandras CODES Lab. - Boston University
“Almost decentralized” solution, J* = 37.38 Fully decentralized solution (ignorinig non-local events), J* = 41.66 Red: true state of target 3 Blue: state of target 3 observed by agent 1 when in its neighborhood Green dots: instants when agent 1 receives non-local events
Christos G. Cassandras CODES Lab. - Boston University
Random
Christos G. Cassandras CODES Lab. - Boston University
) (
s i i
s H s
At a local optimum s1
Christos G. Cassandras CODES Lab. - Boston University
Alter Hi(s) to
) ( ˆ
s i i
s H s
) ( ˆ s
i
H
NOTE: Hard to find the proper try altering directly
ˆ Hi(s)
i i
s H
(s
di(x) = x - si
ij i
j z i ij ij ij jx s V i x i i ix i
rdr s r w D n dx x d s x s x w s s H
2 ) ( 1
) ), ( ( sin ) sgn( + ) ( ) ( ) , ( ) (
¶Hi(s) ¶siy = w1(x,si)(x - si)y di(x) dx
V (si )
+ sgn(njy)cosqij Dij w2(r ij(r),si)rdr
zij
jÎ G
i
w1(x,si) =- R(x)F i(x) dpi(x,si) ddi(x)
Christos G. Cassandras CODES Lab. - Boston University
Zhong and Cassandras, Proc. IFAC World Congress 2008
w2(x,si) =- R(x)F i(x)pi(x,si)
F i(x) = [1- ˆ pk(x,sk)]
kÎ Bi
joint probability that a point is not detected by any neighbor of i
w1(x,si)
w1(x,si) w2(x,si)
2 2 2 2 1 1 1 1
i i i i
¶six = w1(x,si)(x - si)x di(x) dx
V (si )
sgn(njx)sinqij Dij w2(r ij(r),si)rdr
zij
jÎ G
i
BOOSTING FUNCTION: Transform the derivative so its value is ≠ 0 and provides a “boost” towards more likely optimum
ˆ w1(x,si)= gi(w1(x,si)) ˆ w2(x,si)=hi(w2(x,si))
Focus on linear forms:
Christos G. Cassandras CODES Lab. - Boston University
Christos G. Cassandras CODES Lab. - Boston University
) , ( ) , ( ˆ ) , ( ) , ( ) , ( ˆ
2 2 1 1 i i i i
s x w s x w s x w x kP s x w
¶ ˆ Hi(s) ¶six = ˆ w1(x,si)(x - si)x di(x) dx
V (si )
sgn(njx)sinqij Dij ˆ w2(r ij(r),si)rdr
zij
jÎ G
i
b1(x,s) = 0 a 2(x,s) =1 b2(x,s) = 0 a1(x,s) =kP(x,s)- g
P(x,s) =1- P i=1
N [1- ˆ
pi(x,si)]
Christos G. Cassandras CODES Lab. - Boston University
b1(x,s) = d
jÎ Bi
x - sj
kj si - x
g
, ( ) , ( ˆ ) , ( ) , ( ˆ
2 2 1 1 i i i j j B j i i
s x w s x w x s k s x s x w s x w
i
Hi(s) ¶six = ˆ w1(x,si)(x - si)x di(x) dx
V (si )
+ sgn(njx)sinqij Dij ˆ w2(r ij(r),si)rdr
zij
jÎ G
i
Christos G. Cassandras CODES Lab. - Boston University
ij i
j z i ij ij ij jx s V i x i i ix i
rdr s r w D n dx x d s x s x w s s H
2 ) ( 1
) ), ( ( ˆ sin ) sgn( + ) ( ) ( ) , ( ˆ ) ( ˆ
b1(x,s) = 0 a 2(x,s) =1 b2(x,s) = 0
i
B k k k i
s x p x )] , ( ˆ 1 [ ) (
Christos G. Cassandras CODES Lab. - Boston University
xmsun@bu.edu Xinmiao Sun CODES LAB 60
Christos G. Cassandras CODES Lab. - Boston University
Christos G. Cassandras CODES Lab. - Boston University
T k F k F T S T f k T f S f k S f
, any for ) ( }) { ( ) ( }) { (
f : 2N → R submodular if
Christos G. Cassandras CODES Lab. - Boston University