EVENT-DRIVEN AND DATA-DRIVEN CONTROL AND OPTIMIZATION IN - - PowerPoint PPT Presentation

event driven and
SMART_READER_LITE
LIVE PREVIEW

EVENT-DRIVEN AND DATA-DRIVEN CONTROL AND OPTIMIZATION IN - - PowerPoint PPT Presentation

EVENT-DRIVEN AND DATA-DRIVEN CONTROL AND OPTIMIZATION IN CYBER-PHYSICAL SYSTEMS C. G. Cassandras Division of Systems Engineering Dept. of Electrical and Computer Engineering Center for Information and Systems Engineering Boston University


slide-1
SLIDE 1
  • C. G. Cassandras

Division of Systems Engineering

  • Dept. of Electrical and Computer Engineering

Center for Information and Systems Engineering Boston University

https://christosgcassandras.org

Christos G. Cassandras CODES Lab. - Boston University

EVENT-DRIVEN AND DATA-DRIVEN CONTROL AND OPTIMIZATION IN CYBER-PHYSICAL SYSTEMS

slide-2
SLIDE 2

TIME-DRIVEN v EVENT-DRIVEN CONTROL Christos G. Cassandras

CODES Lab. - Boston University

REFERENCE

PLANT CONTROLLER

INPUT

  • +

SENSORS

MEASURED OUTPUT OUTPUT ERROR REFERENCE

PLANT CONTROLLER

INPUT

  • +

SENSORS

MEASURED OUTPUT OUTPUT ERROR

EVENT:

g(STATE) ≤ 0 EVENT-DRIVEN CONTROL: Act only when needed (or on TIMEOUT) - not based on a clock

slide-3
SLIDE 3

CYBER-PHYSICAL SYSTEMS Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

INTERNET

CYBER PHYSICAL

Data collection: relatively easy… Control: a challenge… TIME-DRIVEN EVENT-DRIVEN

slide-4
SLIDE 4

OUTLINE Christos G. Cassandras

CODES Lab. - Boston University

  • Why EVENT-DRIVEN Control and Optimization ?
  • EVENT-DRIVEN Control in Distributed Multi-Agent Systems
  • A General Optimization Framework for Multi-Agent Systems
  • EVENT-DRIVEN + DATA-DRIVEN Control and Optimization:

the IPA (Infinitesimal Perturbation Analysis) Calculus

slide-5
SLIDE 5

REASONS FOR EVENT-DRIVEN MODELS, CONTROL, OPTIMIZATION Christos G. Cassandras

CODES Lab. - Boston University

  • Many systems are naturally Discrete Event Systems (DES)

(e.g., Internet) → all state transitions are event-driven

  • Most of the rest are Hybrid Systems (HS)

→ some state transitions are event-driven

  • Many systems are distributed

→ components interact asynchronously (through events)

  • Time-driven sampling inherently inefficient (“open loop” sampling)
slide-6
SLIDE 6

REASONS FOR EVENT-DRIVEN MODELS, CONTROL, OPTIMIZATION Christos G. Cassandras

CODES Lab. - Boston University

  • Many systems are stochastic

→ actions needed in response to random events

  • Event-driven methods provide significant advantages in

computation and estimation quality

  • System performance is often more sensitive to event-driven

components than to time-driven components

  • Many systems are wirelessly networked → energy constrained

→ time-driven communication consumes significant energy UNNECESSARILY!

slide-7
SLIDE 7

x + y x y x y

TIME

Time-driven (synchronous) implementation:

  • Sum repeatedly evaluated unnecessarily
  • When evaluation is actually needed, it is done at the wrong times !

TIME

t1 t2

TIME-DRIVEN (SYNCHRONOUS) v EVENT-DRIVEN (ASYNCHRONOUS) COMPUTATION Christos G. Cassandras

CODES Lab. - Boston University

slide-8
SLIDE 8

SELECTED REFERENCES - EVENT-DRIVEN

CONTROL, COMMUNICATION, ESTIMATION, OPTIMIZATION

Christos G. Cassandras

CODES Lab. - Boston University

  • Astrom, K.J., and B. M. Bernhardsson, “Comparison of Riemann and Lebesgue sampling for

first order stochastic systems,” Proc. 41st Conf. Decision and Control, pp. 2011–2016, 2002.

  • T. Shima, S. Rasmussen, and P. Chandler, “UAV Team Decision and Control using Efficient

Collaborative Estimation,” ASME J. of Dynamic Systems, Measurement, and Control, vol. 129,

  • no. 5, pp. 609–619, 2007.
  • Heemels, W. P. M. H., J. H. Sandee, and P. P. J. van den Bosch, “Analysis of event-driven

controllers for linear systems,” Intl. J. Control, 81, pp. 571–590, 2008.

  • P. Tabuada, “Event-triggered real-time scheduling of stabilizing control tasks,” IEEE Trans.
  • Autom. Control, vol. 52, pp. 1680–1685, 2007.
  • J. H. Sandee, W. P. M. H. Heemels, S. B. F. Hulsenboom, and P. P. J. van den Bosch, “Analysis

and experimental validation of a sensor-based event-driven controller,” Proc. American Control Conf., pp. 2867–2874, 2007.

  • J. Lunze and D. Lehmann, “A state-feedback approach to event-based control,” Automatica, 46,
  • pp. 211–215, 2010.
  • P. Wan and M. D. Lemmon, “Event triggered distributed optimization in sensor networks,”
  • Proc. of 8th ACM/IEEE Intl. Conf. on Information Processing in Sensor Networks, 2009.
  • Zhong, M., and Cassandras, C.G., “Asynchronous Distributed Optimization with Event-Driven

Communication”, IEEE Trans. on Automatic Control, AC-55, 12, pp. 2735-2750, 2010.

slide-9
SLIDE 9

EVENT-DRIVEN DISTRIBUTED OPTIMIZATION

slide-10
SLIDE 10

DISTRIBUTED COOPERATIVE OPTIMIZATION Christos G. Cassandras

CODES Lab. - Boston University

i N s s

s t s s s H

N

each

  • n

s constraint . . ) , , ( min

1 , ,

1

1 1

  • n

s constraint . . ) , , ( min

1

s t s s s H

N s

N N s

s t s s s H

N

  • n

s constraint . . ) , , ( min

1 

N system components

(processors, agents, vehicles, nodes),

  • ne common objective:
slide-11
SLIDE 11

DISTRIBUTED COOPERATIVE OPTIMIZATION Christos G. Cassandras

CODES Lab. - Boston University

i Controllable state si, i = 1,…,ni

)) ( ( ) ( ) 1 ( k d k s k s

i i i i

s α + = +

Step Size Update Direction, usually

)) ( ( )) ( ( k H k d

i i

s s −∇ =

i N s

s t s s s H

i

  • n

s constraint . . ) , , ( min

1 

i requires knowledge of all s1,…,sN Inter-node communication

slide-12
SLIDE 12

SYNCHRONIZED (TIME-DRIVEN) COOPERATION Christos G. Cassandras

CODES Lab. - Boston University

1 2 3

COMMUNICATE + UPDATE

Drawbacks:

  • Excessive communication (critical in wireless settings!)
  • Faster nodes have to wait for slower ones
  • Clock synchronization infeasible
  • Bandwidth limitations
  • Security risks
slide-13
SLIDE 13

ASYNCHRONOUS COOPERATION Christos G. Cassandras

CODES Lab. - Boston University

1 2 3

  • Nodes not synchronized, delayed information used

Bertsekas and Tsitsiklis, 1997

Update frequency for each node is bounded + technical conditions ⇒

)) ( ( ) ( ) 1 ( k d k s k s

i i i i

s α + = +

converges

slide-14
SLIDE 14

ASYNCHRONOUS (EVENT-DRIVEN) COOPERATION Christos G. Cassandras

CODES Lab. - Boston University

2 3

UPDATE COMMUNICATE

  • UPDATE at i :

locally determined, arbitrary (possibly periodic)

  • COMMUNICATE from i : only when absolutely necessary

1

slide-15
SLIDE 15

WHEN SHOULD A NODE COMMUNICATE? Christos G. Cassandras

CODES Lab. - Boston University

AT ANY TIME t :

  • If node i knows how j estimates its state, then it can evaluate

) (t x j

i

  • Node i uses
  • its own true state, xi(t)
  • the estimate that j uses,

) (t x j

i

… and evaluates an ERROR FUNCTION (

)

) ( ), ( t x t x g

j i i

Error Function examples:

2 1

) ( ) ( , ) ( ) ( t x t x t x t x

j i i j i i

− −

  • : node i state estimated by node j

) (t x j

i

slide-16
SLIDE 16

Christos G. Cassandras

CODES Lab. - Boston University

i i j

) (t xi

δi

WHEN SHOULD A NODE COMMUNICATE?

Node i communicates its state to node j only when it detects that its true state xi(t) deviates from j’ estimate of it so that ) (t x j

i

( )

i j i i

t x t x g δ ≥ ) ( ), (

( )

) ( ), ( t x t x g

j i i

Compare ERROR FUNCTION to THRESHOLD δi

⇒ Event-Driven Control

slide-17
SLIDE 17

Christos G. Cassandras

CODES Lab. - Boston University

CONVERGENCE Asynchronous distributed state update process at each i:

)) ( ( ) ( ) 1 ( k d k s k s

i i i i

s ⋅ + = + α

Estimates of other nodes, evaluated by node i

THEOREM: Under certain conditions, there exist positive constants α and Kδ such that

)) ( ( lim = ∇

∞ →

k H

k

s

INTERPRETATION: Event-driven cooperation achievable with minimal communication requirements ⇒ energy savings

Zhong and Cassandras, IEEE TAC, 2010

   − =

  • therwise

) 1 ( update sends if ) ( ( ) ( k k k d K k

i i i i

δ δ

δ

s

slide-18
SLIDE 18

Christos G. Cassandras

CODES Lab. - Boston University

COONVERGENCE WHEN DELAYS ARE PRESENT

Red curve: Black curve:

ij

τ

t

( )

k

i

δ

( )

j i i x

x g ~ ,

( )

j i i x

x g ,

ij 3

τ

ij 1

τ

ij 2

τ

ij 1

σ

ij 2

σ

ij 3

σ

ij 4

σ

ij 4

τ

( )

j i i x

x g ,

( )

k

i

δ t

ij 3

τ

ij 2

τ

ij 1

τ

ij

τ

Error function trajectory with NO DELAY

DELAY

slide-19
SLIDE 19

Christos G. Cassandras

CODES Lab. - Boston University

COONVERGENCE WHEN DELAYS ARE PRESENT

ASSUMPTION: There exists a non-negative integer D such that if a message is sent before tk-D from node i to node j, it will be received before tk.

INTERPRETATION: at most D state update events can occur between a node sending a message and all destination nodes receiving this message.

Add a boundedness assumption: THEOREM: Under certain conditions, there exist positive constants α and Kδ such that

)) ( ( lim = ∇

∞ →

k H

k

s

NOTE: The requirements on α and Kδ depend on D and they are tighter.

Zhong and Cassandras, IEEE TAC, 2010

slide-20
SLIDE 20

OPTIMAL COVERAGE IN A MAZE Christos G. Cassandras

CODES Lab. - Boston University

slide-21
SLIDE 21

A GENERAL OPTIMIZATION FRAMEWORK FOR MULTI-AGENT SYSTEMS

slide-22
SLIDE 22

= dx x R x P H ) ( ) , ( ) ( max s s

s

N i F si , , 1 ,  = Ω ⊆ ∈

  • R(x): property of point x
  • P(x, s): reward function
  • Oj: obstacle (constraint)

i

a

O1 O2

  • si: agent state, i = 1,…, N

s=[s1, … , sN ]

Christos G. Cassandras

CODES Lab. - Boston University

NETWORKED MULTI-AGENT OPTIMIZATION: PROBLEM 1: PARAMETRIC OPTIMIZATION

Ω GOAL: Find the best state vector s=[s1, … , sN ] so that agents achieve a maximal reward from interacting with the mission space

slide-23
SLIDE 23

∫ ∫

=

T t

dt dx x R t u x P J

) (

) ( ))) ( ( , ( max s

u

N i F t si , , 1 , ) (  = Ω ⊆ ∈

i

a

O1 O2

Christos G. Cassandras

CODES Lab. - Boston University

Ω GOAL: Find the best state trajectories si(t), 0 ≤ t ≤ T so that agents achieve a maximal reward from interacting with the mission space

N i t u s f s

i i i i

, , 1 ), , , (   = =

May also have dynamics

NETWORKED MULTI-AGENT OPTIMIZATION: PROBLEM 2: DYNAMIC OPTIMIZATION

slide-24
SLIDE 24

PROBLEMS THAT FIT THIS FRAMEWORK

slide-25
SLIDE 25

Christos G. Cassandras

CODES Lab. - Boston University

COVERAGE

= dx x R x P H ) ( ) , ( ) ( max s s

s

Event density: Prior estimate of event

  • ccurrence frequency

Joint event detection probability:

[ ]

=

− − =

N i i i

s x p x P

1

) , ( 1 1 ) , ( s

Event sensing probability

5 10 2 4 6 8 10 10 20 30 40 50

R(x) (Hz/ m2)

? ? ? ? ? ? ? ? ? Deploy sensors to maximize “event” detection probability - unknown event locations

slide-26
SLIDE 26

Christos G. Cassandras

CODES Lab. - Boston University

CONSENSUS

− =

i

N j i j i

t s t s t s ) ( ) ( ) ( 

) (

1 t

s

Ω

) (

2 t

s ) (

3 t

s ) (

4 t

s

N

s s = =

1

− = dx x R x P H ) ( ) , ( ) ( max s s

s

=

− =

N i i

s x x R

1

) ( ) ( 1

Only x that matter are agents

=

− =

N i i

s P H

1

) , ( ) ( max s s

s

     > ∈ − =

i i i j i j i

i j N j s s s s p

  • therwise

, ) , (

2

=

i

N j i j i i

s s p s s P ) , ( 2 1 ) , (

slide-27
SLIDE 27

Christos G. Cassandras

CODES Lab. - Boston University

PERSISTENT MONITORING

AGENT MODEL

N i t u s f s

i i i i

, , 1 ), , , (   = =

∫ ∫

=

T t

dt dx x R t u x P J

) (

) ( ))) ( ( , ( max s

u

GOAL: Find the best state trajectories si(t), 0 ≤ t ≤ T so that agents achieve a maximal reward from interacting with the mission space ENVIRONMENT MODEL SENSING MODEL (how agents interact with environment)

slide-28
SLIDE 28

Christos G. Cassandras

CODES Lab. - Boston University

PERSISTENT MONITORING

s(t) x ENVIRONMENT MODEL: Associate to x Uncertainty Function R(x,t)

   − < = =

  • therwise

)) ( , ( ) ( )) ( , ( ) ( , ) , ( if ) , ( t s x Bp x A t s x Bp x A t x R t x R 

slide-29
SLIDE 29

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

THE INTERNET OF CARS…

With traffic lights (non-Cooperative) No traffic lights: decentralized control of CAVs (Cooperative)

One of the worst-designed double intersections ever… (BU Bridge – Commonwealth Ave, Boston)

Zhang et al, Proc. of IEEE, 2018 Malikopoulos, Cassandras, Zhang et al, Automatica, 2018

slide-30
SLIDE 30

Christos G. Cassandras

CODES Lab. - Boston University

RELATED WORK

COVERAGE AND FORMATION CONTROL

Choset 2001, Leonard and Olshevsky 2013, Tron et al 2014, Egerstedt and Hu 2001 Cortes et al 2004 Zhong and Cassandras 2010, Sun and Cassandras 2016

SAMPLING AND TRACKING

Leonard and Zhang 2010, Ashley and Andersson 2016

PERSISTENT MONITORING

Smith et al, 2011, Michael et al, 2011, Lan and Schwager, 2014 Cassandras et al, 2013, Yu et al, 2017

CONSENSUS

Jadbabaie et al, 2003, Olfati-Saber and Murray, 2004, Ren and Beard 2005 Nedich et al, 2010

slide-31
SLIDE 31
  • 1. SCALABILITY
  • 2. AUTONOMY
  • 3. COMMUNICATION
  • 4. NON-CONVEXITY
  • 5. EXLOIT DATA

Christos G. Cassandras

CODES Lab. - Boston University

NETWORKED MULTI-AGENT OPTIMIZATION– CHALLENGES Distributed Algorithms (Decentralization) Global optimality, escape local optima Event-driven (asynchronous) Algorithms Data-Driven Algorithms

slide-32
SLIDE 32

EVENT-DRIVEN + DATA-DRIVEN OPTIMIZATION

slide-33
SLIDE 33

DATA-DRIVEN STOCHASTIC OPTIMIZATION

CONTROL/DECISION (Parameterized by θ) SYSTEM

PERFORMANCE

NOISE

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

)] ( [ θ L E

)] ( [ max θ

θ

L E

Θ ∈

GOAL:

DIFFICULTIES: - E[L(θ)] NOT available in closed form

  • not easy to evaluate
  • may not be a good estimate of

) (θ L ∇ ) (θ L ∇ )] ( [ θ L E ∇

L(θ) GRADIENT ESTIMATOR

) (

1 n n n n

L θ η θ θ ∇ + =

+

x(t) L(θ) ∆ REAL-TIME DATA

slide-34
SLIDE 34

REAL-TIME STOCHASTIC OPTIMIZATION FOR DES: INFINITESIMAL PERTURBATION ANALYSIS (IPA)

CONTROL/DECISION (Parameterized by θ) Discrete Event System (DES)

PERFORMANCE

L(θ) IPA

NOISE ) (

1 n n n n

L θ η θ θ ∇ + =

+

Sample path

For many (but NOT all) DES:

  • Unbiased estimators
  • General distributions
  • Simple on-line implementation

)] ( [ θ L E

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

x(t)

Model

x(t) L(θ) ∆

Ho and Cao, 1991; Glasserman, 1991; Cassandras, 1993; Cassandras and Lafortune, 2008

slide-35
SLIDE 35

REAL-TIME STOCHASTIC OPTIMIZATION: HYBRID SYSTEMS, CYBER-PHYSICAL SYSTEMS

CONTROL/DECISION (Parameterized by θ) HYBRID SYSTEM

PERFORMANCE

L(θ) IPA

) (

1 n n n n

L θ η θ θ ∇ + =

+

A general framework for an IPA theory in Hybrid Systems?

Sample path

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

)] ( [ θ L E

NOISE

x(t) L(θ) ∆

slide-36
SLIDE 36

THE IPA CALCULUS

slide-37
SLIDE 37

HYBRID AUTOMATA STOCHASTIC HYBRID AUTOMATA

) , , ( t x f x

k

= 

Event at time τk(θ) Event at time τk+1(θ) kth discrete state (mode) θ : control parameter, (system design parameter, parameter of an input process,

  • r parameter that characterizes a control policy)

Θ ∈ θ

θ

slide-38
SLIDE 38

IPA: THREE FUNDAMENTAL EQUATIONS

1. Continuity at events: Take d/dθ :

) ( ) (

− + = k k

x x τ τ

k k k k k k k

f f x x ' )] ( ) ( [ ) ( ' ) ( '

1

τ τ τ τ τ

+ − − − +

− + =

If no continuity, use reset condition ⇒

θ δ υ ρ τ d x q q d x

k

) , , , , ( ) ( ' ′ =

+

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

System dynamics over (τk(θ), τk+1(θ)]:

) , , ( t x f x

k

θ =  ( ) ( ) ( )

θ θ τ τ θ θ ∂ ∂ = ′ ∂ ∂ = ′

k k

t x t x , ,

NOTATION:

slide-39
SLIDE 39

IPA: THREE FUNDAMENTAL EQUATIONS

Solve

  • ver (τk(θ), τk+1(θ)]:

θ ∂ ∂ + ∂ ∂ = ) ( ) ( ' ) ( ) ( ' t f t x x t f dt t dx

k k

        ′ + ∫ ∂ ∂ ∫ = ′

+ ∂ ∂ − ∂ ∂ t k du x u f k du x u f

k v k k t k k

x dv e v f e t x

τ

τ θ

τ τ

) ( ) ( ) (

) ( ) (

initial condition from 1 above

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

  • 2. Take d/dθ of system dynamics
  • ver (τk(θ), τk+1(θ)]:

) , , ( t x f x

k

θ = 

θ ∂ ∂ + ∂ ∂ = ) ( ) ( ' ) ( ) ( ' t f t x x t f dt t dx

k k

NOTE: If there are no events (pure time-driven system), IPA reduces to this equation

slide-40
SLIDE 40
  • 3. Get depending on the event type:

IPA: THREE FUNDAMENTAL EQUATIONS

k

τ′

  • Exogenous event: By definition,

= ′

k

τ

) ), , ( ( = θ τ θ

k k x

g

  • Endogenous event: occurs when

      ′ ∂ ∂ + ∂ ∂       ∂ ∂ − = ′

− − −

) ( ) (

1 k k k k

x x g g f x g τ θ τ τ

  • Induced events:

) ( ) (

1 + −

′       ∂ ∂ − = ′

k k k k k

y t y τ τ τ

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

slide-41
SLIDE 41

Ignoring resets and induced events:

IPA: THREE FUNDAMENTAL EQUATIONS

k k k k k k k

f f x x ' )] ( ) ( [ ) ( ' ) ( '

1

τ τ τ τ τ ⋅ − + =

+ − − − +

          + ∂ ∂ = ′

− − −

+ − ∂ ∂ − ∂ ∂

∫ ∫

k k v k k k k k

k du x u f k du x u f k

x dv e v f e x

τ τ

τ θ τ

τ τ τ 1 1 1

) ( ' ) ( ) (

1 ) ( ) (

= ′

k

τ       ′ ∂ ∂ + ∂ ∂       ∂ ∂ − = ′

− − −

) ( ) (

1 k k k k

x x g g f x g τ θ τ τ

  • r

1. 2. 3.

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

2 1 3

) ( '

− k

x τ

( ) ( ) ( )

θ θ τ τ θ θ ∂ ∂ = ′ ∂ ∂ = ′

k k

t x t x ,

Recall:

Cassandras et al, Europ. J. Control, 2010

slide-42
SLIDE 42

IPA PROPERTIES

Back to performance metric:

( ) ∑ ∫

=

+

=

N k k

k k

dt t x L L

1

) , , (

τ τ

θ θ

( ) ( )

θ θ θ ∂ ∂ = ′ t x L t x L

k k

, , , ,

NOTATION: Then:

( ) ∑

= + +

        ′ + ⋅ ′ − ⋅ ′ =

+

N k k k k k k k k

k k

dt t x L L L d dL

1 1

1

) , , ( ) ( ) (

τ τ

θ τ τ τ τ θ θ

What happens at event times What happens between event times

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

slide-43
SLIDE 43

IPA PROPERTIES: ROBUSTNESS

THEOREM 1: If either 1,2 holds, then dL(θ)/dθ depends only on information available at event times τk:

  • 1. L(x,θ,t) is independent of t over [τk(θ), τk+1(θ)] for all k
  • 2. L(x,θ,t) is only a function of x and for all t over [τk(θ), τk+1(θ)]:

= ∂ ∂ = ∂ ∂ = ∂ ∂ θ

k k k

f dt d x f dt d x L dt d

IMPLICATION: - Performance sensitivities can be obtained from information limited to event times, which is easily observed

  • No need to track system in between events !

( ) ∑

= + +

        ′ + ⋅ ′ − ⋅ ′ =

+

N k k k k k k k k

k k

dt t x L L L d dL

1 1

1

) , , ( ) ( ) (

τ τ

θ τ τ τ τ θ θ

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University Yao and Cassandras, J. DEDS, 2011

slide-44
SLIDE 44

IPA PROPERTIES

EVENTS

Evaluating requires full knowledge of w and f values (obvious)

) ; ( θ t x

However, may be independent of w and f values (NOT obvious)

θ θ d t dx ) ; (

It often depends only on: - event times τk

  • possibly

) (

1 − + k

f τ ) ; , , , ( θ t w u x f x = 

τk τk+1

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

slide-45
SLIDE 45

IPA PROPERTIES: SCALABILITY

IPA estimators are EVENT-DRIVEN ⇒ IPA scales with the EVENT SET, not the STATE SPACE ! ⇒ no time discretization needed

Christos G. Cassandras

CISE SE - CODES Lab. - Boston University

As a complex system grows with the addition of more states, the number of EVENTS often remains unchanged or increases at a much lower rate. EXAMPLE: A queueing network may become very large, but the basic events used by IPA are still “arrival” and “departure” at different nodes.

slide-46
SLIDE 46

DECENTRALIZING CAN BE HARD

DECENTRALIZED SOLUTION = CENTRALIZED SOLUTION

(AGENTS ACTING USING ONLY LOCAL INFO.)

(no performance loss due to decentralization)

slide-47
SLIDE 47

Agent Network (time-varying) Agent-Target Interaction Network (time-varying)

Christos G. Cassandras

CODES Lab. - Boston University

PERSISTENT MONITORING WITH KNOWN TARGETS

Hard to decentralize in the presence of time-varying agent-environment interactions

slide-48
SLIDE 48

Christos G. Cassandras

CODES Lab. - Boston University

THREE TYPES OF NEIGHBORHOODS

(conventional)

𝐵5 𝐵2 𝐵3 𝐵1 𝐵4

𝑈2 𝑈

1

𝑈3 𝑈5 𝑈

4

slide-49
SLIDE 49

Christos G. Cassandras

CODES Lab. - Boston University

“ALMOST DECENTRALIZATION” RESULT

  • Show that optimal trajectories consist of hybrid dynamics:

segments defined by observable EVENTS e.g., agent enters target sensing range, agent leaves neighborhood

  • Develop EVENT-DRIVEN gradient-based algorithms using the

Infinitesimal Pertubation Analysis (IPA) calculus: Each agent evaluates its IPA derivative

  • Does an agent’s IPA derivative depend only on LOCAL events?

THEOREM:: Each agent’s IPA derivative depends only on LOCAL events except for one global event

Zhou et al, IEEE TAC 2018

DECENTRALIZATION EVENT OBSERVABILITY

slide-50
SLIDE 50
  • 1. SCALABILITY
  • 2. AUTONOMY
  • 3. COMMUNICATION
  • 4. NON-CONVEXITY
  • 5. EXLOIT DATA

Christos G. Cassandras

CODES Lab. - Boston University

NETWORKED MULTI-AGENT OPTIMIZATION– CHALLENGES Distributed Algorithms (Decentralization) Global optimality, escape local optima Event-driven (asynchronous) Algorithms Data-Driven Algorithms

When are these possible? How to design? Can this be done in a distributed manner? Is convergence guaranteed? How do Event-Driven algorithms perform compared to Time-Driven ones? Solve stochastic optimization problems robust to modeling assumptions