Modeling and Co-design of Control Tasks over Wireless Networking - - PowerPoint PPT Presentation

modeling and co design of control tasks over wireless
SMART_READER_LITE
LIVE PREVIEW

Modeling and Co-design of Control Tasks over Wireless Networking - - PowerPoint PPT Presentation

Department of information engineering, computer science and mathematics Center Of Excellence DEWS University of LAquila, Italy Modeling and Co-design of Control Tasks over Wireless Networking Protocols Alessandro DInnocenzo January


slide-1
SLIDE 1

Department of information engineering, computer science and mathematics Center Of Excellence DEWS University of L’Aquila, Italy

​Modeling and Co-design of Control Tasks

  • ver Wireless Networking Protocols

Alessandro D’Innocenzo

January 9, 2017 OptHySYS Workshop, Trento

slide-2
SLIDE 2

Paradigm shift towards wireless control architectures

“Removing cables undoubtedly saves cost, but often the real cost gains lie in the radically different design approach that wireless solutions permit. […] In order to fully benefit from wireless technologies, a rethink of existing automation concepts and the complete design and functionality of an application is required.” Jan-Erik Frey, R&D Manager ABB

slide-3
SLIDE 3

Opportunities vs scientific challenges with Wireless Control Networks

Lower costs, easier installation

  • Suitable for emerging markets

Broadens scope of sensing and control

  • Easier to sense/monitor/actuate: opens new application domains

Compositionality

  • Enables system evolution via composable control loops

Runtime adaptation and reconfiguration

  • Control can be maintained in response to failures and malicious attacks

Complexity

  • Systems designers and programmers need suitable abstract models to hide the

complexity from wireless channels and communication protocols Reliability

  • Need for robust and predictable behavior despite wireless non-idealities

Security

  • Wireless technology is vulnerable: security mechanisms for control loops

Take into account communication protocol dynamics

slide-4
SLIDE 4

Control loop over a P2P wireless network

Session Presentation Transport Network Data/Link Physical

Sensing/actuation

Session Presentation Transport Network Data/Link Physical

Wireless link

))) ((( S1

  • Sensing and actuation data are relayed via the protocol stack layers
  • In classical control theory communication stack and medium are considered as

generic disturbances in the controller design

A1 A2

Robust and Fault-tolerant Control

𝑣" 𝑙 𝑣$ 𝑙 y 𝑙 𝐯 𝐥 = 𝐠(𝐳 𝐥 )

Plant control law (e.g. PID, MPC)

slide-5
SLIDE 5

Control loops over a P2P wireless network

A1 A2 S1

𝐯 𝐥 = 𝐠(𝐳 𝐥 )

Session Presentation Transport Network Data/Link Physical

Sensing/actuation

Session Presentation Transport Network Data/Link Physical

Wireless link

))) (((

Plant control law (e.g. PID, MPC)

  • Sensing and actuation data are relayed via the protocol stack layers
  • In classical control theory communication stack and medium are considered as

generic disturbances in the controller design

  • Several feedback control mechanisms within separate layers

TCP congestion control (e.g. Tahoe, Reno, Cubic) Routing control (e.g. RIP, Priority buff.) Medium access control (e.g. CSMA/CD) Power, coding & modulation control (e.g. UMTS inner loop)

Intra-layer control loops

slide-6
SLIDE 6

Control loops over a mesh wireless network

Wireless network

Borderline between control over network and control of network disappears

slide-7
SLIDE 7

Control loops over a mesh wireless network

Wireless network

Borderline between control over network and control of network disappears

M.C. Escher, Relativity Lithograph, 1953

Different perspectives in terms of

  • Time-scales
  • Mathematical setting
  • Performance metrics
  • Constraints & non-idealities
slide-8
SLIDE 8

A1 A2 S1

𝐯 𝐥 = 𝐠(𝐳 𝐥 )

Session Presentation Transport Network Data/Link Physical

Sensing/actuation

Session Presentation Transport Network Data/Link Physical

Wireless network

))) ((( Design network to meet control performance Control loop

Control-aware networking and communication

Modify network protocols and radio links for better real-time control performance

[Park et al 2011], [Fischione et al 2009], …

Control specification

slide-9
SLIDE 9

A1 A2 S1

𝐯 𝐥 = 𝐠(𝐳 𝐥 )

Session Presentation Transport Network Data/Link Physical

Sensing/actuation

Session Presentation Transport Network Data/Link Physical

Wireless network

))) ((( Network non-idealities Robust controller design Control specification

Network-aware control

Modify control algorithms to cope with communication imperfections

[Seiler&Sengupta 2001], [Jacobsson et al. 2004], [Sinopoli et al 2004], [Elia 2005], [Imer et al 2006], [Braslavsky et al 2007], [Gupta et al 2007], [Hespanha et al 2007], [Schenato et al 2007], [Heemels et al 2011, 2012], [Chiuso et al 2014], …

slide-10
SLIDE 10

A1 A2 S1

𝐯 𝐥 = 𝐠(𝐳 𝐥 )

Session Presentation Transport Network Data/Link Physical

Sensing/actuation

Session Presentation Transport Network Data/Link Physical

Wireless network

))) ((( Controller & network co-design Control specification

Co-design

Joint design of the control algorithm and the network protocol configuration

[Park et al 2011], [Mesquita et al 2012], [Pajic et al 2012], [Antunes&Heemels 2013], [D’Innocenzo et al 2013], …

slide-11
SLIDE 11

Cross-layer adaptation & optimization

A1 A2 S1

𝐯 𝐥 = 𝐠(𝐳 𝐥 )

Session Presentation Transport Network Data/Link Physical

Sensing/actuation

Session Presentation Transport Network Data/Link Physical

Wireless network

))) (((

Desirable signalling between communication layers to improve

  • verall performance
slide-12
SLIDE 12

Cross-layer adaptation & optimization

A1 A2 S1

𝐯 𝐥 = 𝐠(𝐳 𝐥 )

Session Presentation Transport Network Data/Link Physical

Sensing/actuation

Session Presentation Transport Network Data/Link Physical

Wireless network

))) (((

Desirable signalling between communication layers to improve

  • verall performance

Example: exploit plant and network feedback to decide actuation signal and power [D’Innocenzo et al 2012], [Gatsis et al 2013], …

slide-13
SLIDE 13

Cross-layer adaptation & optimization

A1 A2 S1

𝐯 𝐥 = 𝐠(𝐳 𝐥 )

Session Presentation Transport Network Data/Link Physical

Sensing/actuation

Session Presentation Transport Network Data/Link Physical

Wireless network

))) (((

Desirable signalling between communication layers to improve

  • verall performance

Example: exploit plant and network feedback to decide actuation signal and coding [Tatikonda&Mitter 2004], [Nair et al 2007], [Quevedo et al 2010], …

slide-14
SLIDE 14

Cross-layer adaptation & optimization

A1 A2 S1

𝐯 𝐥 = 𝐠(𝐳 𝐥 )

Session Presentation Transport Network Data/Link Physical

Sensing/actuation

Session Presentation Transport Network Data/Link Physical

Wireless network

))) (((

Desirable signalling between communication layers to improve

  • verall performance

Example: exploit plant and network feedback to decide actuation signal and access to channel [Xu&Hespanha 2004], [Cogill et al 2007], [Li&Lemmon 2011], [Tabuada 2007], [Molin&Hirche 2009], [Rabi&Johansson 2009], [Anta&Tabuada 2010], [Donkers et al 2011],…

slide-15
SLIDE 15

Cross-layer adaptation & optimization

A1 A2 S1

𝐯 𝐥 = 𝐠(𝐳 𝐥 )

Session Presentation Transport Network Data/Link Physical

Sensing/actuation

Session Presentation Transport Network Data/Link Physical

Wireless network

))) (((

Desirable signalling between communication layers to improve

  • verall performance

Example: exploit plant and network feedback to decide actuation signal and routing [Mesquita et al 2012], [Jungers et al. 2014], …

slide-16
SLIDE 16

Opportunities vs scientific challenges with Wireless Control Networks

Lower costs, easier installation

  • Suitable for emerging markets

Broadens scope of sensing and control

  • Easier to sense/monitor/actuate: opens new application domains

Compositionality

  • Enables system evolution via composable control loops

Runtime adaptation and reconfiguration

  • Control can be maintained in response to failures and malicious attacks

Complexity

  • Systems designers and programmers need suitable abstract models to hide the

complexity from wireless channels and communication protocols Reliability

  • Need for robust and predictable behavior despite wireless non-idealities

Security

  • Wireless technology is vulnerable: security mechanisms for control loops

Take into account communication protocol dynamics

slide-17
SLIDE 17

Role of communication in cyber-physical security research?

  • Y. Zacchia Lun , A. D’Innocenzo , I. Malavolta and M.D. Di Benedetto. Cyber-Physical Systems

Security: a Systematic Mapping Study. Submitted for publication, preprint on arXiv. A systematic mapping study is a research methodology intended to provide an unbiased,

  • bjective and systematic instrument to identify, classify, and analyze existing research on a

specific research area: cyber-physical systems security in our case.

  • K. Petersen, S. Vakkalanka, and L. Kuzniarz, “Guidelines for conducting systematic mapping

studies in software engineering: An update,” Information and Software Technology, vol. 64,

  • pp. 1–18, 2015
slide-18
SLIDE 18

Overview of the whole review process

slide-19
SLIDE 19

Communication aspects and network-induced imperfections

Surprisingly, 100 out of 118 studies (i.e., 84,75%) do not explicitly consider any communication aspect or imperfection, while only 6 studies (i.e. 5,08%) address more than one aspect. Surprisingly, very few papers (attempt to) provide non-trivial mathematical models

  • f the communication protocol, which indeed is a fundamental actor of almost any
  • CPS. In particular, only in 2 works a specific standard for communication is

explicitly considered in the CPS mathematical model.

6 6 3 3 5 3 2 1 1 2 3 4 5 6 7 Error control coding Transmission Scheduling Routing Time-varying sampling Variable Latency Packet loses and desorder Limited bandwidth Synchronisation errors

slide-20
SLIDE 20

Co-design over time-triggered communication protocols

Challenge: Co-design the control algorithm and the communication protocol (scheduling, routing and control)

Controller

Application Session Presentation Transport Network Data/Link Physical

slide-21
SLIDE 21

21

WirelessHART MAC (scheduling) and Network (routing) layers

§ Time-triggered access to the channel § Time divided in periodic frames § Each frame divided in Π time slots of duration Δ

slide-22
SLIDE 22

22

WirelessHART MAC (scheduling) and Network (routing) layers

§ Time-triggered access to the channel § Time divided in periodic frames § Each frame divided in Π time slots of duration Δ § Enables redundancy in data routing

slide-23
SLIDE 23

23

WirelessHART MAC (scheduling) and Network (routing) layers

§ Time-triggered access to the channel § Time divided in periodic frames § Each frame divided in Π time slots of duration Δ § Enables redundancy in data routing

slide-24
SLIDE 24

24

WirelessHART MAC (scheduling) and Network (routing) layers

§ Time-triggered access to the channel § Time divided in periodic frames § Each frame divided in Π time slots of duration Δ § Enables redundancy in data routing

slide-25
SLIDE 25

25

WirelessHART MAC (scheduling) and Network (routing) layers

§ Time-triggered access to the channel § Time divided in periodic frames § Each frame divided in Π time slots of duration Δ § Enables redundancy in data routing § Scheduling must guarantee relay via multiple paths

slide-26
SLIDE 26

26

WirelessHART MAC (scheduling) and Network (routing) layers

§ Time-triggered access to the channel § Time divided in periodic frames § Each frame divided in Π time slots of duration Δ § Enables redundancy in data routing § Scheduling must guarantee relay via multiple paths

slide-27
SLIDE 27

27

WirelessHART MAC (scheduling) and Network (routing) layers

§ Time-triggered access to the channel § Time divided in periodic frames § Each frame divided in Π time slots of duration Δ § Enables redundancy in data routing § Scheduling must guarantee relay via multiple paths

slide-28
SLIDE 28

28

WirelessHART MAC (scheduling) and Network (routing) layers

§ Time-triggered access to the channel § Time divided in periodic frames § Each frame divided in Π time slots of duration Δ § Enables redundancy in data routing § Scheduling must guarantee relay via multiple paths

How to exploit redundancy optimally w.r.t. control performance metrics?

slide-29
SLIDE 29

Redundancy in data routing…

1. Separation of concerns 2. Co-design

§ …makes system tolerant to long- term link failures § …enables detection and isolation of failures and malicious attacks § …makes system robust to short-term link failures (e.g. packet losses)

slide-30
SLIDE 30

Wireless control networks as switching systems

𝐿(𝑢)

slide-31
SLIDE 31

Wireless control networks as switching systems

𝑢

𝐿(𝑢)

slide-32
SLIDE 32

Wireless control networks as switching systems

t+1

𝐿(𝑢)

slide-33
SLIDE 33

Wireless control networks as switching systems

t+…

𝐿(𝑢)

Different paths are associated with different delays.

slide-34
SLIDE 34

Wireless control networks as switching systems

t+…

𝐿(𝑢)

𝐵 = 𝐵; 𝐶; 𝐽 ⋮ ⋮ ⋮ ⋯ ⋯ ⋱ ⋮ ⋮ ⋯ 𝐽 ⋯ 𝐽 ⋯ 𝐶 𝜏 𝑢 = 𝐶𝜀E F ,H 𝐽𝜀E F ," ⋮ 𝐽𝜀E F ,IJ$ 𝐽𝜀E F ,IJ" 𝐽𝜀E F ,I Different paths are associated with different delays. Mathematical model: 𝑦 𝑢 + 1 = 𝐵𝑦 𝑢 + 𝐶 𝜏 𝑢 𝑤 𝑢 , 𝑢 ∈ ℕ, where 𝑦 𝑢 is the plant and network state, 𝜏 𝑢 ∈ Σ depends on routing/scheduling. The switching signal is considered as a disturbance.

slide-35
SLIDE 35

Wireless control networks as switching systems

t+…

Problem: Design a controller 𝐿(𝑢) s.t. the closed loop system is asymptotically stable. Given a state-feedback controller 𝐿(𝑢), the closed loop systems is asymptotically stable iff the Joint Spectral Radius of 𝐵 + 𝐶 𝜏 𝑢 𝐿 𝑢

E F ∈R is smaller than 1.

Insights: Switching systems analysis and design is a crowded research area:

  • Leverage special structure of matrices 𝐵 and 𝐶 𝜏 𝑢

to provide tailored results that

  • utperform classical results on general switching systems
  • Decidability results on controllability and stabilizability depend on knowledge of 𝜏 𝑢

𝐿(𝑢)

Different paths are associated with different delays. Mathematical model: 𝑦 𝑢 + 1 = 𝐵𝑦 𝑢 + 𝐶 𝜏 𝑢 𝑤 𝑢 , 𝑢 ∈ ℕ, where 𝑦 𝑢 is the plant and network state, 𝜏 𝑢 ∈ Σ depends on routing/scheduling. The switching signal is considered as a disturbance.

slide-36
SLIDE 36

Wireless control networks as switching systems

  • Definition of controllability: A system 𝑦 𝑢 + 1 = 𝐵𝑦 𝑢 + 𝐶 𝜏 𝑢

𝑤 𝑢 is controllable

if for any initial state 𝑦H, any arbitrary final state 𝑦S and any switching signal 𝜏 𝑢 , there exists a finite time 𝑈 and a control sequence 𝑤 𝑢, 𝑦 𝑢 − 𝐸, … , 𝑢 , 𝜏 𝑢 − 𝐸, … , 𝑢 + 𝑂 , 𝑢 = 0, … , 𝑈 − 1, such that 𝑦 𝑈 = 𝑦S.

  • Definition of stabilizability: A system 𝑦 𝑢 + 1 = 𝐵𝑦 𝑢 + 𝐶 𝜏 𝑢

𝑤 𝑢 is stabilizable if

for any initial state 𝑦H and any switching signal 𝜏 𝑢 , there exists a control sequence 𝑤 𝑢, 𝑦 𝑢 − 𝐸, … , 𝑢 , 𝜏 𝑢 − 𝐸, … , 𝑢 + 𝑂 , 𝑢 ≥ 0, such that lim

F→\ 𝑦(𝑢) = 0.

𝐿(𝑢)

slide-37
SLIDE 37

Wireless control networks as switching systems

𝐿(𝑢)

Knowledge of the (routing) switching signal 𝜏(𝑢):

  • No knowledge of routing signal, i.e. we cannot measure 𝝉 𝒖 : then 𝑳 𝒖 = 𝑳, ∀𝒖 ∈ ℕ

– Example: Existence of non-linear controllers that outperform any linear controller – Approximation methods for the design of 𝐿 [Cicone et al., CDC’15]

  • We can measure and keep memory of 𝝉 𝒖 : then 𝑳 𝒖 = 𝑳 𝝉 𝒖 − 𝒆 : 𝝉 𝒖

– Theorem: Characterize pathologic switching sequences that invalidate stabilizability – Configure network nodes to avoid such sequences

slide-38
SLIDE 38

Wireless control networks as switching systems

We can measure and keep memory of 𝝉 𝒖 and have a finite horizon knowledge of future 𝑶 switching signals: then 𝑳 𝒖 = 𝑳 𝝉 𝒖 − 𝒆 : 𝝉 𝒖 + 𝑶

  • Theorem: [Jungers et al., IEEE-TAC’16] Controllability is decidable and can be checked by

verifying that for all switching signals of lenght ≤ 𝑜 + 2 𝐸 2 𝐸 a rank condition is satisfied, with 𝑜 plant state-space dimension, 𝐸 delays set cardinality. Moreover:

– If 𝑂 ≥ 𝑜 + 2 𝐸 2 𝐸 stabilizability is equivalent to controllability – If 𝑂 < 𝑜 + 2 𝐸 2 𝐸 controllability implies stabilizability but not vice-versa (by counter-example)

  • Configure network nodes to design routing policies for appropriate time-windows

𝐿(𝑢)

slide-39
SLIDE 39

References

  • R. M. Jungers, A. D'Innocenzo, M. D. Di Benedetto. Controllability of Linear Systems With Switching
  • Delays. IEEE Transactions on Automatic Control, 61(4):1117-1122, 2016.
  • A. Cicone, A. D'Innocenzo, N. Guglielmi, L. Laglia. A sub-optimal solution for optimal control of linear

systems with unmeasurable switching delays. 54th IEEE Conference on Decision and Control, Osaka, Japan, December 15-18, 2015.

  • R. M. Jungers, A. D'Innocenzo, M. D. Di Benedetto. Further results on controllability of linear systems

with switching delays. 9th IFAC World Congress, Cape Town, South Africa, August 24-29, 2014.

  • R. M. Jungers, A. D'Innocenzo, M. D. Di Benedetto. How to control Linear Systems with switching delays.

13th European Control Conference (ECC14), Strasbourg, France, June 24-27, 2014. R.M. Jungers, A. D'Innocenzo, M.D. Di Benedetto. Feedback stabilization of dynamical systems with switched delays. 51st IEEE Conference on Decision and Control, Maui, Hawaii, December 10-13 2012. Jungers, Heemels et Al.: extension to deterministic packet losses

𝐿(𝑢)

slide-40
SLIDE 40

Redundancy in data routing…

1. Separation of concerns 2. Co-design

§ …makes system tolerant to long- term link failures § …enables detection and isolation of failures and malicious attacks § …makes system robust to short-term link failures (e.g. packet losses)

slide-41
SLIDE 41

Considerations on packet dropouts

In the theoretical control literature, the most common model is

  • the Bernoulli loss process, which assumes that packet dropouts} occur

independently with a fixed probability pc,

  • that is a crossover probability of a binary symmetric channel:

In wireless channels, the errors typically come in bursts, due to the movement of the transmitter, receiver or small objects in the environment (Doppler spread) ⇒ time-selective fading. When the fading is slow (compared to a packet interval), the accurate model is given by finite-state Markov channel (FSMC).

slide-42
SLIDE 42

Finite-state Markov channel model

extends the two-state channel (a.k.a Gilbert-Elliot) model by taking into account more channel states. Basically, FSMC is a Markov chain, having a different binary symmetric channel associated to each state: Such model has also been used

  • to approximate the block error process of wireless channels,
  • in performance evaluation of wireless networks,
  • in evaluating cognitive radio, and
  • in evaluating MIMO communication systems.
slide-43
SLIDE 43
  • H. S. Wang and N. Moayeri. Finite-state Markov channel-a useful model for radio

communication channels. IEEE transactions on vehicular technology, 44(1):163–171, 1995.

  • Q. Zhang and S. A. Kassam. Finite-state Markov model for Rayleigh fading channels. IEEE

transactions on communications, 47(11):1688–1692, 1999.

  • P. Sadeghi, R. A. Kennedy, P. B. Rapajic, and R. Shams. Finite-state Markov modeling of

fading channels - a survey of principles and applications. IEEE Signal Processing Magazine, 25(5):57–80, September 2008.

  • C. Komninakis and R.D. Wesel. Joint iterative channel estimation and decoding in flat

correlated Rayleigh fading. IEEE Journal on Selected Areas in Communications, 19(9), 1706–1717, 2001.

Finite-state Markov channel model

slide-44
SLIDE 44

Finite-state Markov channel model

When the channel state information is available (through a channel estimation), FSMC can be portrayed by a Markov chain alone, but of double size (compared to a standard representation): Generally, the transition probability is defined as It is time-homogeneous when and is time-inhomogeneous otherwise.

pij(k) = pij ∀k

slide-45
SLIDE 45

Network paths characteristics are often “at odds”

Motivational example

𝐷 𝑄

Very reliable but slow Very fast but unreliable Optimally exploit the best of each path by co designing controller and routing

slide-46
SLIDE 46
  • Plant dynamics: 𝑦; 𝑙 + 1 = 𝐵;𝑦; 𝑙 +𝐶;𝑤 𝑙

Plant model

𝑒", 𝜏"(𝑙) 𝑒$, 𝜏$(𝑙)

𝑒k, 𝜏k(𝑙) 𝑒l, 𝜏l(𝑙)

slide-47
SLIDE 47
  • Send different actuation data trough the paths of the network
  • Controller can measure via sensors the plant state (state feedback architecure)
  • Controller is aware of the current and past actuation signals (memory)

Controller model

𝑣m 𝑙 = 𝐿m 𝑙 𝑦; 𝑙 , 𝐿 𝑙 = 𝐿"(𝑙) 𝐿$(𝑙) ⋮ 𝐿k(𝑙)

𝑒", 𝜏"(𝑙) 𝑒$, 𝜏$(𝑙)

𝑒k, 𝜏k(𝑙) 𝑒l, 𝜏l(𝑙)

slide-48
SLIDE 48
  • 𝜏m 𝑙 is a Markov chain modeling packet losses at time 𝑙 on path 𝑗
  • 𝜏 𝑙 = 𝜏" 𝑙

… 𝜏k 𝑙 models occurrence of packet losses in the network

Packet losses model

𝑒", 𝜏"(𝑙) 𝑒$, 𝜏$(𝑙)

𝑒k, 𝜏k(𝑙) 𝑒l, 𝜏l(𝑙)

slide-49
SLIDE 49
  • 𝑏m 𝑙 models the choice at time 𝑙 of sending a packet via path 𝑗
  • 𝑏m 𝑙 =p0

¬𝑇𝐹𝑂𝐸 1 𝑇𝐹𝑂𝐸 , 𝑏 𝑙 = 𝑏" 𝑙 … 𝑏k 𝑙

  • 𝑏 𝑙 is a control variable
  • Example: 𝑏 𝑙 = [1, 0, 1, 0, … , 0] means that at time 𝑙 we send actuation data
  • nly on paths 1 and 3.

Routing redundancy model

𝑒", 𝜏"(𝑙) 𝑒$, 𝜏$(𝑙)

𝑒k, 𝜏k(𝑙) 𝑒l, 𝜏l(𝑙)

slide-50
SLIDE 50
  • Actuate sum of received packets…or actuate most recent received packet
  • When the actuator receives no packets, actuate zero…or hold the previous actuation
  • Optimal design of 𝐿 𝑙 , 𝑏(𝑙) allows determining what is the best protocol to apply at

the actuator

Actuator model

𝑤 𝑙 = 𝑣m∗(𝑙 − 𝑒m∗), with 𝑗∗ the index of the path

associated to the smallest delay among received packets

𝑤 𝑙 = w 𝑣m(𝑙 − 𝑒m)

k mx"

𝑒", 𝜏"(𝑙) 𝑒$, 𝜏$(𝑙)

𝑒k, 𝜏k(𝑙) 𝑒l, 𝜏l(𝑙)

slide-51
SLIDE 51

𝑒", 𝜏"(𝑙) 𝑒$, 𝜏$(𝑙)

𝑒k, 𝜏k(𝑙) 𝑒l, 𝜏l(𝑙)

Network model

slide-52
SLIDE 52

MJLS mathematical framework

Assume that the routing policy 𝑏 𝑙 is fixed- separation of concerns paradigm. We leverage discrete-time Markov-jump linear systems (MJLS). A MJLS is a switching linear system where the switching signal is a Markov chain. The transition probability matrix (TPM) 𝑄(𝑙) of the Markov chain 𝜏 𝑙 can be used to model the stochastic process that rules packet losses due to wireless communication. y 𝑦 𝑙 + 1 = 𝐵z { 𝑦 𝑙 + 𝐶z { 𝑣 𝑙 , 𝑦 0 = 𝑦H , 𝜄 0 = 𝜄H

𝑦 𝑙 ∈ ℂ~ is the state vector, u 𝑙 ∈ ℂ€ is the input vector, 𝑙 ∈ ℕH, 𝐵z { ∈ ℂ~×~, 𝐶z { ∈ ℂ~×€ are the state and input matrices associated with mode 𝜄 𝑙

slide-53
SLIDE 53
  • 4-dimensional unstable plant
  • 2 routing paths: 𝑞" = " ƒ

⁄ , 𝑒" = 1; 𝑞$ = 0, 𝑒$ = 5

  • 5k Monte Carlo simulations
  • Compare 3 routing policies:

– LQR using only path 1 for any 𝑙 – LQR using only path 2 for any 𝑙 – LQR using both paths for any 𝑙

Example

slide-54
SLIDE 54

Use only path 1 for any 𝑙

slide-55
SLIDE 55

Use only path 2 for any 𝑙

slide-56
SLIDE 56

Use both paths for any 𝑙

Plant state plot

Averaged cost Via path 1 ∼ 900 Via path 2 ∼ 250 Via paths 1 and 2 ∼ 100

slide-57
SLIDE 57

Extensions of the MJLS mathematical framework

Extension 1: In general wireless channels are time-varying and packet loss probability not easy to compute/measure: consider a MJLS where 𝑄(𝑙) time-varying in a polytope Extension 2: We want to control the routing policy 𝑏(𝑙): consider a MJLS where 𝜄(𝑙) is a Markov-decision process

slide-58
SLIDE 58

Markov-jump linear systems stability

Noiseless autonomous system ( x(k+1) = Aθ(k)x(k), x(0) = x0, θ(0) = θ0 A noiseless autonomous MJLS is mean square stable if for every initial state and for every initial probability distribution of In time-homogeneous case, the MJLS is MSS if and only if a spectral radius

  • f the augmented matrix associated to the second

moment of the state vector is where For example, x0 θ0, lim

k→∞ E[x(k)] = 0

and lim

k→∞ E[x(k)x∗(k)] = 0.

ρ A, x(k), < 1, A , (P T ⊗ In2)diag[ ¯ Ai ⊗ Ai]

slide-59
SLIDE 59

Illustrative example

Consider the MJLS with

  • perational modes, and

A1 =  1 1.2

  • , A2 =

 1.13 0.16 0.48

  • , A3 =

 0.3 0.13 0.16 1.14

  • ,

N =3 P =   0.35 0.65 0.6 0.4 0.4 0.6  =P1 A1 = (P T

1 ⊗ In2)diag[ ¯

Ai ⊗ Ai] ρ(A1)=0.901601

slide-60
SLIDE 60

Illustrative example

slide-61
SLIDE 61

Illustrative example

For the same values of state matrices let us consider another transition probability matrix A1 =  1 1.2

  • , A2 =

 1.13 0.16 0.48

  • , A3 =

 0.3 0.13 0.16 1.14

  • ,

P2: P2 =   0.25 0.75 0.6 0.4 0.4 0.6   A2 = (P T

2 ⊗ In2)diag[ ¯

Ai ⊗ Ai] ρ(A2)=0.905686

slide-62
SLIDE 62

Illustrative example

slide-63
SLIDE 63

Illustrative example

What happens when the transition probability matrix is uncertain and is switching (randomly) between and P1 P2? P =   0.35 0.65 0.6 0.4 0.4 0.6  =P1 P2 =   0.25 0.75 0.6 0.4 0.4 0.6  

slide-64
SLIDE 64

Illustrative example

What happens when the transition probability matrix is uncertain and is switching (randomly) between and P1 P2? P =   0.35 0.65 0.6 0.4 0.4 0.6  =P1 P2 =   0.25 0.75 0.6 0.4 0.4 0.6  

slide-65
SLIDE 65

Polytopic uncertainty model

works on convex sets and is widely used in robust control. A set is convex, when every point on the line segment connecting two elements of a convex set is in as well. Transition probability matrix is time-varying, with variations that are arbitrary within a bounded set of stochastic matrices. The current value of is unknown, but assumed to be a convex combination of the known vertices of a convex polytope: C C P P (k) P(k) =

L

X

l=1

λl(k)Pl, λl(k) ≥ 0,

L

X

l=1

λl(k) = 1.

slide-66
SLIDE 66

Related works on stability

Only sufficient stability conditions have been derived for MJLSs with such model of uncertainty:

  • S. Aberkane et al. "Stochastic stabilization of a class of

nonhomogeneous Markovian jump linear systems": provides a sufficient condition for stochastic stability in terms

  • f linear matrix inequality feasibility problem
  • S. Chitraganti et al. "Mean square stability of non-homogeneous

Markov jump linear systems using interval analysis" presents a sufficient condition for MSS of MJLS with interval transition probability matrix (TPM).

slide-67
SLIDE 67

Joint Spectral Radius (JSR)

introduced by Rota and Strang in 1960, it characterizes

  • the maximal asymptotic growth rate of the norms of long products of

matrices taken in a set;

  • it is subject of intense research due to its role in the study of wavelets,

switching systems, approximation algorithms, etc. Formally, let be a family of complex square matrices Consider the set be a set of all possible products of length whose factors are elements of i.e. For any matrix norm

  • n

consider the supremum among the normalized norms of all products in i.e. The JSR of is defined as M Ml. Πk(M) k M, Πk(M) = {MlkMlk−1 · · · Ml1 | l1, . . . , lk ∈{1, ..., L}}. k·k Cn×n, Πk(M), ˆ ρk(M) , sup

Π∈Πk(M)

kΠk

1

/

k.

M ˆ ρ(M) , lim

k→∞ ˆ

ρk(M).

slide-68
SLIDE 68

Properties and computation of JSR

Joint spectral radius has two important properties: Convergence of matrix products: For any bounded set of matrices all matrix products converge to zero matrix as if and only if Convex hull: The convex hull of a set has the same joint spectral radius as the

  • riginal set, i.e.

M, Π ∈ Πk(M) k → ∞, ˆ ρ(M) < 1 ˆ ρ(conv M) = ˆ ρ(M).

slide-69
SLIDE 69

Mean square stability: necessary and sufficient condition

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. On stability of time-inhomogeneous Markov jump

linear systems. 55th IEEE Conference on Decision and Control, Las Vegas, US, December 12-14, 2016

A discrete-time noiseless MJLS with polytopic uncertainties on TPM is MSS if and only if JSR of is where is a set of all vertices of the polytope associated to the second moment of The proof is based on:

  • the relevant properties of the joint spectral radius.
  • A linear space

made up of all sequences of complex matrices is uniformly homeomorphic to

  • dimensional complex

Euclidean space through the linear mapping which is based

  • n vectorization transformation
  • Kronecker product

has associativity property, and AL <1, L x(k). AL ,{Al}l∈L={1,...,L}, Al ,(P T

l ⊗ In2)diag[ ¯

Ai ⊗ Ai] Hm,n N m × n Nnm CNnm ˆ ϕ(·) ϕ(·). ⊗ ϕ(XY Z)=(ZT⊗X)ϕ(Y )

Theorem

slide-70
SLIDE 70

Computation of the joint spectral radius

The computation and approximation of JSR in notoriously difficult. There exist tools for its computation, such as Vankeerberghen et al., "JSR: A Toolbox to Compute the Joint Spectral Radius”: The tool is implemented in Matlab and is freely downloadable. The JSR of the set of augmented matrices associated to the second moment

  • f
  • f the system provided in the illustrative example is

x(k) ˆ ρ(AL) = [ˆ ρmin(AL), ˆ ρmax(AL)] = [1.024442, 1.031096]

slide-71
SLIDE 71

For MJLSs with polytopic uncertainties on TPMs, unless P = NP, there is no polynomial-time algorithm that decides whether it is mean square stable. The proof is obtained by:

  • Reduction from the matrix semigroup stability problem,

which is well-known to be NP-hard.

NP-hardness of the stability analysis problem

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. On stability of time-inhomogeneous Markov jump

linear systems. 55th IEEE Conference on Decision and Control, Las Vegas, US, December 12-14, 2016

Theorem

slide-72
SLIDE 72

The following assertions are equivalent for MJLSs with polytopic uncertainties

  • n TPMs, from here on denoted by
  • is mean square stable (MSS).
  • is exponentially mean square stable (EMSS), i.e.
  • is stochastically stable (SS), i.e.
  • The proof is based on:
  • radical test for infinite series,
  • equivalence of norms on finite-dimensional linear spaces,
  • Frobenius norm.

(S) : (S) (S) E[kx(k)k2]  βζkkx0k2

2,

β 1, 0 < ζ < 1 (S)

X

k=0

E[kx(k)k2] < 1

Equivalence of stability notions

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. On stability of time-inhomogeneous Markov jump

linear systems. 55th IEEE Conference on Decision and Control, Las Vegas, US, December 12-14, 2016

Theorem

slide-73
SLIDE 73

Autonomous MJLS with bounded noise: (with operational modes)

  • Theorem. Given a discrete-time MJLS with polytopic TPM, then

if and only if for every and . Also this proof is based on:

  • radical test for infinite series,
  • Frobenius norm and equivalence of norms on finite-dimensional linear spaces.

Practical stability of time-inhomogeneous MJLS

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. On robust stability of time-inhomogeneous Markov jump

linear systems. Submitted.

( x(k + 1) = Aθ(k)x(k) + Gθ(k)w(k), x(0) = x0, θ(0) = θ0, (S) (S) ˆ ρ(AL) < 1 x = {x(k); k ∈ N0} ∈ Cn w = {w(k); k ∈ N0} ∈ Cr, x0 ∈ Cn θ0 ∈ Θ0 N

slide-74
SLIDE 74

Controlled noiseless MJLS:

  • Problem. Design an optimal (robust in TPM) mode-dependent state-feedback

control sequence minimizing the quadratic cost

Optimal robust LQR of time-inhomogeneous MJLS

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. Robust LQR of time-inhomogeneous Markov jump linear
  • systems. Submitted.

(S) :      x(k + 1) = Aθ(k)x(k) + Bθ(k)u(k), y(k) = Cθ(k)x(k) + Dθ(k)u(k), x(0) = x0, θ(0) = θ0, x(k)∈Cn is a state vector, u(k)∈Cm is a control vector, θ(k) is an operational mode y(k)∈Cs is a (measured) output of the system u , [u(0), . . . , u(T −1)] J (θ0, x0, u) , max

p

J (θ0, x0, u, p) = max

p T −1

X

k=0

E[ky(k)k2]+E[x∗(T)Xθ(T )(T)x(T)], p , [pθ(0), . . . , pθ(T −1)], pθ(k) is a row of TPM at time step k X(T)=[X1(T), . . . , XN(T)]∈Hn+ is a vector of the terminal state weighs In a game-theoretic formulation, we assume that perturbation-player (environment and/or malicious adversary) has no information on the choice of the controller and vice-versa.

slide-75
SLIDE 75

Solution is state dependent, with state space partitioned in a number of regions.

Denoting by the vertex of the convex polytope of TPM s.t. the is attained at the previous time step for an operational mode, we have that Multiple coupled Riccati difference equations (CRDEs)

Optimal robust LQR of time-inhomogeneous MJLS

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. Robust LQR of time-inhomogeneous Markov jump linear
  • systems. Submitted.

v∈L={1, . . . , L} max

π

J (θ0, x0, u, π) u(k) = Kθ(k)|v(k)x(k), Ki|v(k) = −[D∗

i Di + B∗ i N

X

j=1

pij|vXj|l(j)(k+1)Bi]−1B∗

i N

X

j=1

pij|vXj|l(j)(k+1)Ai, Xi|v(k) = C∗

i Ci + A∗ i N

X

j=1

pij|vXj|l(j)(k+1)Ai + A∗

i N

X

j=1

pij|vXj|l(j)(k+1)BiKi|v(k).

Ω1(k) Ω2(k) !(#)

slide-76
SLIDE 76

Some technical details.

1. Parsimonious set of CRDEs includes only the members which ever achieve (2): if , then also if is a convex combination of other vertices, it is redundant. 2. If , then considering instead of introduces a numerical error of at most . 3. If the MJLS is stabilizable, i.e. s.t. , and is mean square stable, then after a transient, the finite-horizon LQR solution becomes unique.

Optimal robust LQR of time-inhomogeneous MJLS

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. Robust LQR of time-inhomogeneous Markov jump linear
  • systems. Submitted.

Xl1

i|v(k) Xl2 i|v(k) ⌫ 0

x∗(k)Xl1

i|v(k)x(k) ≥ x∗(k)Xl2 i|v(k)x(k) ∀x(k);

Xl2

i|v(k)

Xl1

i|v(k)+✏In ⌫ Xl2 i|v(k)

Xl1

i|v(k)

Xl2

i|v(k)

✏kx(k)k2, 8x(k)2Cn ∃K =[K1, . . . , KN]∈Hn u(k)=Kθ(k)x(k) x(k+1)=(Aθ(k)+Bθ(k)u(k))x(k)

slide-77
SLIDE 77

Example (inverted pendulum controlled through wireless channel):

Inverted pendulum mounted to a motorized cart (discretized, sample time 0.02 s): The relative speed between controller and plant can vary between 1 and 1.5 km/h. The vertices of time-varying TPM of associated Markov chain are

Optimal robust LQR of time-inhomogeneous MJLS

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. Robust LQR of time-inhomogeneous Markov jump linear
  • systems. Submitted.

            0.4954 0.2667 0.2379 0.0681 0.7334 0.1658 0.0327 0.1282 0.7334 0.1384 0.0544 0.9456 0.4954 0.2667 0.2379 0.0681 0.7334 0.1658 0.0327 0.1282 0.7334 0.1384 0.0544 0.9456             ,             0.6589 0.2000 0.1411 0.0709 0.8000 0.1140 0.0152 0.1151 0.8000 0.0849 0.0897 0.9103 0.6589 0.2000 0.1411 0.0709 0.8000 0.1140 0.0152 0.1151 0.8000 0.0849 0.0897 0.9103            

A1 =A2 =     1.000000000000000 0.019961536921409 0.000304142666220 0.000002027774250 0.996155844513813 0.030413280758014 0.000304142666220 −0.000062069931882 1.003653767543144 0.020024353570176 −0.006206791991431 0.365567654428609 1.003653767543144    , B1 =     0.000384630785908 0.038441554861875 0.000620699318817 0.062067919914314    , B2 =        , DT

1 D1 =DT 2 D2 =4 · 10−16, C1 =C2 =

 1 1

  • .
slide-78
SLIDE 78

Example (inverted pendulum controlled through wireless channel):

After 1000 time steps, the MJLS has only few discrete states for some modes:

Optimal robust LQR of time-inhomogeneous MJLS

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. Robust LQR of time-inhomogeneous Markov jump linear
  • systems. Submitted.

State space partitions for each mode, no approx. State space partitions for each mode, approx. with ✏ k 1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8 ✏ 999 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 998 2 1 2 2 2 1 1 1 2 1 2 2 2 1 1 1 997 5 9 8 8 5 6 8 4 2 2 1 1 2 2 1 1 10−8 996 16 16 4 2 16 16 4 2 3 4 2 1 8 8 2 1 10−5 995 139 307 12 2 176 336 12 2 2 3 1 1 4 8 1 1 10−4 994 48 48 5 1 48 48 5 1 3 3 1 1 10 7 2 1 10−4 993 129 179 5 1 180 180 5 1 5 2 1 1 14 3 1 1 10−4 992 148 191 3 2 252 280 3 1 4 2 1 1 11 3 1 1 10−4 991 85 59 3 2 159 160 3 1 5 2 1 1 9 3 1 1 10−4 990 63 49 4 1 161 178 3 1 6 3 1 1 6 3 1 1 10−4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 001 32 32 2 1 32 32 2 2 1 1 1 1 16 7 1 1 10−6 000 32 32 2 1 32 32 2 2 1 1 1 1 16 7 1 1 10−6

slide-79
SLIDE 79

Example (inverted pendulum controlled through wireless channel):

The time-homogeneous MJLS is stabilizable, with the following spectral radii: We observe that these values are much closer to 1. JSR can result to be > 1. We set dynamic approximation values and the approximation threshold, which triggers approximation when the number of CRDEs is > 12000. After 1000 time steps, the MJLS has only few discrete states for some modes.

Optimal robust LQR of time-inhomogeneous MJLS

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. Robust LQR of time-inhomogeneous Markov jump linear
  • systems. Submitted.

ρ(Λ1)=0.904052453, ρ(Λ2)=0.904052372. ✏=[0, 10−6, 10−4]

slide-80
SLIDE 80

Extensions of the MJLS mathematical framework

Extension 1: In general wireless channels are time-varying and packet loss probability not easy to compute/measure: consider a MJLS where 𝑄(𝑙) time-varying in a polytope Extension 2: We want to control the routing policy 𝑏(𝑙): consider a MJLS where 𝜄(𝑙) is a Markov-decision process

slide-81
SLIDE 81

Controlled noiseless Markov jump SWITCHED linear system:

Robust LQR on MJLS with continous and discrete inputs

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. Robust LQR of time-inhomogeneous Markov jump

switched linear systems. Submitted.

(S) :      x(k + 1) = Aθ(k)x(k) + Bθ(k)u(k), y(k) = Cθ(k)x(k) + Dθ(k)u(k), x(0) = x0, θ(0) = θ0, x(k)∈Cn is a state vector, u(k)∈Cm is a control vector, θ(k) is an operational mode y(k)∈Cs is a (measured) output of the system σ(k) is a Markov-decision process,

  • perational mode θ(k) depends on discrete input a(k)

pa

ij(k) , Pr{θ(k+1) = j | a, θ(k) = i} ≥ 0, N

X

j=1

pa

ij(k) = 1 if a 2 A(i),

pa

ij(k) = 0 if a 62 A(i)

slide-82
SLIDE 82

Robust LQR on MJLS with continous and discrete inputs

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. Robust LQR of time-inhomogeneous Markov jump

switched linear systems. Submitted.

  • Problem. Design an optimal (robust in TPM) mode-dependent state-feedback

control policy minimizing the quadratic cost where π?

T ={u? k, a? k}T −1 k=0

J (θ0, x0, πT ) , max

p

J (θ0, x0, πT , p), p,{pa(k)

θ(k)(k)}T −1 k=0

X(T)=[X1(T), . . . , XN(T)]∈Hn+ vector of terminal continuous state weighs g=(g1, . . . , gN)∈RN

+ vector of terminal discrete state weights

J(θ0, x0, πT , p),E[x∗(T)Xθ(T )(T)x(T)+g(θ(T))]+

T− 1

X

k=0

(E[ky(k)k2+g(θ(k), a(θ(k), k))],

slide-83
SLIDE 83

References

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. Robust LQR of time-inhomogeneous Markov jump

switched linear systems. Submitted.

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. Robust LQR of time-inhomogeneous Markov jump linear
  • systems. Submitted.
  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. On robust stability of time-inhomogeneous Markov jump

linear systems. Submitted

  • Y. Zacchia Lun, A. D'Innocenzo, M.D. Di Benedetto. On stability of time-inhomogeneous Markov jump linear
  • systems. 55th IEEE Conference on Decision and Control, Las Vegas, US, December 12-14, 2016.
  • G. D. Di Girolamo, A. D'Innocenzo, M. D. Di Benedetto. Co-design of controller and routing redundancy over

a wireless network. 5th IFAC Workshop on Estimation and Control of Networked Systems (NecSys 2015), Philadelphia PA, September 10-11, 2015

  • F. Smarra, A. D'Innocenzo, M. D. Di Benedetto. Approximation methods for optimal network coding in a

multi-hop control network with packet losses. 14th European Control Conference (ECC 2015), Linz, Austria, July 15-17, 2015

slide-84
SLIDE 84

Experimental setup: TSCH over TelosB HW

slide-85
SLIDE 85

Future work

  • Take into account infinite horizon LQR & Stability
  • Co-design control and transmission power
  • Co-design control and network coding
  • Co-design control and tx time (static PETC)
  • Develop FDI techniques on our model (CPS security)