Getting multi-agent systems to cooperate Luca Schenato University - - PowerPoint PPT Presentation

getting multi agent systems to cooperate
SMART_READER_LITE
LIVE PREVIEW

Getting multi-agent systems to cooperate Luca Schenato University - - PowerPoint PPT Presentation

Getting multi-agent systems to cooperate Luca Schenato University of Padova OptHySys 2017 Outline Motivations and target applications Challenges The consensus algorithm Application of consensus Conclusions and open vistas


slide-1
SLIDE 1

Getting multi-agent systems to cooperate

Luca Schenato University of Padova OptHySys 2017

slide-2
SLIDE 2

Outline

Motivations and target applications Challenges The consensus algorithm Application of consensus Conclusions and open vistas

slide-3
SLIDE 3

Outline

Motivations and target applications Challenges The consensus algorithm Application of consensus Conclusions and open vistas

slide-4
SLIDE 4

Networked Control Systems

Physically distributed dynamical systems interconnected by a communication network

INTELLIGENT TRAFFIC SYSTEMS SWARM ROBOTICS WIRELESS SENSOR NETWORKS SMART CITIES SMART BUILDINGS SMART GRIDS

slide-5
SLIDE 5

Smart Camera Networks

Target applications: MAgIC Lab. at University of Padova

Wireless Sensor Actuator Networks Smart Energy Grids Robotic Networks

slide-6
SLIDE 6

Joint work with

Colleagues at Univ. of Padova Former/current students: International collaborators:

Ruggero Carli Angelo Cenedese Alessandro Chiuso Gianluigi Pillonetto Sandro Zampieri Andrea Carron Marco Todescato Simone Del Favero Damiano Varagnolo

  • Univ. of Lulea, Swedenn

Filippo Zanella Sellf Inc. Saverio Bolognani MIT, USA Sinan Yildirim Ege Univ., Turkey Fabio Fagnani Turin Politech, Italy Lara Brinon-Arranz IST, Portugal Alexandra von Meier

  • UC. Berkeley, USA

Kameshwar Poolla

  • UC. Berkeley, USA

Reza Argandeh CIEE, Berkeley, USA

slide-7
SLIDE 7

Outline

Motivations and target applications Challenges The consensus algorithm Application of consensus Conclusions and open vistas

slide-8
SLIDE 8

Challenges

Unreliable (wireless) communication:

Random delay, packet loss, limited communication range

Scalability:

Complexity (CPU, memory, communication) per agent

must be constant

Robustness:

Mild performance degradation when local failures

Architecture:

Centralized vs hierarchical vs distributed vs decentralized Cooperative vs competitive

slide-9
SLIDE 9

Challenges: a personal experience

Prototyping time

Leader-based/hierarchical

algorithms too complex to code

Debugging time

Few LEDs for visual

inspection

Ex-post analysis of dozens of

agent data logs after a failure

Rapid peer-to-peer

communication

Wi-Fi, bluetooth, zigbee not

suitable for peer-to-peer

Need for simple asynchronous peer-to-peer algorithms

Courtesy of Antonio Franchi, CNRS, Toulouse

slide-10
SLIDE 10

Some working complex systems

INTERNET Cell phones networks

slide-11
SLIDE 11

A leading paradigm: ISO layers with few primitives

Application layer Communication layers

slide-12
SLIDE 12

Multi-agent systems: an ISO-like paradigm ?

What should be the right ISO-model ? Need to

seamlessly integrate:

Communication network(s) Sensing and control Physical constraints (conservation mass/energy) Markets

Smart Power Grids Intelligent transportation

slide-13
SLIDE 13

ISO for multi-agent systems

Point-to-point Broadcast Multi-cast Time-synch Sensor calibration ??? Map building Application layer ??? Communication layer

slide-14
SLIDE 14

Consensus algorithm: a primitive for cooperation

Average consensus Consensus ??? Point-to-point Broadcast Multi-cast Time-synch Sensor calibration ??? Map building Application layer Cooperation layer ??? Communication layer

slide-15
SLIDE 15

Outline

Motivations and target applications Challenges The consensus algorithm Application of consensus Conclusions and open vistas

slide-16
SLIDE 16

The consensus problem

Main idea

Having a set of agents to agree upon a certain value (usually global function) using only local information exchange (local interaction)

Also known as:

Agreement problem (economics, social networks) Load balancing (Computer Science & communications) Synchronization (statistical mechanics) Rendezvous and flocking (robotics)

Old problem: Markov Chains (60’s), Load balancing (70’s), Distributed decision making (80’s), flocking(00’s)

slide-17
SLIDE 17

Multi-agent modeling

Network of

N agents Communication graph: i-th node neighbors: Local variable: node i store

slide-18
SLIDE 18

Recursive Distributed Algorithms

DEFINITION: Recursive Distributed Algorithm consistent with the graph G:

Any recursive algorithm where the i-th node’s update law depends only on the local variables of i and its neighbors

slide-19
SLIDE 19

Consensus definitions

DEFINITION:

A Recursive Distributed Algorithm consistent with the graph G is said to asymptotically achieve consensus if

DEFINITION:

A Recursive Distributed Algorithm consistent with the graph G is said to asymptotically achieve average consensus if

10 20 30 40 2 4 6 8 10 Consensus Iteration xi 10 20 30 40 2 4 6 8 10 Consensus Iteration xi

slide-20
SLIDE 20

A robotics example: the rendezvous problem

1 3 2 4 Receiving node: Other nodes:

slide-21
SLIDE 21

A robotics example: the rendezvous problem

1 3 2 4 Receiving node: Other nodes:

slide-22
SLIDE 22

A robotics example: the rendezvous problem

1 3 2 4 Receiving node:

slide-23
SLIDE 23

The linear consensus algorithm

PROPERTIES OF P(t) (Stochastic Matrix)

Consistent with the graph: Component-wise non-negative: Row-sum unitary: P(t) doubly stochastic if also column-sum unitary:

slide-24
SLIDE 24

Constant matrix P

Synchronous communication:

At each time all nodes communicate according to the communication graph and update their local variables (Laplacian weigths)

slide-25
SLIDE 25

Time varying P(t): broadcast

Broadcast communication:

At each time one node wakes up and broadcasts its information to all its neighbors

slide-26
SLIDE 26

Time varying P(t): symmetric gossip

Symmetric gossip communication:

At each time one node wakes up and choses one of its

  • neighbors. These two nodes exchange their local variables
slide-27
SLIDE 27

Standard Consensus (Broadcast)

Graph rooted on average Self-loops, i.e. P(t) with positive diagonal P(t) row-stochastic

Average Consensus (Gossip)

Graph connected on average Self-loops, i.e. P(t) with positive diagonal P(t) doubly stochastic

Asynchronous consensus: convergence

slide-28
SLIDE 28

Convergence for time-varying communication

union

slide-29
SLIDE 29

Broadcast-based Consensus

Achieves consensus updates per 1 sent message No ACK message required

Gossip-based Consensus

Achieves average consensus 2 updates per (at least) 3 sent messages Non-trivial communication protocol

Asynchronous consensus: communication burden

slide-30
SLIDE 30

Average consensus: the (broadcast) ratio consensus

Standard

Transmitter node Receiver nodes Other nodes:

Ratio

Transmitter node Receiver nodes Other nodes: Row stochastic Column stochastic

slide-31
SLIDE 31

Average consensus: the ratio consensus

Ratio Standard

  • D. Kempe, A. Dobra, and J. Gehrke, 2003
  • M. Alighanbari and J. How, 2008
  • F. Benezit, V. Blondel, P. Thiran, J. Tsitsiklis, M.

Vetterli, 2010

slide-32
SLIDE 32

Realistic scenarios

Ideal scenario Collisions Packet losses

slide-33
SLIDE 33

Packet loss: Broadcast consensus

Standard Ratio

Transmitter node Receiver nodes Other nodes: Transmitter node Receiver nodes Other nodes: Row stochastic Column sub-stochastic

slide-34
SLIDE 34

Packet losses: symmetric gossip consensus

Gossip nodes Other nodes Row stochastic

slide-35
SLIDE 35

Standard Consensus (broadcast)

Guaranteed (slower) convergence

Average Consensus (gossip)

Guaranteed (slower) convergence, but loss of average Under randomized communication:

Ratio Consensus (broadcast)

No convergence

Robust Ratio Consensus (broadcast)

Guaranteed average consensus Additional local variables required

Asynchronous consensus: packet loss and random delay

(Dominguez Garcia-Hadjicostis-Vaidya, 2014) (Fagnani-Zampieri 2009, Frasca-Hendrickx 2013)

slide-36
SLIDE 36

Consensus algorithm: a primitive for multi-agent systems

Average consensus Consensus ??? Point-to-point Broadcast Multi-cast Time-synch Sensor calibration ??? Distributed

  • ptimization

Application layer Cooperation layer ??? Communication layer

Robust asynchronous broadcast-based and relatively simple implementations available

slide-37
SLIDE 37

Outline

Motivations and target applications Challenges The consensus algorithm Application of consensus Conclusions and open vistas

slide-38
SLIDE 38

Smart Camera Networks

Consensus-based applications

Wireless Sensor Actuator Networks Smart Energy Grids Robotic Networks

  • Sensor Calibration
  • RF indoor tracking
  • Clock Synchronization
  • Cardinality estimation
  • Perimeter patrolling
  • Rendez-vous
  • Map building
  • Localization
  • Source-seeking
  • Multi-area state

estimation

slide-39
SLIDE 39

Sensor calibration issues in RF-based localization

Systematic calibration errors

i j k di dj dk

slide-40
SLIDE 40

WSN sensor calibration

Ideally:

Estimate : Use to compensate the

  • ffset:

What we propose is:

All nodes overestimate or underestimate the distance similarly. The errors, in the triangulation process, cancel out partially. Calibrated measurement

slide-41
SLIDE 41

Calibration as consensus problem

update equation Steady state

Define we want Recalling:

slide-42
SLIDE 42

Experimental Testbed

25 Tmote-Sky nodes with Chipcon CC2420 RF transceiver randomly placed inside a single conference room Network topology and nodes displacement: Edge if packet loss probability <25%

2 4 6 8 10 12 1 2 3 4 5 6 7 8 y [m] x [m]

slide-43
SLIDE 43

500 1000 1500 −2 −1 1 2 3 Number of consensus iterations ˆ

  • i [dBm]

Experimental results: Broadcast consensus

Links divided into 2 categories:

Training links (black) Validation links (gray)

2 4 6 8 10 12 1 2 3 4 5 6 7 8 y [m] x [m] −2 −1.5 −1 −0.5 0.5 1 1.5 2

1 2 3 4 5 6 10 20 30 40 50 60 70 80 90 100 |P ij

rx − P ji rx| [dBm]

number of edges

Error distribution

before after

  • scillations

500/25=20

slide-44
SLIDE 44

Estimation from noisy relative measurements

Synchronous implementations:

Barooah 2007

Asynchronous implementations:

  • P. Barooah and J. P. Hespanha, 2005
  • A. Giridhar and P. R. Kumar, 2006
  • N. M. Freris and A. Zouzias, 2012
  • C. Ravazzi, P. Frasca, H. Ishii, and R. Tempo, 2013

Asynchronous implementation robust to packet

losses and random delays

  • M. Todescato, A. Carron, R. Carli, L. Schenato, 2014
slide-45
SLIDE 45

Clock Synchronization in WSN

Node i Node j OFF ON transmission

BASE STATION sensor node

Low Power TDMA communication for battery powered nodes

slide-46
SLIDE 46

Clock Synchronization: cascade consensus

Hardware clocks Virtual reference clock Software clock

Offset Skew/Drift

Goal:

slide-47
SLIDE 47

Clock Synchronization: cascade consensus

i j Drift compensation Offset compensation

  • Solis, Borkar, Kumar, 2006
  • Sommer, Wattenhofer, 2009
  • Fiorentin, Schenato 2011
  • Liao, Barooha 2013
slide-48
SLIDE 48

Clock Synchronization: PI consensus

PI consensus: Cascade consensus:

  • Carli, Chiuso, Schenato, Zampieri 2006
  • Yildirim, Carli, Schenato, 2014

steady state error

Hardware clocks Virtual reference clock Software clock

slide-49
SLIDE 49

Clock Synch in WSN: experiments

FTSP PulseSync GTSP PISync CPU Overhead (ticks)

' 5440 ' 5440 ' 5610 ' 145

Message Length (bytes)

9 9 14 4-9

Main Memory Overhead (bytes)

52 52 64*|N | + 12 16

Flash Memory Requirements (bytes)

18000 17856 22092 15432

PI synch Sommer 2009 Complexity Actual code 20 lines x50 faster x4-20 less RAM memory

slide-50
SLIDE 50

Clock Synch in WSN: video

Courtesy of Sinan Yildirim, Ege University, Turkey

slide-51
SLIDE 51

Map-building in robotic networks

0.2 0.4 0.6 0.8 1 0 0.5 1 −2 2 χ x 0.2 0.4 0.6 0.8 1 0 0.5 1 −2 2 χ

Scenarios

Each robot collects local data Local communication with robot Patrolled area dynamically change

i j

slide-52
SLIDE 52

Map-building as least-squares regression

Model class: Noisy measurements: Goal: minimize sum of

squares of residues

  • Xiao-Boyd-Lall, 2005
  • Bolognani-Del Favero-Schenato-Varagnolo, 2010
slide-53
SLIDE 53

Consensus-based Map-building: gossip communication

i j

x 0.2 0.4 0.6 0.8 1 0 0.5 1 −2 2 χ
slide-54
SLIDE 54

Courtesy of Damiano Varagnolo, University of Lulea, Sweden

Consensus-based map-building: robust broadcast ratio consensus

slide-55
SLIDE 55

Cooperative distributed optimization

Global estimation:

Each node wants a copy of the global minimizer Machine learning, map building, ….

Local estimation:

each node just wants Calibration, localization, ….

: number of agents : state dimension ( ) Agents cooperate to find the minimizer

  • f the network cost:
slide-56
SLIDE 56

On going work: Newton-Raphson Consensus

Distributed optimization very popular research area:

Augmented Lagrangians (ADMM) Sub-gradient methods …

Asynchronous and robust distributed optimization

still very open and practically relevant

Our recent effort in merging Newton-Raphson and

consensus ideas together

slide-57
SLIDE 57

Outline

Motivations and target applications Challenges The consensus algorithm Application of consensus Conclusions and open vistas

slide-58
SLIDE 58

Conclusions

Consensus as a building block for cooperative multi-

agent applications

Effort is in casting general problems as consensus Time-varying higher order consensus is still an open

problem

PI consensus (clock synch) PD consensus (fast consensus, diffusive algorithms) PID (?)

Self-tuning: adaptive tuning of parameters/gains in

distributed algorithms

slide-59
SLIDE 59

Open vistas (1)

Architecture: Multi-agent/complex systems still

an open challenge

Smart Power Grids

slide-60
SLIDE 60

Open vistas (2)

Computation: Asynchronous distributed algorithms

robust to unreliable communication

Cloud computing (new paradigm) Parallel computing (old paradigm)

slide-61
SLIDE 61

Open vistas (3)

Data Tsunami (≠Big data): most data is time-

series.

Time and causality must be treated differently than

usually done in machine learning

Cooperative multi-agent algorithms will be a necessity

slide-62
SLIDE 62

Q&A

URL: http://automatica.dei.unipd.it/people/schenato.html

slide-63
SLIDE 63

References (1)

Consensus:

  • F. Garin, L. Schenato. A survey on distributed estimation and control

applications using linear consensus algorithms. Networked Control

  • Systems. vol. 406,pp. 75-107, 2011

Sensor calibration:

  • S. Bolognani, S. Del Favero, L. Schenato, D. Varagnolo. Consensus-based

distributed sensor calibration and least-square parameter identification in WSNs. International Journal of Robust and Nonlinear Control, vol. 20(2), 2010

  • M. Todescato, A. Carron, R. Carli, L. Schenato. Distributed Localization

from Relative Noisy Measurements: a Robust Gradient Based

  • Approach. IEEE Conference on Decision and Control (CDC14), submitted

Clock synchronization:

  • L. Schenato, F. Fiorentin. Average TimeSynch: a consensus-based

protocol for time synchronization in wireless sensor networks. Automatica, vol. 47(9), pp. 1878-1886, 2011

  • K. Yildirim, R. Carli, L. Schenato. Proportional-Integral Synchronization

In Wireless Sensor Networks. ACM Transactions on Sensor Networks (submitted)

slide-64
SLIDE 64

References (2)

Map Building:

  • A. Carron, M. Todescato, R. Carli, L. Schenato, G. Pillonetto. Multi-agents

adaptive estimation and coverage control using Gaussian

  • regression. IEEE Conference on Decision and Control (CDC14), submitted

Distributed optimization:

  • D. Varagnolo, F. Zanella, A. Cenedese, G. Pillonetto, L. Schenato. Newton-

Raphson Consensus for Distributed Convex Optimization. IEEE Transactions on Automatic Control (submitted)

  • R. Carli, G. Notarstefano, L. Schenato, D. Varagnolo. Asynchronous

Newton-Raphson Consensus for Robust Distributed Convex

  • Optimization. IEEE Conference on Decision and Control (CDC14), submitted