

SLIDE 1

Distributed intelligence in multi‐agent systems

Usman Khan
Department of Electrical and Computer Engineering, Tufts University
Workshop on Distributed Optimization, Information Processing, and Learning
Rutgers University, August 21, 2017

SLIDE 2

Who am I

Usman A. Khan

  • Associate Professor, Tufts University
  • Postdoc: U‐Penn
  • Education: PhD, Carnegie Mellon; MS, UW‐Madison; BS, Pakistan

[Map: Tufts, Harvard, and MIT in the Boston area]

SLIDE 3

My Research Lab: Projects and demos

  • Inspecting leaks in NASA’s lunar habitat
  • Aerial formation flying
  • Inference in social networks
  • Structural health monitoring (SHM) over a campus footbridge

[Figure: campus map showing Dowling Hall and Upper Campus North]

SLIDE 4

Trailer


SPARTN—Signal Processing and RoboTic Networks Lab at Tufts

SLIDE 5

My Research Lab: Theory

  • Reza (2011‐15): graph‐theoretic estimation (best paper, journal cover)
  • Xi (2012‐16): optimization over directed graphs (4 TAC papers)
  • Sam (2013‐): fusion in non‐deterministic graphs (2 best papers, 6 IEEE journal papers)
  • Fakhteh (2014‐): distributed estimation, continued
  • Xin (2016‐): optimization, graph theory

SLIDE 6

My Research: In depth

Distributed Intelligence in multi‐agent systems

Estimation, optimization, and control over graphs (networks)

Mobile, dynamic, heterogeneous, directed, autonomous, non‐deterministic

Applications: cyber‐physical systems, IoT, big data, aerial SHM, power grid, personal exposome

Distributed optimization: path planning and formation control

SLIDE 7

Optimization over directed graphs

SLIDE 8

Problem

  • Agents interact over a graph
  • Information flow is directional
  • There is no central node with all the information

SLIDE 9

A nice solution

  • Gradient Descent: no single agent knows the entire function f
  • Local Gradient Descent: each agent converges only to its own local optimum
  • Distributed Gradient Descent [Nedich et al., 2009]: fuse information with neighbors
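In symbols (standard forms, with step‐size αk and weights wij; a sketch, not the slides' exact notation):

```latex
\begin{aligned}
\text{Gradient descent on } f = \textstyle\sum_i f_i:\quad
  & x^{k+1} = x^k - \alpha_k \nabla f(x^k)\\
\text{Local gradient descent at agent } i:\quad
  & x_i^{k+1} = x_i^k - \alpha_k \nabla f_i(x_i^k)\\
\text{Distributed gradient descent (fuse, then step):}\quad
  & x_i^{k+1} = \textstyle\sum_j w_{ij}\, x_j^k - \alpha_k \nabla f_i(x_i^k)
\end{aligned}
```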

SLIDE 10

Distributed Gradient Descent

  • Distributed Gradient Descent: fuse neighbors’ iterates, then take a local gradient step
  • W = {wij} is a doubly‐stochastic matrix (the underlying graph is balanced)
  • The step‐size goes to zero (but not too fast)
  • Agreement: all agents’ iterates approach a common value
  • Optimality: the common value minimizes f
  • Let’s do a simple analysis…
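A minimal DGD simulation sketch on scalar quadratics (the problem instance and all names are illustrative, not from the talk):

```python
import numpy as np

# n agents; agent i privately owns f_i(x) = 0.5 * (x - b[i])**2, so the
# global minimizer of f(x) = sum_i f_i(x) is mean(b).
n = 5
rng = np.random.default_rng(0)
b = rng.normal(size=n)

# Doubly-stochastic weights on an undirected ring (balanced graph).
W = np.zeros((n, n))
for i in range(n):
    W[i, i] = W[i, (i - 1) % n] = W[i, (i + 1) % n] = 1 / 3

x = np.zeros(n)                      # one scalar iterate per agent
for k in range(5000):
    alpha = 1.0 / (k + 1)            # diminishing step-size
    x = W @ x - alpha * (x - b)      # fuse neighbors, then local gradient step

print(x.round(3), "optimum:", b.mean().round(3))  # iterates cluster at the optimum
```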

SLIDE 11

Distributed Gradient Descent

  • Distributed Gradient Descent: assume the corresponding sequences converge to their limits
  • Let W be column‐stochastic (CS) but not row‐stochastic (RS): then the limits differ across agents, i.e., no agreement!
  • Let W be RS but not CS: then the limits coincide, i.e., agreement, but the common limit is suboptimal!
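A fixed‐point sketch of both cases (a reconstruction; π denotes the left eigenvector of W for eigenvalue 1):

```latex
% Suppose the stacked iterates satisfy x^k \to x^\infty with \alpha_k \to 0;
% the limit must obey
x^\infty = W\, x^\infty .
% RS but not CS: W\mathbf{1} = \mathbf{1} forces x^\infty = c\,\mathbf{1}
% (agreement), but multiplying the DGD update by \pi^\top shows that c
% minimizes \sum_i \pi_i f_i rather than \sum_i f_i: suboptimal.
% CS but not RS: the eigenvector for eigenvalue 1 is no longer \mathbf{1},
% so x^\infty cannot be a consensus vector: no agreement.
```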
SLIDE 12

Distributed Gradient Descent

  • Distributed Gradient Descent: if W is RS but not CS (unbalanced directed graphs), agents agree on a suboptimal solution
  • Consider a modification (Nedich 2013, similar in spirit but with a different execution): rescale the local gradients to undo the imbalance
  • Row‐stochasticity guarantees agreement; the scaling ensures optimality
  • How to obtain the scaling: estimate the left eigenvector?

SLIDE 13

Estimating the left eigenvector

  • A = {aij} is row‐stochastic, with left eigenvector π (πᵀA = πᵀ, πᵀ1 = 1)
  • Consider the following iteration (sketched below): each agent starts from its own canonical vector and repeatedly mixes its in‐neighbors’ vectors with the weights aij
  • Every agent learns the entire left eigenvector asymptotically
  • A similar method learns the right eigenvector for CS matrices
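A sketch of this learning iteration. Stacking the agents' vectors as the rows of Y^k gives Y^{k+1} = A Y^k with Y^0 = I, i.e., Y^k = A^k, whose rows all converge to π^T (the weight matrix below is illustrative):

```python
import numpy as np

# An asymmetric row-stochastic A on a strongly connected digraph (self-loops).
A = np.array([[0.5, 0.5, 0.0, 0.0],
              [0.2, 0.5, 0.3, 0.0],
              [0.0, 0.1, 0.6, 0.3],
              [0.4, 0.0, 0.0, 0.6]])
n = A.shape[0]

Y = np.eye(n)            # agent i holds row i of Y, initialized to e_i
for _ in range(300):
    Y = A @ Y            # each agent mixes its in-neighbors' vectors

pi = Y[0]                # every row now approximates the left eigenvector
print(pi.round(4), "check pi^T A = pi^T:", np.allclose(pi @ A, pi))
```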

SLIDE 14

Optimization over directed graphs: Recipe

  • 1. Design row‐ or column‐stochastic weights
  • 2. Estimate the other eigenvector of the eigenvalue 1: the left eigenvector in the RS case, the right eigenvector in the CS case
  • 3. Scale to remove the imbalance
  • Side note: this is the push‐sum idea (Gehrke et al., 2003; Vetterli et al., 2010), sketched below
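A minimal push‐sum average‐consensus sketch, the CS instance of this recipe: iterate a column‐stochastic B on both the values x and a scalar y that learns the right‐eigenvector imbalance, then take the ratio (instance illustrative):

```python
import numpy as np

# Column-stochastic weights: each agent splits its mass equally between
# itself and its out-neighbor on a directed cycle.
B = np.array([[0.5, 0.0, 0.5],
              [0.5, 0.5, 0.0],
              [0.0, 0.5, 0.5]])    # every column sums to 1

vals = np.array([1.0, 5.0, 9.0])   # private values; the true average is 5
x, y = vals.copy(), np.ones(3)     # y tracks the imbalance (right eigenvector)
for _ in range(100):
    x, y = B @ x, B @ y

print((x / y).round(4))            # every ratio approaches the average, 5.0
```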

SLIDE 15

Related work (a very small sample)

  • Algorithms over undirected graphs:
  • Distributed Gradient Descent (Nedich et al., 2009): non‐smooth functions
  • EXTRA (Yin et al., Apr. 2014): fuses information and uses gradients over the past two iterates; smooth, strongly convex functions; linear convergence
  • NEXT (Scutari et al., Dec. 2015): functions are smooth non‐convex + non‐smooth convex
  • Harnessing smoothness … (Li et al., May 2016): some similarities to EXTRA

SLIDE 16

Related work (a small sample)

  • Adding push‐sum to the above yields algorithms for directed graphs:
  • Gradient Push (Nedich et al., 2013): sub‐linear convergence
  • DEXTRA (Khan et al., Oct. 2015): strongly convex functions, linear convergence; the valid step‐size interval is difficult to compute
  • SONATA (Scutari et al., Jul. 2016): functions are smooth non‐convex + non‐smooth convex; sub‐linear convergence
  • ADD‐OPT (Khan et al., Jun. 2016) and Push‐DIGing (Nedich et al., Jul. 2016): strongly convex functions, linear convergence; the step‐size interval has lower bound 0
  • All of these algorithms employ column‐stochastic matrices

SLIDE 17

Column‐ vs. Row‐stochastic Weights

  • Incoming weights are simpler to design
  • For a column sum to equal 1, agent i cannot design its incoming weights, since it does not know the other neighbors of i1 and i2
  • Column‐stochastic weights are therefore designed at the outgoing edges
  • This requires knowledge of the out‐neighbors, or at least the out‐degree (a design sketch follows the figure note)

[Figure: agent i and its neighbors i1, i2, i3, i4]
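A sketch of the two designs from an edge list (illustrative graph; note that the row‐stochastic A needs only local in‐neighbor counts, while the column‐stochastic B needs each sender's out‐degree):

```python
import numpy as np

# Directed graph: edge (j, i) means j sends to i; self-loops at every node.
n = 4
edges = [(0, 1), (1, 2), (2, 3), (3, 0), (0, 2)]
in_nbrs = {i: [i] + [j for j, k in edges if k == i] for i in range(n)}
out_nbrs = {j: [j] + [k for i, k in edges if i == j] for j in range(n)}

# Row-stochastic A: receiver i weights its OWN incoming messages uniformly.
A = np.zeros((n, n))
for i in range(n):
    for j in in_nbrs[i]:
        A[i, j] = 1 / len(in_nbrs[i])

# Column-stochastic B: sender j splits by its out-degree, which the
# sender (not the receiver) has to know.
B = np.zeros((n, n))
for j in range(n):
    for i in out_nbrs[j]:
        B[i, j] = 1 / len(out_nbrs[j])

print(A.sum(axis=1), B.sum(axis=0))   # rows of A and columns of B sum to 1
```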

SLIDE 18

Optimization with Row‐stochastic weights

  • A = {aij} is row‐stochastic
  • Row‐stochastic weight design is simple
  • However, in contrast to CS methods:
  • Agents run an nth‐order consensus to learn the left eigenvector
  • Agents need unique identifiers

SLIDE 19

Optimization with Row‐stochastic weights

  • A = {aij} is row‐stochastic
  • Vector form of the algorithm (a reconstruction is sketched below)
  • In contrast, with a column‐stochastic B, ADD‐OPT/Push‐DIGing:
  • Its iterate does not result in agreement
  • The function argument is scaled by the right eigenvector
  • This ensures optimality
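A hedged reconstruction of the vector form, consistent with the recipe above (α a constant step‐size, Y^k the eigenvector‐learning iterate; the exact normalization in the talk may differ):

```latex
\begin{aligned}
Y^{k+1} &= A\,Y^k, \qquad Y^0 = I_n,\\
x^{k+1} &= A\,x^k - \alpha\, z^k,\\
z^{k+1} &= A\,z^k + \widetilde{\nabla}F(x^{k+1}) - \widetilde{\nabla}F(x^k),
\qquad
\big[\widetilde{\nabla}F(x^k)\big]_i = \frac{\nabla f_i(x_i^k)}{[Y^k]_{ii}} .
\end{aligned}
```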

SLIDE 20

Optimization with Row‐stochastic weights

  • Algorithm: as sketched above
  • A simple intuitive argument: assume each sequence converges to its limit
  • Then every agent agrees on a common value c
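The agreement step, spelled out under the notation assumed above (A primitive, so the eigenvalue 1 is simple; π its left eigenvector):

```latex
% In the limit the tracking update gives z^\infty = A\,z^\infty, hence
% z^\infty = c_z \mathbf{1}. Multiplying x^\infty = A\,x^\infty - \alpha z^\infty
% by \pi^\top (\pi^\top A = \pi^\top,\ \pi^\top \mathbf{1} = 1) forces c_z = 0,
% and therefore
x^\infty = A\,x^\infty \;\Longrightarrow\; x^\infty = c\,\mathbf{1}.
```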

SLIDE 21

Optimization with Row‐stochastic weights

  • Algorithm: as above
  • Show that c is the optimal solution
  • Sum the tracking update over k; the gradient terms telescope (see below)
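One way the telescoping step is usually written, under the same assumed notation (z^0 initialized to the scaled gradients):

```latex
% Multiply the tracking update by \pi^\top and sum over k; the mixing term is
% invariant (\pi^\top A = \pi^\top) and the gradient differences telescope:
\pi^\top z^k \;=\; \pi^\top \widetilde{\nabla}F(x^k)
\;=\; \sum_{i=1}^{n} \pi_i\, \frac{\nabla f_i(x_i^k)}{[Y^k]_{ii}} .
```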

SLIDE 22

Optimization with Row‐stochastic weights

  • Algorithm: as above
  • Asymptotically, the eigenvector scaling cancels and c satisfies the global optimality condition (see below)
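Combining the limits: Y^k = A^k → 1π^T, so [Y^k]_{ii} → π_i, and c_z = 0 from the agreement step, giving

```latex
0 \;=\; \pi^\top z^\infty
\;=\; \sum_{i=1}^{n} \pi_i\, \frac{\nabla f_i(c)}{\pi_i}
\;=\; \sum_{i=1}^{n} \nabla f_i(c)
\;=\; \nabla f(c)
\;\Longrightarrow\; c = x^\star .
```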

SLIDE 23

Optimization with Row‐stochastic weights

  • Algorithm: as above
  • We assumed that the sequences reach their limits
  • But under what conditions, and at what rate?

SLIDE 24

Convergence conditions

  • Assume strong‐connectivity, Lipschitz‐continuous gradients, and strongly‐convex functions
  • Consider a vector tk that stacks the error quantities of the algorithm
  • If some norm of tk goes to 0, then each element goes to 0 and the sequences converge to their limits

SLIDE 25

Convergence conditions

  • Assume strong‐connectivity, Lipschitz‐continuous gradients, and strongly‐convex functions
  • Lemma: Hk goes to 0 linearly
  • Lemma: The spectral radius of G is less than 1
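Why the two lemmas give linear convergence: a standard perturbed linear‐recursion sketch, assuming the errors satisfy t_{k+1} ≤ G t_k + H_k entrywise with ‖H_k‖ = O(σ^k), σ < 1:

```latex
% Unroll the recursion:
t_{k} \;\le\; G^{k} t_0 \;+\; \sum_{j=0}^{k-1} G^{\,k-1-j} H_j .
% Since \rho(G) < 1, \|G^k\| = O((\rho(G)+\epsilon)^k) for any small \epsilon > 0,
% and both terms decay linearly:
\|t_k\| = O(\gamma^k), \qquad \gamma = \max\{\rho(G)+\epsilon,\ \sigma\} .
```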

SLIDE 26

Convergence conditions

SLIDE 27

Convergence Rate

  • The rate variable γ is the maximum of the fusion rate and the rate at which the powers of G decay

SLIDE 28

Some comparison

SLIDE 29

Conclusions

  • Optimization with row‐stochastic matrices:
  • Does not require knowledge of the out‐neighbors or out‐degree
  • Agents require unique identifiers
  • Strongly‐convex functions with Lipschitz‐continuous gradients
  • Strongly‐connected directed graphs
  • Linear convergence

SLIDE 30

More Information

My webpage: http://www.eecs.tufts.edu/~khan/
My email: khan@ece.tufts.edu
My lab’s YouTube channel: https://www.youtube.com/user/SPARTNatTufts/videos/