Independent vs. Joint Estimation in Multi-Agent Iterative Learning Control
Angela Schoellig, Javier Alonso-Mora and Raffaello D'Andrea
Institute for Dynamic Systems and Control, ETH Zürich, Switzerland
Control and Decision Conference 2010, Atlanta – Dec 17, 2010

SYSTEMS ARE ABLE TO LEARN
Open-loop swing-up of a cart-pendulum system. [Schöllig and D'Andrea, ECC 2009]
https://youtu.be/W2gCn6aAwz4?list=PLC12E387419CEAFF2
Angela Schoellig - ETH Zürich

CAN SIMILAR SYSTEMS BENEFIT FROM EACH OTHER...
...when learning the same task?
• Blind Juggler Array
• Flying Machine Arena
• KIVA Systems
• Distributed Flight Array
• Balancing Cube

PROBLEM STATEMENT
We consider
• a group of similar agents
• performing the same task
• repeatedly
• in simultaneous operation.
Is an individual agent able to learn faster when performing a task simultaneously with a group of similar agents?

SIMILAR AGENTS (1)
Same nominal dynamics: a physical model of the real-world system.
Same task.
GOAL OF LEARNING: Follow the desired trajectory.

SIMILAR AGENTS (2)
Linearize: small deviations from the nominal trajectory.
Discretize: linear, time-varying difference equations.
Lifted-system representation: a static mapping representing one execution.
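The lifted-system idea can be illustrated with a short sketch. The slide's own matrices are not preserved in this transcript, so the code below uses the standard construction under stated assumptions: deviations obey x(k+1) = A(k) x(k) + B(k) u(k), y(k) = C(k) x(k), the initial deviation is zero, and outputs are stacked from time 1 to N. The function name `lift_ltv` and the index convention are hypothetical choices made for illustration.

```python
import numpy as np

def lift_ltv(A, B, C):
    """Stack one execution of a linear time-varying system into y = F u.

    Assumed conventions (not taken from the slides):
      A, B are lists indexed by time 0..N-1, C is indexed by time 0..N,
      u = [u(0), ..., u(N-1)], y = [y(1), ..., y(N)], x(0) = 0.
    """
    N = len(B)
    m = B[0].shape[1]          # number of inputs
    p = C[1].shape[0]          # number of outputs
    F = np.zeros((N * p, N * m))
    for col in range(N):               # input applied at time col
        Phi = B[col]                   # effect of u(col) on x(col + 1)
        for row in range(col, N):      # output at time row + 1
            F[row * p:(row + 1) * p, col * m:(col + 1) * m] = C[row + 1] @ Phi
            if row + 1 < N:
                Phi = A[row + 1] @ Phi  # propagate to x(row + 2)
    return F

# Small time-invariant example: x(k+1) = 0.5 x(k) + u(k), y(k) = x(k)
N = 3
A = [np.array([[0.5]])] * N
B = [np.array([[1.0]])] * N
C = [np.array([[1.0]])] * (N + 1)
F = lift_ltv(A, B, C)
print(F)
```

The resulting F is lower triangular, which reflects causality: an output at a given time depends only on earlier inputs.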

SIMILAR BUT NOT IDENTICAL...
In the iteration domain, the model for each trial is indexed by agent and by iteration, and includes measurement noise and process noise.
For each agent: a REPETITIVE DISTURBANCE.
Same nominal dynamics. Same task. Different repetitive disturbance.

HOW DOES A SINGLE AGENT LEARN?
Cycle: EXECUTE NEW ITERATION, ESTIMATE, CORRECT.
(1) ESTIMATE: Estimate the repetitive disturbance by taking into account all past measurements, and obtain a disturbance estimate.
(2) CORRECT: Correct for the estimated disturbance by updating the input so as to "minimize" its effect.
Can the disturbance estimate be improved by taking into account the measurements of the other agents?
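The execute, estimate, correct cycle of a single agent can be sketched as follows. This is a minimal illustration and not the authors' implementation. It assumes a lifted model y = F u + d + noise, where y is the deviation from the desired trajectory and d is the repetitive disturbance, a standard Kalman filter for the estimation step, and a least-squares input update for the correction step. The names `run_ilc`, `d_true`, `meas_std` and `proc_var` are hypothetical.

```python
import numpy as np

def run_ilc(F, d_true, iterations=20, meas_std=0.05, proc_var=0.0, seed=0):
    """Simulate one agent learning to cancel a repetitive disturbance.

    Assumed model (illustrative only): y_j = F u_j + d + measurement noise.
    Returns the norm of the tracking deviation at every iteration.
    """
    rng = np.random.default_rng(seed)
    n = len(d_true)
    d_hat = np.zeros(n)               # disturbance estimate
    P = np.eye(n)                     # assumed initial estimate covariance
    Q = proc_var * np.eye(n)          # assumed process noise covariance
    R = meas_std ** 2 * np.eye(n)     # measurement noise covariance
    u = np.zeros(F.shape[1])
    errors = []
    for _ in range(iterations):
        # EXECUTE: run the task with the current input
        y = F @ u + d_true + meas_std * rng.standard_normal(n)
        errors.append(np.linalg.norm(y))
        # ESTIMATE: Kalman update using z = y - F u = d + noise
        P = P + Q
        K = P @ np.linalg.inv(P + R)
        d_hat = d_hat + K @ (y - F @ u - d_hat)
        P = (np.eye(n) - K) @ P
        # CORRECT: choose u to minimize ||F u + d_hat|| in the least-squares sense
        u = -np.linalg.lstsq(F, d_hat, rcond=None)[0]
    return np.array(errors)

F = np.array([[1.0, 0.0, 0.0], [0.5, 1.0, 0.0], [0.25, 0.5, 1.0]])
d_true = np.array([1.0, -0.5, 0.3])
errors = run_ilc(F, d_true)
print(errors[0], errors[-1])
```

In this sketch the deviation drops sharply after the first correction and then settles near the measurement noise level, since the estimate cannot be better than the data it is built from.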

FOCUS: ESTIMATION PROBLEM
INDEPENDENT ESTIMATION vs. JOINT ESTIMATION

REDUCE MODEL DYNAMICS
• neglect deterministic part
• assume state is measured directly
• assume independence and same noise characteristics for vector entries
MEASUREMENT AND PROCESS NOISE
LEARNING PERFORMANCE is measured by the variance of the state estimate.
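Under the simplifications listed above, each vector entry can be treated as a scalar estimation problem. The sketch below assumes a random-walk state d_j = d_{j-1} + process noise that is measured directly as y_j = d_j + measurement noise. The reduced model itself is not preserved in this transcript, so this form is an assumption. The names `estimate_variance`, `p0`, `q` and `r` are hypothetical.

```python
import math

def estimate_variance(p0, q, r, iterations):
    """Posterior variance of a scalar Kalman filter after each iteration.

    p0: initial variance, q: process noise variance, r: measurement noise
    variance (r > 0). Model assumed for illustration:
    d_j = d_{j-1} + w_j,  y_j = d_j + v_j.
    """
    p = p0
    history = []
    for _ in range(iterations):
        p = p + q                # prediction step adds process noise
        p = p * r / (p + r)      # measurement update shrinks the variance
        history.append(p)
    return history

pure_measurement = estimate_variance(p0=1.0, q=0.0, r=1.0, iterations=4)
with_process = estimate_variance(p0=1.0, q=1.0, r=1.0, iterations=50)
print(pure_measurement[-1], with_process[-1])
```

With q = 0 the variance keeps shrinking as more trials are observed. With q > 0 it settles at a positive floor, the positive root of p^2 + q p - q r = 0, so additional trials stop helping.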

JOINT ESTIMATION
Estimation objective. Kalman equations. Variance of disturbance estimate.
PROPOSITION: Covariance of an individual's disturbance estimate (joint case and INDEPENDENT CASE).

COMPARISON
COVARIANCE OF STATE ESTIMATE
RATIO OF COVARIANCE: independent vs. joint estimation
(I) PURE PROCESS NOISE
(II) PURE MEASUREMENT NOISE
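The comparison can be explored numerically with a small sketch. The formulas from the slides are not preserved here, so the parametrization below is an assumption chosen for illustration and may differ from the one used in the talk: each agent's scalar disturbance has a component common to all agents and an individual component, both in the initial uncertainty (`p_common`, `p_indiv`) and in the process noise (`q_common`, `q_indiv`), and every agent measures its own disturbance with noise variance `r`. All names are hypothetical.

```python
import numpy as np

def joint_vs_independent(n_agents, iterations, p_common, p_indiv,
                         q_common, q_indiv, r):
    """Variance of agent 1's disturbance estimate: (joint, independent).

    Joint: one Kalman filter over all agents, using every measurement.
    Independent: a scalar Kalman filter using only agent 1's measurements.
    Requires r > 0; pure process noise can be approximated by a small r.
    """
    ones = np.ones((n_agents, n_agents))
    eye = np.eye(n_agents)
    P = p_common * ones + p_indiv * eye   # joint covariance across agents
    Q = q_common * ones + q_indiv * eye   # correlated process noise
    R = r * eye                           # independent measurement noise
    p = p_common + p_indiv                # single-agent variance
    q = q_common + q_indiv
    for _ in range(iterations):
        # joint filter
        P = P + Q
        K = P @ np.linalg.inv(P + R)
        P = (eye - K) @ P
        # independent filter
        p = p + q
        p = p * r / (p + r)
    return P[0, 0], p

# Pure measurement noise, fully shared disturbance, 10 agents, 5 iterations
p_joint, p_indep = joint_vs_independent(10, 5, 1.0, 0.0, 0.0, 0.0, 1.0)
print(p_joint / p_indep)
```

In this sketch the ratio equals 1 when the common components are zero, so sharing measurements brings no benefit if the agents have nothing in common. It falls below 1 as the common part grows relative to the individual part and the noise.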

RESULT
Performance increase due to joint estimation:
THEOREM 1: Pure Process Noise (limit case)
THEOREM 2: Pure Measurement Noise (limit case)

EXAMPLE
For 10 agents (figure).

JOINT ESTIMATION IS ONLY BENEFICIAL IF...
(1) High similarity between agents
(2) Process noise negligible
(3) Common model error large compared to the noise

