Independent vs. Joint Estimation in Multi Agent Iterative Learning - - PowerPoint PPT Presentation

independent vs joint estimation in multi agent iterative
SMART_READER_LITE
LIVE PREVIEW

Independent vs. Joint Estimation in Multi Agent Iterative Learning - - PowerPoint PPT Presentation

Independent vs. Joint Estimation in Multi Agent Iterative Learning Control Angela Schoellig, Javier Alonso Mora and Raffaello DAndrea Institute for Dynamic Systems and Control ETH Zrich, Switzerland 1 Control and Decision Conference


slide-1
SLIDE 1

1

Independent vs. Joint Estimation in Multi‐Agent Iterative Learning Control

Angela Schoellig, Javier Alonso‐Mora and Raffaello D‘Andrea

Institute for Dynamic Systems and Control ETH Zürich, Switzerland Control and Decision Conference 2010, Atlanta – Dec 17, 2010

slide-2
SLIDE 2

2

SYSTEMS ARE ABLE TO LEARN

Open‐loop swing‐up of a cart‐pendulum system.

[Schöllig and D'Andrea, ECC 2009]

https://youtu.be/W2gCn6aAwz4?list=PLC12E387419CEAFF2

Angela Schoellig ‐ ETH Zürich

slide-3
SLIDE 3

3

…when learning the same task?

CAN SIMILAR SYSTEMS BENEFIT FROM EACH OTHER...

Angela Schoellig ‐ ETH Zürich

Distributed Flight Array KIVA Systems Blind Juggler Array Flying Machine Arena Balancing Cube

slide-4
SLIDE 4

4

PROBLEM STATEMENT

Angela Schoellig ‐ ETH Zürich

We consider

  • A group of similar agents
  • Performing the same task
  • Repeatedly
  • Simultaneous operation

Is an individual agent able to learn faster when performing a task simultaneously with a group of similar agents?

slide-5
SLIDE 5

5

SIMILAR AGENTS (1)

Angela Schoellig ‐ ETH Zürich

Same nominal dynamics. Same task.

Physical model of real‐world system GOAL OF LEARNING: Follow the desired trajectory.

slide-6
SLIDE 6

6

  • Linearize. Small deviations from nominal trajectory.
  • Discretize. Linear, time‐varying difference equations.

Lifted‐system representation. Static mapping representing one execution. With and

SIMILAR AGENTS (2)

Angela Schoellig ‐ ETH Zürich

slide-7
SLIDE 7

7

In the iteration domain. For trial : For each agent :

Same nominal dynamics. Same task. Different repetitive disturbance.

SIMILAR BUT NOT IDENTICAL...

Angela Schoellig ‐ ETH Zürich

Iteration index Agent index Measurement noise Process noise

REPETITIVE DISTURBANCE

slide-8
SLIDE 8

8 (1) Estimate the repetitive disturbance by taking into account all past measurements. Obtain . (2) Correct for by updating the input. “Minimize” . For example,

HOW DOES A SINGLE AGENT LEARN?

NEW ITERATION

EXECUTE ESTIMATE CORRECT

Can the disturbance estimate be improved by taking into account the measurements of the other agents?

Angela Schoellig ‐ ETH Zürich

slide-9
SLIDE 9

9

FOCUS: ESTIMATION PROBLEM

Angela Schoellig ‐ ETH Zürich

INDEPENDENT ESTIMATION vs. JOINT ESTIMATION

slide-10
SLIDE 10

10

DYNAMICS

 neglect deterministic part  assume state is measured directly  assume independence and same noise characteristics for vector entries

REDUCE MODEL

Angela Schoellig ‐ ETH Zürich with MEASUREMENT AND PROCESS NOISE

LEARNING PERFORMANCE is measured by the variance of the state estimate.

slide-11
SLIDE 11

11

INDEPENDENT CASE:

JOINT ESTIMATION

Angela Schoellig ‐ ETH Zürich

Estimation objective. Kalman equations. Variance of disturbance estimate.

PROPOSITION: Covariance of an individual’s disturbnance estimate

slide-12
SLIDE 12

12

COVARIANCE OF STATE ESTIMATE: with RATIO OF COVARIANCE: independent vs. joint estimation

(I) PURE PROCESS NOISE (II) PURE MEASUREMENT NOISE

COMPARISON

Angela Schoellig ‐ ETH Zürich

slide-13
SLIDE 13

13

Performance increase due to joint estimation:

THEOREM 1: Pure Process Noise THEOREM 2: Pure Measurement Noise

RESULT

Angela Schoellig ‐ ETH Zürich

limit case for limit case for

slide-14
SLIDE 14

14

For 10 agents:

EXAMPLE

slide-15
SLIDE 15

15

JOINT ESTIMATION IS ONLY BENEFICIAL IF...

(1) High similarity between agents (2) Process noise negligible

Angela Schoellig ‐ ETH Zürich

(3) Common model error large compared to the noise

slide-16
SLIDE 16

16

Independent vs. Joint Estimation in Multi‐Agent Iterative Learning Control

Angela Schoellig, Javier Alonso‐Mora and Raffaello D‘Andrea

Institute for Dynamic Systems and Control ETH Zürich, Switzerland Control and Decision Conference 2010, Atlanta – Dec 17, 2010