D ISTRIBUTED S YSTEMS - The Next Grand Challenge in Embedded System - - PowerPoint PPT Presentation

d istributed s ystems
SMART_READER_LITE
LIVE PREVIEW

D ISTRIBUTED S YSTEMS - The Next Grand Challenge in Embedded System - - PowerPoint PPT Presentation

D ISTRIBUTED S YSTEMS - The Next Grand Challenge in Embedded System Design Jan M. Rabaey Donald O. Pederson Distinguished Prof. Director FCRP MultiScale Systems Center (MuSyC) Scientific Co-Director Berkeley Wireless Research Center University


slide-1
SLIDE 1

DISTRIBUTED SYSTEMS -

Jan M. Rabaey

Donald O. Pederson Distinguished Prof. Director FCRP MultiScale Systems Center (MuSyC) Scientific Co-Director Berkeley Wireless Research Center University of California at Berkeley

INTEL, FEBRUARY 23 2011

The Next Grand Challenge in Embedded System Design

slide-2
SLIDE 2

Infrastructural core

The Swarm and the Cloud

TRILLIONS OF CONNECTED DEVICES

[J. Rabaey, ASPDAC’08]

THE CLOUD THE SWARM

slide-3
SLIDE 3

The New Moore’s Law

Still … Improve functionality per unit cost to create whole new application areas, But in a brand new setting

1970 Mainframes 1980 PCs 1990 Internet 2000 Wireless & Personal Devices 2010- Cloud Computing Immersive User Experiences Ubiquitous Sensing

slide-4
SLIDE 4

The Swarm Perspective

It’s A Connected World

Time to Abandon the “Component”-Oriented Vision Moore’s Law Revisited: Scaling is in number of connected devices, no longer in number of transistors/chip

[MuSyC 2009]

The functionality is in the swarm! Resources can be dynamically provided based on availability

slide-5
SLIDE 5

One Vision: CyberPhysical Systems

Linking the Cyber and Physical Words

[H. Gill, NSF 2008]

slide-6
SLIDE 6

Another One: BioCyber (?) Systems

Linking the Cyber and Biological Worlds

Examples: Brain-machine interfaces and body-area networks

slide-7
SLIDE 7

The Cloud and the Swarm Distributed Sense and Control Challenges

Complexity

Modeling/ Abstractions

System Metrics (ENERGY)

Run-time Management / Diagnostics

Verification Security/Tru st Robustness/ Reliability

Failure to Address in Fundamental and Cohesive Way will Slow Down or Prohibit Adoption

slide-8
SLIDE 8

It’s All About Energy

Energy among most compelling concern of distributed IT platform and its applications.

Mobiles Smart grid Avionics Human-centric systems

OUR VISION: Distributed Sense and Control Systems to Dynamically Enforce Energy-Proportionality

slide-9
SLIDE 9

Business as Usual Will Not Do

The mantra’s of two decades of low-power design: slow, simple, many, dedicated, adaptive

While some opportunities are left, concepts now commonly exploited The end of voltage and energy scaling !?

Unless novel devices are adapted soon …

0.001 0.01 0.1 1 Total Switching Leakage

0.2 0.4 0.6 0.8 1 1.2 VDD (V) 0.001 0.01 0.1 1 Energy (norm.)

0.3V

12x

In Need of Novel Architectural Ideas

slide-10
SLIDE 10

The Golden Opportunity

Energy-efficiency of most systems decreases under reduced loads Energy-Proportional Computing

Throughput Actual Ideal

Power

Courtesy:

  • L. Barroso, Google
slide-11
SLIDE 11

Computation and Energy

Throughput Power Actual Ideal DOING NOTHING (or LITTLE) WELL

Energy efficiency of most systems degrades under reduced load conditions How we design systems How nature designs systems

[* Term coined by L. Barroso, Google]

slide-12
SLIDE 12

A Generic Concept

Throughput Power Actua l Ideal DOING NOTHING (or LITTLE) WELL

  • Conceive and Enable Systems that are

Energy-Proportional over Large Throughput Range.

  • Applies to all aspects of the IT Platform!

Not the case in today’s systems (computing, storage, communication)

slide-13
SLIDE 13

The Big Picture

Hugely Scalable Platforms

“Providing computation/computation at the optimal energy”

Attention-Optimized Computing/Communication

“Matching computation to desired utility”

Utility Maximization

A Closed Loop System

slide-14
SLIDE 14

The Cloud/Swarm Challenge

  • Trade off computation and communication
  • in light of limited energy, communication and,

computational resources

  • so that desired utility is reached
  • under highly variable conditions and loads

Requires scalable distributed optimization strategy

slide-15
SLIDE 15

The “Playground”

A continuously changing alignment (environment, density, activity) The Swarm/Cloud Operating System -

Dynamically trading off resources

The Swarm/Cloud Services and Applications “What matters in the end is the utility delivered to the user”

Utility Maximization

Distributed Resources

Communication (Spectrum) Computation

Sensing Actuation Storage Energy

slide-16
SLIDE 16

ADDRESSING THE CHALLENGES

slide-17
SLIDE 17

17

Focus Center Research Program

Features

 Multi-university teams  Focus on topics where evolutionary R&D is insufficient  Emphasis on discovery; long-range time horizon  Large-scale effort (~ $7M per center annually)  Equal cost sharing between industry & government  Access to relevantly trained graduate students

“The (SRC) focus center program is designed to create a nationwide, multi-university network of research centers that will keep the United States and U.S. semiconductor firms at the front of the global microelectronics revolution.”

Craig R. Barrett Retired Chairman of the Board, Intel Former Chair, Semiconductor Technology Council Recent Chair, FCRP Governing Council

slide-18
SLIDE 18

MUSYC IN A NUTSHELL

Grand Goal: Grand Challenge: Create comprehensive and systematic solution the distributed multi-scale system design challenge. “Energy-smart” distributed systems, that

  • Are deeply aware of balance between energy availability and

demand

  • Adjust behavior through dynamic and adaptive optimization at

all scales of design hierarchy. Common Core: 20 Faculty Distributed over 10 US Universities SCS Theme Distributed sense and control systems. Target: Airborne Platforms (Avionics) LSS Theme Large-scale “energy-intensive” systems Target: Data centers SSS Theme Small-scale “energy-frugal” systems Target: Human-centered networks for augmented sensing (e.g. BMI) Exploring the multi-scale space:

slide-19
SLIDE 19

THE MUSYC TEAM

SCS LSS SSS

Including experts in petascale computing, networking, control, signal processing, information theory, avionics and neuro-engineering

slide-20
SLIDE 20

THEME 1: DISTRIBUTED SENSE AND CONTROL SYSTEM METHODOLOGY (A. SANGIOVANNI- VINCENTELLI)

Address challenges in complex distributed control systems by employing structured and formal design methodologies that seamlessly and coherently combine various dimensions of multi-scale design space, and that provide appropriate abstractions to manage inherent complexity. Case study: Avionics

Complexity

slide-21
SLIDE 21

SCS DRIVERS AND METRICS

Today

Power sources/sinks Electric distribution Control system

  • # power sources ~1
  • # loads ~100
  • peak power ~ 400kW
  • # power sources ~ 10
  • # loads ~1000
  • peak power ~ 4MW

Tomorrow

Large Airborne Platforms

In Line with DARPA META Program Reduction of development time of complex, distributed control systems by 2X through increased use of formal methods for specification, design and verification. Reduction of the number of faults that require the system to be taken out of service for inspection or repair by 2X, through the increase used of onboard models and dynamic reconfiguration to provide enhanced fault tolerance.

slide-22
SLIDE 22

SCS HIGHLIGHT: FORMULATED DESIGN FLOW FOR DISTRIBUTED AVIONICS SYSTEMS

Platform-based design enables architecture exploration (tradeoff weight, stability, …)

Power System Architecture Control System Architecture Hardware, Software, Communications

Redesign

Incremental conservative design

  • Steady state worst case power draw
  • 2x overdesign results in weight penalty

Dynamics problems identified in verification Communications latency impacts stability Dynamics, control, communication latency addressed in all layers

Current State of the Art

Robust design for distributed control system Ptolemy, Metro tools enable robust design

  • f complex dynamical

systems

Our Approach (STRONG impact on META I and II BAA) Collaboration with UTC (HS), IBM and Raytheon Contributors: E. Lee, R. Murray and ASV Realistic Test Benches under development

slide-23
SLIDE 23

THEME 2: LARGE-SCALE SYSTEMS (T. SIMUNIC- ROSING)

Realize distributed closed-loop power-management strategies that result in “energy- intensive” large-scale systems to be orders of magnitude more energy-efficient, while ensuring that mission-critical goals are met. To be accomplished by employing holistic multi-scale solution including all components of the system at multiple hierarchy levels. Target: Data centers

“Doing nothing well”

slide-24
SLIDE 24

LSS DRIVERS AND METRICS

SOLUTION: Distributed and hierarchical management that ensures that energy is only consumed if, when and where needed.

Enable “energy-proportional” computing, and to “do nothing well” in Datacenters and Cloud Computing METRIC: Datacenter Energy Efficiency

Barroso & Hölzle, 2009

slide-25
SLIDE 25

LSS HIGHLIGHT: ENERGY-AWARE LOAD

SCHEDULING

B Workload Model/ Predictor Energy Aware Workload Scheduler Cluster Manager Building/Facility Manager Tasks SLAs Energy Supply Information Energy Consumption Application Resource Footprint

Contributors: Katz, Snavely, Rosing, NSF GreenLight

Cooling-aware management

slide-26
SLIDE 26

THEME 3: SMALL-SCALE SYSTEMS (D. JONES)

Explore absolute bounds of energy-efficiency and miniaturization in “energy-frugal” human-centric distributed IT systems, through distributed management strategy that dynamically and adaptively selects correct operational point corresponding to varying application needs in terms of accuracy or resolution. Target: Augmented sensing in humans (BMI)

slide-27
SLIDE 27

SSS DRIVERS AND METRICS

KEY METRIC: UTILITY/ENERGY Utility Maximization

  • Define system performance in terms of

user/application relevant utility

  • Dynamically optimize algorithms and

platforms to maximize utility

Explore, analyze, and implement advanced closed-loop learning systems in brain-machine interfaces

In collaboration with UCB Neuroscience and UCSF Neurosurgery

slide-28
SLIDE 28

Scalable Signal acquisition

Utility-Optimizing Scalable Systems Management

Hugely Scalable Processor Attentional Algorithms Scalable Radio Frequency Tx and Rx RF Energy Harvesting Efficient Integrated Microscopic Antenna Voltage Scalable Power Source

3D Integrated Packaging

HUGELY SCALABLE SSS PLATFORM

3 10 70 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 Vmin [V] Stage depth [fo4] 20f 25f 30f 35f 40f 45f Etotal [J]
slide-29
SLIDE 29

3 3

SSS HIGHLIGHT: HUGELY SCALABLE BMI PLATFORMS

Contributors: Rabaey, Blaauw, Franzon

3D Inductors promising higher L, Q

Energy-neutral wireless link delivers energy- proportionality over broad performance range from scavenged power

3

1 mm

65 nm CMOS, in fab

[Franzon] [Rabaey] Low-Jitter Timers for Power Control [Blaauw]

1.4 μJ/hour

IBM 130 nm CMOS

slide-30
SLIDE 30

In Summary … The Laws of the Cloud and the Swarm

 In a connected world, functionality arises

from connections of devices.

 Largest efficiency gain obtained by

balancing available resources: computation, communication and energy.

 The dynamic nature of the environment,

the needs and the resources dictate adaptive solutions.

 No one wins by being selfish.

Cooperation and collaboration are a must.

MuSyC as a Collaborative Answer to the Swarm and Cloud Challenges