OddCI: On-Demand Distributed Computing Infrastructure Rostand Costa - - PowerPoint PPT Presentation

oddci on demand distributed computing infrastructure
SMART_READER_LITE
LIVE PREVIEW

OddCI: On-Demand Distributed Computing Infrastructure Rostand Costa - - PowerPoint PPT Presentation

OddCI: On-Demand Distributed Computing Infrastructure Rostand Costa Francisco Brasileiro Guido Lemos Filho Dnio Mariz Sousa MTAGS 2nd Workshop on Many-Task Computing on Grids and Supercomputers Co-located with ACM/IEEE SC09 (International


slide-1
SLIDE 1

OddCI: On-Demand Distributed Computing Infrastructure

Rostand Costa Francisco Brasileiro Guido Lemos Filho Dênio Mariz Sousa

MTAGS 2nd Workshop on Many-Task Computing on Grids and Supercomputers Co-located with ACM/IEEE SC09 (International Conference for High Performance, Networking, Storage and Analysis) Portland, Oregon -- November 16th, 2009

1

slide-2
SLIDE 2

Agenda

Motivation DCI requirements for MTC OddCI: a novel approach to DCI OddCI over a digital TV network

2

Performance assessment Concluding remarks

slide-3
SLIDE 3

Introduction

MTC speeds up execution of applications, but...

Large amount of parallelism can only be achieved if there is a relatively high level of independency among the sub-tasks The scheduler need to have access to a huge number of processors.

  • number of processors.

In this paper we are concerned with the issue of

Providing ways to assemble large pools of processors for the execution of MTC applications. In particular, we focus on large-scale distributed computing infrastructures (DCI)

3

slide-4
SLIDE 4

System Requirements

The throughput achieved by MTC over a DCI depends on

the scale it allows

To provide extremely high-throughput computing to a large

number of applications, a DCI must meet some requirements:

extremely high scalability: it must be able to handle up to extremely high scalability: it must be able to handle up to hundreds of millions of processing resources in the same way that it handles a few dozens of them;

  • n-demand instantiation: it must offer mechanisms for

discovery, assemblage and coordination of the required resources, on demand and for a specified amount of time; efficient setup: the configuration of the processing nodes must be carried out quickly and demanding minimal interventions.

4

slide-5
SLIDE 5

Available Alternatives

Desktop Grid Computing

the combination of computer resources from a single or multiple administrative domains applied to a common task e.g. Condor, OurGrid, Alchemi

Voluntary Computing

a type of distributed computing in which computer owners a type of distributed computing in which computer owners donate their computing resources (such as processing power and storage) to one or more "projects“. e.g. SETI@home, FightAIDS@home, Folding@Home

Infrastructure as a Service (IaaS)

the delivery of computer infrastructure (typically a platform virtualization environment) as a service e.g. Amazon Elastic Compute Cloud (Amazon EC2),

5

slide-6
SLIDE 6

No available technology is able to simultaneously

address all the requirements to provide extremely

high-throughput computing to over a DCI

Available Alternatives Vs Requirements

Available Technologies Requirement Voluntary Computing Desktop Grid Infrastructure as a Service Extremely High Scalability

  • Efficient Setup
  • On-demand

Instantiation

  • 6
slide-7
SLIDE 7

On-Demand Distributed Computing Infrastructure

OddCI consider a special category of devices which

may be organized as a broadcast network

Mobile phones, Digital TV receivers, Cable TV receivers Devices connected to the Internet with reasonably Devices connected to the Internet with reasonably powerful processors Broadcast network can access simultaneously all the devices which can be coordinated to run some task

7

slide-8
SLIDE 8

On-Demand Distributed Computing Infrastructure

A novel architecture for generic DCI Flexible

Can be used for several scenarios and with different technologies and devices

Potentially highly scalable

Millions of potential devices Millions of potential devices

On-demand instantiation

Resources are discovered and allocated as required and for a specified amount of time

Efficient setup

Building DCI instances with millions or thousand nodes demands similar effort via broadcast communication

8

slide-9
SLIDE 9

OddCI Architecture

Provider: creates, manages, destroys OddCI instances

Backend Controller Provider

PNA 1 PNA N

...

Broadcast Channel Direct Channel

Provider: creates, manages, destroys OddCI instances Controller: Setup, controls, sends software images,

monitors PNA status

Backend: schedules tasks, provide input data, collects

  • utput data, post-processing

PN Agent: actually runs tasks, processes control

messages

9

slide-10
SLIDE 10

OddCI Architecture: operation

Backend Controller Provider

PNA 1 PNA N

...

Broadcast Channel Direct Channel

10

User submits a “processing request” to the

provider

DCI instance size (number of processing nodes) Application image, common data Node requirements

slide-11
SLIDE 11

Backend Controller Provider

PNA 1 PNA N

...

Broadcast Channel Direct Channel

OddCI Architecture: operation

11

Provider evaluate the user request

checks availability keeps control information

Command the Controller for creating the OddCI

required instance

slide-12
SLIDE 12

Backend Controller Provider

PNA 1 PNA N

...

Broadcast Channel Direct Channel

OddCI Architecture: operation

12

Controller triggers a wakeup process to PNAs

through the broadcast channel

PNA can drop jobs when busy or accept when idle

Controller also send other control messages (e.g.

dismantle instances)

All PNA receives messages simultaneously

slide-13
SLIDE 13

OddCI Architecture: operation

Backend Controller Provider

PNA 1 PNA N

...

Broadcast Channel Direct Channel

13

PNA loads application image for execution in a DVE

(Dynamic Virtual Environment)

Controller monitors active PNA Direct channel is a two-way road

Application can interact with the Backend for requesting specific input data or send results (optional) PNA sends status messages frequently to the Controller

slide-14
SLIDE 14

Proof of Concept: OddCI over a Digital TV Network

Why DTV network?

Open technology, well-defined standards Native transmission of data in broadcast Fast expansion, being deployed in many countries Great spectrum of devices: from set-top boxes to mobile devices Great spectrum of devices: from set-top boxes to mobile devices Potential for millions of devices Powerful middleware

And also ...

Feasibility for building a testbed Previous experience of our group

14

slide-15
SLIDE 15

Digital TV Broadcaster

DTV Generic Model

Digital TV Receiver Content Production

  • Broadcast

Channel

DSM-CC DTV Broadcast Transmission MPEG-2 Transport Stream

  • 15
  • Controller

Gateway Carousel Generator Controller Integration

PNA

Application Xlet DTV Receiver PNA Xlet Middleware

Backend

Interaction Channel

DTV Return path PNA Communication Internet

Applications & Data

Provider

slide-16
SLIDE 16

Digital TV Broadcaster

DTV Generic Model

Digital TV Receiver Content Production

  • 16
  • Applications

& Data

slide-17
SLIDE 17

Implementing OddCI over DTV components

Processing Nodes Direct Channel Application Xlet

DTV Receiver

PNA Xlet Middleware DTV Return path

PNA Communication

Internet

Backend

17

Controller

Processing Nodes Channel Broadcast Channel Gateway Carousel Generator

Controller Integration

DSM-CC

DTV Broadcast Transmission

MPEG-2 Transport Stream

Provider Backend

slide-18
SLIDE 18

Experiment setup

Experiments were performed using:

SBTVD (Brazilian DTV standard) Software

  • Brazilian middleware “Ginga” implementation from UFPB
  • NCBI Toolkit ported using a cross-compiler

NCBI Toolkit ported using a cross-compiler

  • BLAST – Basic Local Alignment Search Tool tasks, from

NCBI

DTV Receiver

  • STI microelectronic´s processor ST7109
  • 32MB Flash memory, 256MB RAM

Reference system

  • Dual Core Pentium, 1.6GHz, 1GB RAM, Debian Linux

18

slide-19
SLIDE 19

DTV STB Performance - Preliminary findings

10 20 30 40 50 60 70

Performance factor

Relative Processing Time - BLASTall program

PC (Ref) STB In Use/PC STB Standby/PC

Lower is better

STB “in use” means “user watching TV” Experiment Setup

BLAST application running in a STB and compared with a reference PC desktop STB with Brazilian middleware “Ginga” Tests performed using the cheapest STB in the Brazilian market (~US$ 100)

Remarks

Ref PC is ~31 times faster than STB “in use” mode Ref PC is ~17 times faster than STB “standby” mode

19

1 2 3 4 5 6 7 8 9 10 11 12

BLASTAll Tasks

“user watching TV” while task runs

slide-20
SLIDE 20

Performance Assessment

1 10 100 1000 10000 100000 1000000 10000000 10000000

Makespan (log)

n/N=1 n/N=10 n/N=100 n/N=1000 n/N=10000

Simple analytical model was developed System parameters

Broadcast channel capacity =1Mbps Return channel capacity = 150kbps Image size = 10Mb Application Input + output data size = 1kb Average Processing time = from 53ms to 1.5hour

20

1 1 10 100 1000 10000 100000

Φ Φ Φ Φ

slide-21
SLIDE 21

DONE DONE

Research Roadmap

Define an (ideally generic) architecture Preliminary analysis

STB basic performance assessment

Proof of concept

Digital TV Network

Complete analysis

FUTURE FUTURE NOW NOW

Complete analysis

Mathematical model, simulation Case studies with different application profiles

Security issues Optimization in specific components (e.g. controller) Business models

Voluntary computing model, reward model Possibly with a real DTV broadcaster’s partnership

21

slide-22
SLIDE 22

Concluding Remarks

OddCI: a novel approach to DCI

Efficient setup, on-demand instantiation Great potential to enable DCI for Extremely High- Throughput Computing

OddCI can be instantiated over a DTV system OddCI can be instantiated over a DTV system

Less processing power, but huge pool size Brazilian DTV expects ~100 million receivers by 2016 European DVB: >500 million receivers deployed Chinese DTMB: ~100 million receivers (estimated)

Great research challenges to deal with

22

slide-23
SLIDE 23

OddCI: On-Demand Distributed Computing Infrastructure

Contact Information Contact Information Rostand Costa rostand.costa@lsd.ufcg.edu.br Francisco Brasileiro fubica@lsd.ufcg.edu.br Guido Lemos Filho guido@lavid.ufpb.br Dênio Mariz Sousa denio@ifpb.edu.br Rostand Costa rostand.costa@lsd.ufcg.edu.br Francisco Brasileiro fubica@lsd.ufcg.edu.br Guido Lemos Filho guido@lavid.ufpb.br Dênio Mariz Sousa denio@ifpb.edu.br

23