Using Proxies to Accelerate Cloud Applications Siddharth - - PowerPoint PPT Presentation

using proxies to accelerate cloud applications
SMART_READER_LITE
LIVE PREVIEW

Using Proxies to Accelerate Cloud Applications Siddharth - - PowerPoint PPT Presentation

Using Proxies to Accelerate Cloud Applications Siddharth Ramakrishnan Jon Weissman Department of CSE University of Minnesota Introduction Cloud ecosystem (Gannon 2009) SAAS: (Google Spreadsheet, Gmail) I/P-AAS: (Virt: EC2/S3,


slide-1
SLIDE 1

Using Proxies to Accelerate Cloud Applications

Siddharth Ramakrishnan Jon Weissman Department of CSE University of Minnesota

slide-2
SLIDE 2

Introduction

  • Cloud ecosystem (Gannon 2009)

– SAAS: (Google Spreadsheet, Gmail) – I/P-AAS: (Virt: EC2/S3, Azure), Google AppEngine – Parallel frameworks: (MapReduce cloud)

  • Scale-up/Scale-down
  • Remote execution/hosting
  • Performance
  • Transparency
slide-3
SLIDE 3

Application View: Cloud Diversity

  • Data clouds

– S3, SkySurvey, GoogleHealth, …

  • Compute clouds

– EC2, IronScale, …

  • Service clouds

– Gmail, Gmaps, Google-earth

slide-4
SLIDE 4

Trends

  • Specialization and diversity

– Functional and non-functional – Non-functional: security, reliability, SLAs, cost – Functional: type of data, type of services, …

  • Distributed clouds

– Smaller footprint data center containers geographically dispersed – Logical cloud federation: OpenCirrus

slide-5
SLIDE 5

Confluence

  • Diversity of clouds + push for distribution
  • (1) No single cloud model will rule
  • (2) New distributed models are attractive
  • (3) Emerging applications will utilize multiple

clouds “multi-cloud” applications

slide-6
SLIDE 6

An Aside: Edge Systems

  • Edge systems

– Compute-oriented: BOINC, @home, … – Data-oriented: P2P, Bittorent, openDHT, …

Appeal: scale, cost, *diversity* => Edge computers can play an important role in multi-cloud applications

slide-7
SLIDE 7

Multi-Cloud Applications

  • Specialization => data-intensive applications

will increasingly span multiple clouds

– data is dispersed across multiple clouds

  • Distributed data mining

– Ex: weather data + commodity prices

  • Scientific workflows

– Ex: life science: GenBank<->BLAST<->PubMed, …

  • Mashups

– Ex: GoogleEarth + CDC pandemic data

  • Multi-cloud parallel frameworks

– Ex: MapReduce, AllPairs, …

slide-8
SLIDE 8

The Problem

S2 S1 E

  • Current cloud interaction paradigm is client-server

– Web Services or http

  • Data flows back and forth to end-client application

compute on S1 output Better available nodes

slide-9
SLIDE 9

Solution: Proxy Architecture: 50K ft

Exploit diversity

  • f proxy nodes

Resource constrained

slide-10
SLIDE 10

S2 S1 E Proxy Network P

slide-11
SLIDE 11

Data-oriented Proxy Roles

  • Cloud service interaction

– Proxy as a client

  • Routing

– Proxy routes data to other proxies

  • Computing => Grids

– Proxy computes data operators: compress, filter, merge, mine, …

  • Caching => P2P

– Proxy caches data (from cloud, computations, …)

slide-12
SLIDE 12

Proxy Network

  • Where do proxies come from?

– volunteers, deployed CDNs, …

  • How do proxies form overlays?

– is there a system-wide overlay and/or application- specific overlays? – need more experience with multi-cloud applications

slide-13
SLIDE 13

How Much Network Diversity?

  • Extensive evaluation of PlanetLab and Internet

services

Need download

  • 1. Cluster of good proxies
  • 2. Best proxy depends on

cloud service

slide-14
SLIDE 14

Proxy Hop Penalty?

  • Despite network proximity and data

reduction, proxies may add a network hop

– 1600 paths – Over 70% benefited by intermediary – Over 20% performance improvement

S1 E P

slide-15
SLIDE 15

Example: Montage

slide-16
SLIDE 16

Montage Speedup

Initiator is the workflow engine, remote from Montage services One proxy per Montage service, co-located

slide-17
SLIDE 17

Example: Image Processing

Basic workflow Enhanced proxy workflow

slide-18
SLIDE 18

Results

There exist many proxies that can accelerate this application

Image processing cloud location end-user image server location fixed

slide-19
SLIDE 19

Summary

  • Cloud specialization will trigger a new wave of

multi-cloud applications

  • Proposed a proxy network to “accelerate” these

applications => bottleneck awareness

  • Many research challenges

– Proxy node selection – Proxy network configuration