CS5412: SPRING 2012 CLOUD COMPUTING
Ken Birman Lecture 1
CS5412 Spring 2012 (Cloud Computing: Birman) 1
CS5412: SPRING 2012 CLOUD COMPUTING Lecture 1 Ken Birman Welcome - - PowerPoint PPT Presentation
CS5412 Spring 2012 (Cloud Computing: Birman) 1 CS5412: SPRING 2012 CLOUD COMPUTING Lecture 1 Ken Birman Welcome to CS 5412... 2 A completely new course dedicated to the technology behind cloud computing! In my country of Khazackstan, many
CS5412 Spring 2012 (Cloud Computing: Birman) 1
In my country of Khazackstan, many excellent hacker. If hack cloud, can steal private stuff of whole world!
CS5412 Spring 2012 (Cloud Computing: Birman)
A general term for the style of computing that
Increasingly powerful and universal Enables a new kind of massively scaled, elastic app Our goal: understand the technology of the cloud,
Invent “highly assured cloud computing” options
CS5412 Spring 2012 (Cloud Computing: Birman)
3
Big data, updates by “owner” Dominated by reads Index... search... share Monetized by advertising, sales
CS5412 Spring 2012 (Cloud Computing: Birman)
4
High assurance Real-time control Runs “everything” Monitized by “roles”
eHealth CloudBank GridCloud eChauffer
Big data, updates by “owner”
Dominated by reads
Index... search... share
Monetized by advertising, sales
CS5412 Spring 2012 (Cloud Computing: Birman)
5
Huge data centers, far larger than past systems Very automated: far from where developers work.
Packed for high efficiency. Each machine hosts
Scheduled to keep everything busy (but overloads
CS5412 Spring 2012 (Cloud Computing: Birman)
6
Each data center is 11.5 times the size of a football field
Technology Cost in small- sized Data Center Cost in Large Data Center Cloud Advantage Network $95 per Mbps/ month $13 per Mbps/ month 7.1 Storage $2.20 per GB/ month $0.40 per GB/ month 5.7 Administration ~140 servers/ Administrator >1000 Servers/ Administrator 7.1
Slide provided by Roger Barga, Head of Cloud Computing, Microsoft
7
Machines busier, earn more $’s for each $ investment
Hardware handled a whole truckload at a time
Applications far more standardized
Automated management: few “sys admins” needed Power consumed near generator: less wastage Data center runs hot, wasting less on cooling Can “rent” resources rather than owning them
Supports new, extremely large-scale services
Elasticity to accomodate surging demands Can accumulate and access massive amounts of data But must read or process it in a massively parallel way Enables overnight emergence of major companies, but scalability model
does require new programming styles, and imposes new limits
CS5412 Spring 2012 (Cloud Computing: Birman)
8
Unfortunately, today’s cloud
Has a limited security model focused on credit card
Weakens consistency to achieve faster response times:
Pushes many aspects of failure handling to clients
Model supported by the “CAP” and “FLP” theorems,
Instead, cloud favors “BASE”
CS5412 Spring 2012 (Cloud Computing: Birman)
9
CAP: A theorem that says one can have just two from
FLP: A theorem that says it is impossible to guarantee
BASE: A cloud computing methodology that seeks
ACID: A database methodology: offers guaranted
CS5412 Spring 2012 (Cloud Computing: Birman)
10
Future cloud will need stronger guarantees than we
How can we achieve those? Are strong guarantees “scalable”?
Betting that the cloud will win
Cheaper than other options... ... and the cheaper option usually wins! But technology also advances over time, which helps!
CS5412 Spring 2012 (Cloud Computing: Birman)
11
Find ways to overcome limitations like FLP and CAP Define new assurance goals that might still be forms of
Only consider things that are real enough to be
But use theoretical tools when theory helps with goals.
CS5412 Spring 2012 (Cloud Computing: Birman)
12
We’ll treat the cloud as having three main parts
The client side: Everything on your device The Internet, as used by the cloud Data centers, which themselves have a “tiered” structure
Like a dedicated and
Yet massively scaled
Special theme:
13
Old world: we replicated servers for speed and
New world: scalability matters most of all
Focus is on extremely rapid response times Amazon estimates that each millisecond of delay has a
But our premise is that we can have scalability and
14
CS5412 Spring 2012 (Cloud Computing: Birman)
Security: Only correctly authorized users (who are
Privacy: Data doesn’t leak to intruders Rapid response despite failures or disruption Consistency and coordinated behavior Ability to overcome attacks or mishaps Guarantee that center operates at a high level of
Archival protection of important data
CS5412 Spring 2012 (Cloud Computing: Birman)
15
If we were to run high assurance solutions on
Goal is to leverage the cloud or even run on
This forces us to look hard at how things work
CS5412 Spring 2012 (Cloud Computing: Birman)
16
Interactive graphical interface: Executable code downloaded from web site Web Services “stub” procedures DNS used to locate the “right” cloud data center. SOAP/HTTP/TCP carry requests Client side Load-balancing router on cloud platform First-tier services do as much work as possible locally, often use cached data from tier-two key-value stores Inner tiers offer more sophisticated services but are only consulted if necessary Cloud service side Internet routing plays key roles
CS5412 Spring 2012 (Cloud Computing: Birman)
17
First tier: web page with associated request processing logic.
Second tier: highly scalable key- value storage, caches, used to support the first tier. The term sharding is often used to refer to the process of breaking a data set into smaller replicated data sets so that the data associated with each key value (a shard) is replicated on just a few nodes.
Inner tiers: Databases and index files used by the first and second tiers
Back-end: Batch processing applications that run out-of-band to create precomputed index files and analyze large data collections
1 1 1 1 1 1 1 1 1
Index DB
2 2 Shards 2 2 2 2 2 2
18
Load-balancing router: Role is to spray requests over available first-tier service
user), affinity (if possible, requests from a given client should route to the same server), load balancing, effective use of elasticity. First-tier services are limited to using soft-state or running without any state at all: on restart, any temporary files or data will be wiped away. They make extensive use of key- value stores and caches running at similar scale in the second tier of the cloud. Inner tiers offer more sophisticated services but are only consulted if necessary. These
strong consistency models, such as the ACID model or snapshot isolation, but these are costly and hence the first-tier shields the inner ones from load. Infrastructure services manage the ensemble, launching new services or shutting down active ones in response to shifting load patterns and failures. They may do this without warning, especially for services in the first-tier. Back-end applications run batch-style, often on very large numbers of machines with very large data sets. Using tools like MapReduce or Hadoop, they analyze those data sets and create helper files that will be used later by the first-tier.
19
Cloud area is just too big right now So we’ll look at good examples of representative
We’ll try and hit the famous ones you’ve heard about Also some less famous but interesting options
We’ll drill down on issues relating to replication with
CS5412 Spring 2012 (Cloud Computing: Birman)
20
Not everything scales Many things are hard to pull off when you have an
Must start by understanding what works, then see if
CS5412 Spring 2012 (Cloud Computing: Birman)
21
Which is better: Multithreaded servers?
22
Which is better: Multithreaded servers? Or multiple single-threaded servers?
23
Built from things that already exist and already
Expect that each 10x scaleup will still break things
When feasible, go for “no brainer” scalability
Armies of cheap machines and cheap storage A form of “brute force” solution
Success stories of today’s cloud often are
CS5412 Spring 2012 (Cloud Computing: Birman)
24
Integrated glucose monitor and Insulin pump receives instructions wirelessly Motion sensor , fall-detector
Home healthcare application
Healthcare provider monitors large numbers of remote patients Medication station tracks, dispenses pills
25
Need: Strong consistency and durability for data
Her stomach is upset and she hasn’t been eating well, yet her blood sugars are high.
Let’s stop the oral diabetes medication and increase her insulin, but we’ll need to monitor closely for a week Patient Records DB
26
Update the monitoring and alarms criteria for Mrs. Marsh as follows… Confirmed
Response delay seen by end-user would also include Internet latencies
Local response delay flush Send Send Send Execution timeline for an individual first-tier replica
Soft-state first-tier service A B C D
An online monitoring system might focus on real-time response
27
Air Traffic Controllers depend on consistent data With a single server this isn’t hard to guarantee
ATC DB Safe for US Air 221 to land?
CS5412 Spring 2012 (Cloud Computing: Birman)
28
But suppose we replicate the server? Designate one as “primary”
ATC DB Safe for US Air 221 to land? Backup
CS5412 Spring 2012 (Cloud Computing: Birman)
29
Failure detection will be key to consistency Otherwise could end up with two primaries!
ATC DB Safe for US Air 221 to land? ATC DB’ Safe for Air France 31 to take off?
CS5412 Spring 2012 (Cloud Computing: Birman)
30
Cloud computing systems
Overcome failure by replicating services But have no standard way to decide which server is in
Easiest form of failure “detection” is by timeout
But this might not be accurate: a network partitioning
Maybe just some connections will fail And if the network then recovers, the old ATC service might
CS5412 Spring 2012 (Cloud Computing: Birman)
31
How to scale? Just add more replicas, balance load Fault-tolerance? If something crashes but has replicas,
Elasticity? Launch new replicas or shut some down What makes replication hard are cases where we need
If we don’t worry about such things, may even be able
CS5412 Spring 2012 (Cloud Computing: Birman)
32
With iCloud, a lot of the data is pretty static If we update data (or applications) while also
Creates risk of “split brain” problems
CS5412 Spring 2012 (Cloud Computing: Birman)
33
2000 4000 6000 8000 10000 12000 250 400 550 700 850 messages /s time (s)
With small-scale replication, IPMC is a big win But IPMC “storms” can occur in a data center with
Wild load swings, heavy loss rates, thrashing
But it worked in the lab! CS5412 Spring 2012 (Cloud Computing: Birman)
34
Today’s cloud is built with simple components and
Companies spending a fortune to eliminate such issues They can limit scalability
Tomorrow’s cloud thus poses a deep question
Will it be limited to simple applications? Or can we migrate application like health care,
CS5412 Spring 2012 (Cloud Computing: Birman)
35
We’ll take a step-by-step approach First look at properties of the client platform Next consider Internet and its evolution under
Finally focus on the data center and look at it tier
CS5412 Spring 2012 (Cloud Computing: Birman)
36
High assurance means different things in each layer
A client depending on a browser worries about apps,
The network worries about efficient routing, BGP
The cloud worries about maintaining rapid response,
CS5412 Spring 2012 (Cloud Computing: Birman)
37
For the first few weeks, we’ll be more engineering
But then as we focus more on replicated processing and
Fault-tolerance will round off our investigation. We’ll
CS5412 Spring 2012 (Cloud Computing: Birman)
38
Approximately 25 lectures, with a few surprise
Must be in class on time to take quizzes. No makeups! We maintain videonotes, in case you miss a lecture. Since some people will be ill or out of town, can miss a
Individualized cloud computing projects (80%), can
Course is curved to a B+
CS5412 Spring 2012 (Cloud Computing: Birman)
39
Professor Birman gives most lectures Course roughly parallels his textbook; you can
But no assigned readings or homework from textbook Not really a “required” book, just a useful supplement
We have four quarter-time TAs with office hours Web page has contact info and more details
CS5412 Spring 2012 (Cloud Computing: Birman)
40
Wide range of topics (we’ll suggest many, or you
Must meet with a TA twice during the semester to
Graded by TA and Prof. Birman at end of semester Projects tackled by two people are expected to be
Project can “double” as an MEng project if you also
CS5412 Spring 2012 (Cloud Computing: Birman)
41
Integrate Isis2 with Live Objects Build services of the kind Amazon uses for system
Simulate and/or experiment on flow control for
Implement a realistic Air Traffic Control system with
Explore best options for wide area file transfer
CS5412 Spring 2012 (Cloud Computing: Birman)
42
We’ll be using Ken’s new textbook
Written as a teaching tool Ken doesn’t earn royalties on it!
Available end of February 2012
Will need to place orders online Won’t be available via Cornell bookstore this year
Until then, we’ll provide PDFs for materials related to
CS5412 Spring 2012 (Cloud Computing: Birman)
43
Solid understanding of computer archictectures,
Some basic appreciation of how networks work,
But no prior exposure to “distributed computing”
CS5412 Spring 2012 (Cloud Computing: Birman)
44
Our IS program has a wonderful course on large-
CS5413 looks at modern security challenges There are several courses on networks and mobility
CS5412 Spring 2012 (Cloud Computing: Birman)
45
Courses like CS5412 are aimed at
Advanced undergraduates from Cornell’s program MEng students looking for “knowledge they can use” Some PhD students (very much welcome) but course won’t be
Our focus is on practical aspects, things known to work Courses with CS6xxx numbering are specifically
Goal is to advance the frontier of knowledge
CS5412 Spring 2012 (Cloud Computing: Birman)
46