Applied research group: systems+database people building prototypes, publishing papers



SLIDES 1-5

Applied research group

  • Systems+database people building prototypes, publishing papers
  • Collaborating with the Big Data product group at Microsoft
  • Shipping our code to production
  • Open-sourcing our code: Apache Hadoop, REEF, Heron
SLIDES 6-7

Research areas:

  • Resource management
  • Distributed tiered storage
  • Query optimization
  • Log analytics
  • Stream processing

SLIDES 8-17

[Diagram: a central Resource Manager coordinating several Node Managers]

  • 1. Request
  • 2. Allocation
  • 3. Start task

Do we really need a Resource Manager?
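The three-step flow above (request, allocation, start task) can be sketched as a minimal loop. All class and method names here are illustrative stand-ins, not YARN's actual AM-RM or NM protocol APIs.

```python
# Minimal sketch of the request/allocation/start-task loop between an
# application and a central Resource Manager. Names are illustrative;
# YARN's real protocols are considerably richer.

class ResourceManager:
    def __init__(self, nodes):
        self.free = dict(nodes)          # node name -> free container slots

    def allocate(self, slots_needed):    # step 1: app sends a request
        for node, slots in self.free.items():
            if slots >= slots_needed:
                self.free[node] -= slots_needed
                return node              # step 2: RM returns an allocation
        return None                      # no capacity right now

class NodeManager:
    def __init__(self, name):
        self.name, self.running = name, []

    def start_task(self, task):          # step 3: app starts the task on the node
        self.running.append(task)

rm = ResourceManager({"N1": 2, "N2": 2})
nms = {n: NodeManager(n) for n in ("N1", "N2")}

node = rm.allocate(1)                    # 1. Request
if node is not None:                     # 2. Allocation
    nms[node].start_task("j1-task-0")    # 3. Start task
```

The point of the later slides is that each trip through this loop costs a round trip to the central RM, which motivates the "do we really need a Resource Manager?" question.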

SLIDES 18-20

Hadoop 1 world vs Hadoop 2 world (stack, bottom to top):

  • Hardware
  • File System: HDFS 1 → HDFS 2
  • Cluster OS (Resource Management): Hadoop 1.x (MapReduce) → YARN
  • Programming Model(s): MR v1 → MR v2, Tez, Giraph, Storm, Dryad, REEF, ...
  • Application Frameworks: Hive / Pig
  • Users: ad-hoc apps

Scope of resource-management designs:

  • monolithic (e.g., Spark)
  • reuse of RM component (e.g., Heron)
  • YARN: layering abstractions

SLIDES 21-22

But is all this good enough for the Microsoft clusters?

SLIDE 23

Requirements:

  • High resource utilization
  • Scalability
  • Workload heterogeneity
  • Production jobs and predictability

SLIDE 24

100% utilization

SLIDES 25-30

  • Wide variety of workloads
  • Recurring jobs with deadlines (>60%)
  • Predictability
  • Over-provisioned clusters
SLIDES 31-32

CISL projects:

  • Rayon/Morpheus
  • Mercury/Yaq
  • YARN Federation
  • Medea

4 Hadoop committers in CISL; 404 patches as of last night

SLIDE 33

[Hadoop 3.0; ATC 2015, EuroSys 2016]

SLIDES 34-44

[Animation: jobs j1, j2 request containers from the RM and run on nodes N1, N2]

  • Feedback delays: nodes sit idle between allocations

Actual utilization by task duration:

  Task duration   5 sec    10 sec   50 sec   Mixed-5-50   Cosmos-gm
  Utilization     60.59%   78.35%   92.38%   78.54%       83.38%
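A back-of-the-envelope model shows why feedback delays hurt short tasks most: if a slot runs a task for t seconds and then sits idle for roughly d seconds while the next allocation round-trips through the RM, its utilization is about t / (t + d). This is my illustrative model, not the measurement methodology behind the numbers above; the delay value is hypothetical.

```python
# Illustrative model (not the paper's methodology): a slot alternates between
# running a task of duration t and idling for a feedback delay d while the
# next allocation round-trips through the Resource Manager.
def slot_utilization(task_secs, feedback_delay_secs):
    return task_secs / (task_secs + feedback_delay_secs)

# With a hypothetical ~3 s feedback delay, short tasks waste a large fraction
# of the slot, while long tasks amortize the delay:
for t in (5, 10, 50):
    print(t, round(slot_utilization(t, 3.0), 3))
# 5 -> 0.625, 10 -> 0.769, 50 -> 0.943
```

Note how the model reproduces the qualitative trend in the table: utilization climbs steeply as task duration grows relative to the feedback delay.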

SLIDE 45

  • Introduce task queuing at nodes
      • Mask feedback delays
      • Improve cluster utilization
      • Improve task throughput (by up to 40%)
  • Container types: GUARANTEED and OPPORTUNISTIC
      • Keep guarantees for important jobs
      • Use opportunistic execution to improve utilization
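The two container types can be sketched as follows: a node admits GUARANTEED containers immediately, keeps a local queue of OPPORTUNISTIC containers to fill otherwise-idle slots, and preempts an opportunistic container when a guaranteed one needs the slot. This is an illustrative sketch under those assumptions, not Hadoop's implementation.

```python
# Sketch (illustrative, not Hadoop's code) of a node that runs GUARANTEED
# containers immediately and uses a local queue of OPPORTUNISTIC containers
# to keep otherwise-idle slots busy.
from collections import deque

class Node:
    def __init__(self, slots):
        self.slots = slots
        self.running = []                      # list of (container, type)
        self.queue = deque()                   # waiting opportunistic containers

    def submit(self, container, kind):
        if kind == "GUARANTEED":
            if len(self.running) >= self.slots:
                self._preempt_opportunistic()  # make room at an opportunistic task's expense
            self.running.append((container, kind))   # guaranteed always admitted here
        else:
            self.queue.append(container)       # OPPORTUNISTIC waits its turn
            self._fill_idle_slots()

    def _fill_idle_slots(self):
        while self.queue and len(self.running) < self.slots:
            self.running.append((self.queue.popleft(), "OPPORTUNISTIC"))

    def _preempt_opportunistic(self):
        for entry in self.running:
            if entry[1] == "OPPORTUNISTIC":
                self.running.remove(entry)
                self.queue.appendleft(entry[0])  # requeue so it can retry later
                return
```

For example, on a one-slot node an opportunistic container runs while the slot is idle, then gets kicked back to the queue the moment a guaranteed container arrives: guarantees are kept, and the idle time is still harvested.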
SLIDES 46-56

[Animation: with node-side queues, jobs j1, j2 keep N1, N2 busy between RM allocations]
SLIDES 57-60

So all we need to do is use long queues?

SLIDES 61-63

Despite the utilization gains, long queues can be detrimental for job completion times. Proper queue management techniques are required.

SLIDES 64-67

[Animation: tasks queuing up at nodes N1, N2, N3]

SLIDES 68-69

Queue management techniques:

  • Place tasks to node queues
  • Prioritize task execution (queue reordering)
  • Bound queue lengths

Yaq improves median job completion time by 1.7x over YARN

SLIDES 70-75

[Animation: the RM places tasks on N1, N2, N3 using per-node queue length and queue wait time]
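The placement signals above differ in quality: queue length alone can mislead, because a short queue of long tasks waits longer than a long queue of short ones. A hedged sketch of wait-time-based placement (the function and the duration estimates are illustrative, not Yaq's actual interface):

```python
# Illustrative placement sketch: rather than the node with the shortest queue,
# pick the node with the smallest *estimated queue wait time*, i.e. the sum of
# the estimated durations of the tasks already queued there.
def place_task(node_queues):
    """node_queues: {node: [estimated durations (s) of tasks queued there]}"""
    return min(node_queues, key=lambda n: sum(node_queues[n]))

queues = {
    "N1": [50.0],            # 1 queued task, but a long one: ~50 s wait
    "N2": [5.0, 5.0, 5.0],   # 3 queued tasks, all short:    ~15 s wait
    "N3": [5.0, 50.0],       #                               ~55 s wait
}
print(place_task(queues))    # N2: shortest wait despite the longest queue
```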

SLIDES 76-80

Job-aware queue reordering policies:

  • Shortest Remaining Job First (SRJF)
  • Least Remaining Tasks First (LRTF)

[Example: queued tasks from jobs with j2: 5 tasks, j3: 9 tasks, j1: 21 tasks remaining]
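The reordering idea can be sketched in a few lines: under Least Remaining Tasks First, a node sorts its queue so that tasks belonging to the job with the fewest tasks still outstanding run first, letting short jobs finish early. The function name and data shapes are illustrative, not Yaq's API.

```python
# Illustrative sketch of job-aware queue reordering: Least Remaining Tasks
# First (LRTF) runs queued tasks of the job with the fewest tasks remaining.
def reorder_lrtf(queue, remaining_tasks):
    """queue: list of (task, job); remaining_tasks: {job: tasks still to run}"""
    return sorted(queue, key=lambda tj: remaining_tasks[tj[1]])

queue = [("t1", "j1"), ("t2", "j3"), ("t3", "j2"), ("t4", "j1")]
remaining = {"j1": 21, "j2": 5, "j3": 9}   # counts from the example on the slide
print([t for t, _ in reorder_lrtf(queue, remaining)])
# j2's task first, then j3's, then j1's: ['t3', 't2', 't1', 't4']
```

SRJF is the same pattern with a different key: estimated remaining *work* (time) of the job instead of its remaining task count.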

SLIDES 81-82

  • Unmanaged queues: lower throughput, longer job completion times
  • Yaq: 1.7x improvement in median JCT over YARN

SLIDE 83

Ongoing directions:

  • Container types: distributed scheduling with any distributed scheduler
  • Over-commitment and multi-tenancy
  • Pricing
SLIDES 84-85

Summary: task queuing at nodes improves cluster utilization; queue management techniques protect job completion time.