CS 744: SPLIT ANNOTATIONS
Shivaram Venkataraman Fall 2020
ADMINISTRIVIA
- Course project check-ins due tomorrow on HotCRP!
- In-class project presentations Dec 8th and Dec 10th; sign-up sheet on Piazza
- Each slot is 5 min: a 4 min presentation + 1 min for questions; upload slides beforehand
MOTIVATION
- As in the cloud computing systems we have seen this semester, applications compose multiple functions and libraries
- Goal: compose libraries while maintaining efficiency on multi-core machines
// inputs are double arrays with `len` elems
vdLog1p(len, d1, d1);          // d1 = log(1 + d1)
vdAdd(len, d1, tmp, d1);       // d1 = d1 + tmp
vdDiv(len, d1, vol_sqrt, d1);  // d1 = d1 / vol_sqrt
Example: an option pricing workload built on Intel MKL. MKL optimizes each individual function, but data movement across the calls is expensive, even within a single machine.
When arrays are larger than the layers of the CPU cache, each call streams its reads and writes all the way to DRAM, even though the data fits in memory. Compilers such as TVM can fuse these loops, but require rewriting the computation.
One approach: replace every library call with one that emits an intermediate representation (IR), then compile all the IR together. This enables loop fusion and pipelining, but requires lots of code change to existing, rich libraries such as NumPy and Pandas. We want the optimization benefits without being that intrusive.
GOALS
- Provide data movement optimizations across libraries
- Require minimal or no changes to existing libraries (i.e., not very intrusive)
- Leverage existing hand-tuned code (e.g., matrix multiply, FFT) for speedups
d1 = price * strike
d1 = np.log2(d1) + strike
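Run as written, each NumPy line above makes a full pass over the arrays. A manual sketch of what split annotations automate (the function name and batch size here are illustrative, not part of the system): process cache-sized batches so the intermediate `d1` stays in cache between the two operations.

```python
import numpy as np

def pipelined(price, strike, batch=4096):
    # Manual version of what split annotations automate: process
    # cache-sized batches so d1 stays in cache between the two ops.
    out = np.empty_like(price)
    for i in range(0, len(price), batch):
        d1 = price[i:i + batch] * strike[i:i + batch]
        out[i:i + batch] = np.log2(d1) + strike[i:i + batch]
    return out
```

The result is identical to the unbatched version; only the order of memory traffic changes.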
KEY IDEA
- Build an execution graph of library calls
- Push cache-sized splits of the data through every function in one pass: split, pipeline, merge
@splittable(size: SizeSplit(size), a: ArraySplit(size), mut out: ArraySplit(size))
void vdLog1p(long size, double *a, double *out)
Split types: N⟨V0...Vn⟩, e.g., ArraySplit⟨10, 2⟩ for a 10-element array in 2 pieces. A split annotation gives a name and split type to each argument and return value.
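A split type can be sketched in plain Python (a hypothetical analogue, not the system's actual API): it knows how to cut a value into contiguous pieces.

```python
class ArraySplit:
    """Hypothetical Python analogue of an ArraySplit<n, pieces> split type."""
    def __init__(self, n, pieces):
        self.n, self.pieces = n, pieces

    def split(self, arr):
        # Yield `pieces` contiguous slices covering all n elements.
        step = (self.n + self.pieces - 1) // self.pieces
        for start in range(0, self.n, step):
            yield arr[start:start + step]
```

For example, `ArraySplit(10, 2)` yields two 5-element slices of a 10-element array.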
Annotations are easier to provide for a given library than changing its code: a library has far fewer data types than functions. For example, vdScale(long size, int scalar, double *a) is annotated with a: ArraySplit(size), so `a` is split in the same fashion as in vdLog1p and the two functions can be pipelined. An expert who knows the library writes the annotation once.
@splittable(m: MatrixSplit(m, axis), axis: _)
vector sumReduceToVector(matrix m, int axis);
Pipelining rules:
- If two functions' data shares the same split type, they can safely be pipelined; the splitter produces pieces from start/end parameters (e.g., split(double *a, int start, int end, ...))
- If a function cannot be pipelined, merge the prior partial results before calling the next function
Example: log and multiply can be pipelined, but a reduction like sum ends the pipeline; a merger class (e.g., implemented inside a ReduceSplit-style split type) combines the partial results. The runtime lazily evaluates the execution graph to maximize pipelining.
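The merge step for a reduction can be sketched as follows (class and names are illustrative, not the paper's actual implementation): each batch produces a partial result, and one merge at the stage boundary combines them.

```python
class ReduceMerger:
    # Hypothetical merger: combines per-batch partial results
    # of a reduction into the final value.
    def __init__(self, op, identity):
        self.op, self.identity = op, identity

    def merge(self, partials):
        out = self.identity
        for p in partials:
            out = self.op(out, p)
        return out

# Per-batch partial sums, then one merge at the stage boundary:
partials = [sum(batch) for batch in ([1, 2], [3, 4], [5])]
total = ReduceMerger(lambda a, b: a + b, 0).merge(partials)
```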
Writing Annotations: Function decorators
@sa((DataFrameSplit(), DataFrameSplit()), {}, DataFrameSplit())
def divide(series, value):
Capturing the graph
- Wraps the original Python function and registers it in the graph
- Returns a Future object
Evaluation points
- Lazily evaluate by overriding __getattribute__
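The lazy-evaluation mechanism can be sketched in plain Python (a simplified, hypothetical stand-in for Mozart's client library, not its actual code): attribute access on the future is the evaluation point.

```python
class Future:
    """Minimal lazy future; attribute access triggers evaluation."""
    def __init__(self, fn, *args):
        object.__setattr__(self, "_state",
                           {"fn": fn, "args": args, "done": False, "val": None})

    def _force(self):
        st = object.__getattribute__(self, "_state")
        if not st["done"]:
            # Force any argument futures first, then run the wrapped function.
            args = [a._force() if isinstance(a, Future) else a
                    for a in st["args"]]
            st["val"] = st["fn"](*args)
            st["done"] = True
        return st["val"]

    def __getattribute__(self, name):
        if name in ("_force", "_state"):
            return object.__getattribute__(self, name)
        # Any other attribute access is an evaluation point:
        # force the graph, then delegate to the real value.
        return getattr(object.__getattribute__(self, "_force")(), name)
```

Chained futures force their arguments recursively, so a whole graph of registered calls evaluates on the first attribute access.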
The divide function already exists in the Pandas library; if somebody calls it, the call is intercepted by the decorator (similar in spirit to Ray or PyWren). A Future[DataFrame] is constructed internally. At an evaluation point such as print(df), the runtime internally executes the graph and returns the result.
EXECUTION
- Turn the dataflow graph into an execution plan: a series of stages, where each stage splits, pipelines, and merges
- Choosing a batch size: set the number of elements per batch using the L2 cache size, i.e., compute how many elements will fit in the L2 cache
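The batch-size calculation is simple arithmetic (numbers below are illustrative, not measured from the paper):

```python
def batch_size(l2_bytes, n_arrays, elem_bytes=8):
    # Number of elements per batch so that all live arrays of a
    # stage fit in the L2 cache simultaneously.
    return l2_bytes // (n_arrays * elem_bytes)

# e.g., a 1 MiB L2 cache with 3 double arrays live in the stage
elems = batch_size(1 << 20, 3)
```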
SUMMARY
- Applications compose data processing libraries
- Data movement is the bottleneck on multi-core machines
- Key idea: split and pipeline data across functions
- Split annotations reduce programmer effort
- Mozart: client library and runtime for lazy evaluation
Discussion: for an iterative workload, each iteration will add stages to the graph. Can we pipeline across iterations?
https://forms.gle/F2LJ21qFkBGWyypB7
How does the dataflow graph that is executed by Mozart compare to dataflow graphs we have seen in other systems like Spark/PyTorch?
Similarities
- Lazy execution
- Narrow dependencies are pipelined
Differences
- Fault tolerance is not the goal in Mozart: no checkpointing
- Functions are black boxes to Mozart
- Merging (Mozart) vs. shuffling (Spark)
,
→
increase men bandwidth
" two?e'
"
comp
. expensiveMhienednhfthreads
, mid 7
speedier
exp
→
n Ix
I
more
threads
can
compute intensive
leed
E
mem
functions
⇒ not how
bottleneck
much
speed up
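This scaling observation can be made concrete with a back-of-the-envelope model (all numbers illustrative): per-element time is the maximum of compute time and memory time, and only the compute term shrinks with more threads.

```python
def speedup(threads, flops_per_elem, flops_per_core=1e9, bw_elems=1e9):
    # Time per element is the max of compute time and memory time;
    # memory time does not shrink with more threads.
    t1 = max(flops_per_elem / flops_per_core, 1.0 / bw_elems)
    tn = max(flops_per_elem / (threads * flops_per_core), 1.0 / bw_elems)
    return t1 / tn
```

With these made-up constants, a compute-heavy function (100 flops/element) scales to 8x on 8 threads, while a function doing 1 flop/element is already bandwidth-bound and gains nothing.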
Next class: TPUs
Project check-ins on HotCRP!