The Bigλ Project: Aws Albarghouthi and Calvin Smith, University of Wisconsin-Madison


Jan 22, 2024


SLIDE 1

The Bigλ Project

Aws Albarghouthi and Calvin Smith, University of Wisconsin-Madison

SLIDE 2

SLIDE 3

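[Diagram: MapReduce dataflow. Each item of the input data (i1, i2, i3, …) is passed through map to give m(i1), m(i2), m(i3), …; the mapped values are shuffled and then combined by reduce to produce the output data.]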
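The dataflow in the diagram can be sketched in a few lines of Python. This is a minimal single-machine illustration, not the deck's implementation: the helper name `map_reduce`, the key/value convention for `m`, and the word-counting example are all assumptions made here for illustration.

```python
from collections import defaultdict
from functools import reduce


def map_reduce(inputs, m, r):
    """Apply m to each input, shuffle by key, then fold each group with r."""
    # map: every input i becomes a (key, value) pair m(i)
    mapped = [m(i) for i in inputs]
    # shuffle: group the values that share a key
    groups = defaultdict(list)
    for key, value in mapped:
        groups[key].append(value)
    # reduce: combine each group's values into a single output value
    return {key: reduce(r, values) for key, values in groups.items()}


# Hypothetical example: count words by their first letter.
words = ["map", "merge", "reduce", "shuffle", "sort"]
result = map_reduce(words, lambda w: (w[0], 1), lambda a, b: a + b)
print(result)
```

In a real data-parallel runtime the map and reduce stages run on many machines and the shuffle moves data between them; the sketch only mirrors the three stages in order.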

SLIDE 4

Bigλ: Analyses from Examples

Synthesize data-parallel programs from input/output examples [PLDI16]

[Figure: an example, shown as a set of inputs together with its expected output]

SLIDE 5

Challenges

Non-determinism: generate proven-deterministic solutions
Variety of domains: parameterize by extensible APIs
Sparse search space: syntactically restrict to data-parallel programs
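To see why non-determinism is a challenge, consider that a shuffle may deliver values to a reducer in any order, so a reducer whose result depends on that order gives unpredictable output. The sketch below is a brute-force test on one sample, which is much weaker than the proof the slide refers to; the slides do not say how that proof is obtained. The helper name `is_order_insensitive` is hypothetical, and the check only covers left-to-right folds, not tree-shaped reductions.

```python
from functools import reduce
from itertools import permutations


def is_order_insensitive(r, values):
    """True if folding `values` with r gives the same result for every ordering."""
    results = {reduce(r, order) for order in permutations(values)}
    return len(results) == 1


values = [2, 4, 3]
add_ok = is_order_insensitive(lambda a, b: a + b, values)  # addition
sub_ok = is_order_insensitive(lambda a, b: a - b, values)  # subtraction
print(add_ok, sub_ok)
```

Addition passes the check on this sample while subtraction fails it, which is the kind of reducer a synthesizer would need to rule out.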

SLIDE 6

Bias search heavily towards data-parallel programs

Higher-order sketches

Bigλ uses 8 templates, gathered from reference implementations

SLIDE 7

Bias search heavily towards data-parallel programs

Higher-order sketches

e.g.

map x . reduce x
flatmap x . reduce x . apply x
map x . reduceByKey x . filter x

Bigλ uses 8 templates, gathered from reference implementations
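One way to picture a higher-order sketch is as a fixed pipeline shape whose holes are filled by search. The sketch below assumes the template `map ? . reduce ?` and enumerates completions from a tiny component library, keeping those that agree with every example. The component names, the library itself, and the function `fill_map_reduce` are hypothetical and chosen only for illustration; this is not Bigλ's search procedure.

```python
from functools import reduce
from itertools import product

# Hypothetical component library; a real system would draw these from an API.
MAPPERS = {"identity": lambda x: x, "square": lambda x: x * x, "length": len}
REDUCERS = {"add": lambda a, b: a + b, "max": max, "min": min}


def fill_map_reduce(examples):
    """Enumerate completions of `map ? . reduce ?` that fit all examples."""
    solutions = []
    for (m_name, m), (r_name, r) in product(MAPPERS.items(), REDUCERS.items()):
        try:
            if all(reduce(r, [m(i) for i in inp]) == out for inp, out in examples):
                solutions.append((m_name, r_name))
        except TypeError:
            # This completion does not apply to the inputs; skip it.
            continue
    return solutions


# Input/output examples: each input list maps to one output value.
examples = [([1, 2, 3], 14), ([2, 2], 8)]
found = fill_map_reduce(examples)
print(found)
```

Because the pipeline shape is fixed, the search only ranges over the holes (9 candidates here), which illustrates how a template can bias the search towards data-parallel programs.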

SLIDE 8

Who uses the most #hashtags?

@Alice: “Hello AAIP #aaip #germany”
@Bob: “Coffee machine refilled yet? #caffeine #java #4thcup #zzz”
@Claire: “Torn between wine cellar and seminar #wine #seminar #zzz”

SLIDE 9

@Alice: “Hello AAIP #aaip #germany”
@Bob: “Coffee machine refilled yet? #caffeine #java #4thcup #zzz”
@Claire: “Torn between wine cellar and seminar #wine #seminar #zzz”

Who uses the most #hashtags?

2, 4, 3…must be @Bob!

Output: @Bob

SLIDE 10

let p = map m . reduce r . apply f

@Alice: “Hello AAIP #aaip #germany”
@Bob: “Coffee machine refilled yet? #caffeine #java #4thcup #zzz”
@Claire: “Torn between wine cellar and seminar #wine #seminar #zzz”

SLIDE 11

let p = map m . reduce r . apply f
  where m = λt. (len(filter(is_hashtag, t)), author(t))

@Alice: “Hello AAIP #aaip #germany”
@Bob: “Coffee machine refilled yet? #caffeine #java #4thcup #zzz”
@Claire: “Torn between wine cellar and seminar #wine #seminar #zzz”

{2, @Alice} {4, @Bob} {3, @Claire}

SLIDE 12

let p = map m . reduce r . apply f
  where m = λt. (len(filter(is_hashtag, t)), author(t))

{2, @Alice} {4, @Bob} {3, @Claire}

@Alice: “Hello AAIP #aaip #germany”
@Bob: “Coffee machine refilled yet? #caffeine #java #4thcup #zzz”
@Claire: “Torn between wine cellar and seminar #wine #seminar #zzz”
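The program p can be tried out in Python on the three tweets. Only m is given on the slides up to this point; the definitions of r (keep the pair with the larger count) and f (take the author from the final pair) below are assumptions chosen to reproduce the expected answer, and the representation of a tweet as an (author, text) tuple is likewise hypothetical.

```python
from functools import reduce

tweets = [
    ("@Alice", "Hello AAIP #aaip #germany"),
    ("@Bob", "Coffee machine refilled yet? #caffeine #java #4thcup #zzz"),
    ("@Claire", "Torn between wine cellar and seminar #wine #seminar #zzz"),
]


def is_hashtag(word):
    return word.startswith("#")


def m(t):
    """From the slide: map a tweet to (number of hashtags, author)."""
    author, text = t
    return (len(list(filter(is_hashtag, text.split()))), author)


def r(a, b):
    """Assumed reducer: keep the pair with the larger hashtag count."""
    return max(a, b)  # tuples compare by count first, then by author name


def f(pair):
    """Assumed final step: extract the author from the winning pair."""
    return pair[1]


def p(ts):
    # map m . reduce r . apply f, read left to right as a pipeline
    return f(reduce(r, map(m, ts)))


pairs = [m(t) for t in tweets]
answer = p(tweets)
print(pairs, answer)
```

With this choice of r, ties in the count are broken by author name, a side effect of tuple comparison rather than anything stated in the slides.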

SLIDE 13