Spark RDD Operations
Transformations and Actions
1
Spark RDD Operations Transformations and Actions 1 RDD Processing - - PowerPoint PPT Presentation
Spark RDD Operations Transformations and Actions 1 RDD Processing Model RDD can be modeled using the Bulk Synchronous Parallel (BSP) model Communication Independent Local Independent Local Processor 1 Processing Processing Independent
1
2
Independent Local Processing Independent Local Processing Independent Local Processing Independent Local Processing Independent Local Processing Independent Local Processing Independent Local Processing Independent Local Processing
Processor 1 Processor 2 … Processor n Communication
3
4
Local Processing
5
input
6
input
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
f f f f f f f Local Processing Local Processing Local Processing f Network Transfer Final Result Driver Machine f f
22
23
24
25
26
27
28
29
s Local Processing Final Result Driver Machine z Local Processing c c c Network Transfer s s s s
30
31