Xiangyao Yu 10/21/2020
CS 764: Topics in Database Management Systems Lecture 14: MapReduce
Announcement
Mid-term course evaluation DDL: 10/23
Please submit project proposal to the review website DDL: Oct 26
Please submit a review for the
[Figure: a cluster of thousands of machines, each with its own CPU and memory, connected by a network, with the Google File System (GFS); map and reduce tasks are placed on these machines]
[...] machines assigned to the MapReduce computation and peaks at over 30 GB/s when 1764 workers have been assigned [...]
[...] reading the input data [...]
[1] Stonebraker, Michael, et al. "MapReduce and parallel DBMSs: friends or foes?" Communications of the ACM, 2010.
SELECT * FROM S, R WHERE S.id = R.id
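One common way to express an equi-join like the query above in the MapReduce model is a reduce-side join: map tags each row with its table name and emits it under the join key, the shuffle groups rows by key, and reduce pairs up the S and R rows in each group. The following is a minimal single-process sketch of that idea, not necessarily the approach the lecture has in mind; the function names (map_phase, shuffle, reduce_phase, mapreduce_join) and the dict-based row format are assumptions made for illustration.

```python
from collections import defaultdict


def map_phase(table_name, rows):
    """Emit (join key, (table tag, row)) for every input row."""
    for row in rows:
        yield row["id"], (table_name, row)


def shuffle(pairs):
    """Group intermediate values by key, as the framework would between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Pair every S row with every R row that shares the same join key."""
    s_rows = [row for tag, row in values if tag == "S"]
    r_rows = [row for tag, row in values if tag == "R"]
    for s in s_rows:
        for r in r_rows:
            yield {"S": s, "R": r}


def mapreduce_join(s_rows, r_rows):
    """Hypothetical driver: runs map, shuffle, and reduce in one process."""
    pairs = list(map_phase("S", s_rows)) + list(map_phase("R", r_rows))
    groups = shuffle(pairs)
    output = []
    for key in sorted(groups):
        output.extend(reduce_phase(key, groups[key]))
    return output


if __name__ == "__main__":
    S = [{"id": 1, "a": "x"}, {"id": 2, "a": "y"}]
    R = [{"id": 2, "b": "p"}, {"id": 2, "b": "q"}, {"id": 3, "b": "z"}]
    for joined in mapreduce_join(S, R):
        print(joined)
```

In this sketch only id = 2 appears in both inputs, so the output contains the one matching S row paired with each of the two matching R rows. In a real cluster every row crosses the network during the shuffle, which is one reason join performance is a recurring point in comparisons between MapReduce and parallel DBMSs.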