Data Intensive Computing
- B. Ramamurthy
This work is Partially Supported by NSF DUE Grant#: 0737243, 0920335 bina@buffalo.edu
1 6/23/2010 Bina Ramamurthy 2010
Data Intensive Computing B. Ramamurthy This work is Partially - - PowerPoint PPT Presentation
Data Intensive Computing B. Ramamurthy This work is Partially Supported by NSF DUE Grant#: 0737243, 0920335 bina@buffalo.edu 6/23/2010 Bina Ramamurthy 2010 1 Indian Parable: Elephant and the Blind men 6/23/2010 Bina Ramamurthy 2010 2
1 6/23/2010 Bina Ramamurthy 2010
6/23/2010 Bina Ramamurthy 2010 2
6/23/2010 Bina Ramamurthy 2010 3
6/23/2010 Bina Ramamurthy 2010 4
6/23/2010 Bina Ramamurthy 2010 5
fcrawler.looksmart.com - - [26/Apr/2000:00:00:12 -0400] "GET /contacts.html HTTP/1.0" 200 4595 "-" "FAST-WebCrawler/2.1-pre2 (ashen@looksmart.net)" fcrawler.looksmart.com - - [26/Apr/2000:00:17:19 -0400] "GET /news/news.html HTTP/1.0" 200 16716 "-" "FAST-WebCrawler/2.1-pre2 (ashen@looksmart.net)" ppp931.on.bellglobal.com - - [26/Apr/2000:00:16:12 -0400] "GET /download/windows/asctab31.zip HTTP/1.0" 200 1540096 "http://www.htmlgoodies.com/downloads/freeware/webdevelopment/15.html" "Mozilla/4.7 [en]C-SYMPA (Win95; U)" 123.123.123.123 - - [26/Apr/2000:00:23:48 -0400] "GET /pics/wpaper.gif HTTP/1.0" 200 6248 "http://www.jafsoft.com/asctortf/" "Mozilla/4.05 (Macintosh; I; PPC)" 123.123.123.123 - - [26/Apr/2000:00:23:47 -0400] "GET /asctortf/ HTTP/1.0" 200 8130 "http://search.netscape.com/Computers/Data_Formats/Document/Text/RTF" "Mozilla/4.05 (Macintosh; I; PPC)" 123.123.123.123 - - [26/Apr/2000:00:23:48 -0400] "GET /pics/5star2000.gif HTTP/1.0" 200 4005 "http://www.jafsoft.com/asctortf/" "Mozilla/4.05 (Macintosh; I; PPC)" 123.123.123.123 - - [26/Apr/2000:00:23:50 -0400] "GET /pics/5star.gif HTTP/1.0" 200 1031 "http://www.jafsoft.com/asctortf/" "Mozilla/4.05 (Macintosh; I; PPC)" 123.123.123.123 - - [26/Apr/2000:00:23:51 -0400] "GET /pics/a2hlogo.jpg HTTP/1.0" 200 4282 "http://www.jafsoft.com/asctortf/" "Mozilla/4.05 (Macintosh; I; PPC)" 123.123.123.123 - - [26/Apr/2000:00:23:51 -0400] "GET /cgi-bin/newcount?jafsof3&width=4&font=digital&noshow HTTP/1.0" 200 36 "http://www.jafsoft.com/asctortf/" "Mozilla/4.05 (Macintosh; I; PPC)"
6/23/2010 Page 6 Bina Ramamurthy 2010
7 6/23/2010 Bina Ramamurthy 2010
8
Data scale Compute scale Payroll Kilo Mega Giga Tera
MFLOPS GFLOPS TFLOPS PFLOPS
Peta Digital Signal Processing Weblog Mining Business Analytics Realtime Systems Massively Multiplayer Online game (MMOG)
Other variables: Communication Bandwidth, ?
Exa
6/23/2010 Bina Ramamurthy 2010
02/28/09 9
1000 2000 3000 4000 5000 6000 7000 LOC CIA Amazon YOUTube ChoicePt Sprint Google AT&T NERSC Climate
Top ten largest databases (2007)
Terabytes
Ref: http://www.businessintelligencelowdown.com/2007/02/top_10_largest_.html
6/23/2010 9 Bina Ramamurthy 2010
Pipelined Instruction level Concurrent Thread level Service Object level Indexed File level Mega Block level Virtual System Level Data size: small Data size: large
10 6/23/2010 Bina Ramamurthy 2010
11 6/23/2010 Bina Ramamurthy 2010
12 6/23/2010 Bina Ramamurthy 2010
13 6/23/2010 Bina Ramamurthy 2010
6/23/2010 14 Bina Ramamurthy 2010
15 6/23/2010 Bina Ramamurthy 2010
astronomy to healthcare has become essential for planning and performance.
– Data is an important asset to any organization – Discovery of knowledge; Enabling discovery; annotation of data – Complex computational models – No single environment is good enough: need elastic, on-demand capacities
– programming models, and – Supporting algorithms and data structures.
6/23/2010 16 Bina Ramamurthy 2010
6/23/2010 17 Bina Ramamurthy 2010
6/23/2010 18 Bina Ramamurthy 2010
processing on large clusters. Communication of ACM 51, 1 (Jan. 2008), 107-113.
6/23/2010 19 Bina Ramamurthy 2010
Dogs Cats Snakes Fish (Pet database size: TByte) map map map map split split split split combine combine combine reduce reduce reduce part0 part1 part2
6/23/2010 20 Bina Ramamurthy 2010
Count Count Count Large scale data splits Parse-hash Parse-hash Parse-hash Parse-hash Map <key, 1> <key, value>pair Reducers (say, Count) P-0000 P-0001 P-0002 , count1 , count2 ,count3
6/23/2010 21 Bina Ramamurthy 2010
6/23/2010 22 Bina Ramamurthy 2010
6/23/2010 23 Bina Ramamurthy 2010
6/23/2010 24 Bina Ramamurthy 2010
6/23/2010 25 Bina Ramamurthy 2010
6/23/2010 26 Bina Ramamurthy 2010
6/23/2010 27 Bina Ramamurthy 2010
6/23/2010 28
Namenode B
replication
Rack1 Rack2 Client Blocks Datanodes Datanodes Client Write Read Metadata ops
Metadata(Name, replicas..) (/ home/ foo/ data,6. ..
Block ops
Bina Ramamurthy 2010
6/23/2010 29
Application Local file system Master node Name Nodes HDFS Client HDFS Server Block size: 2K Block size: 128M Replicated
Bina Ramamurthy 2010
6/23/2010 30
Bina Ramamurthy 2010
6/23/2010 31
Bina Ramamurthy 2010
6/23/2010 32
The placement of the replicas is critical to HDFS reliability and performance. Optimizing replica placement distinguishes HDFS from other distributed file systems. Rack-aware replica placement:
Goal: improve reliability, availability and network bandwidth utilization
Many racks, communication between racks are through switches Network bandwidth between machines on the same rack is greater than those in different racks. Namenode determines the rack id for each DataNode. Replicas are typically placed on unique racks
Simple but non-optimal Writes are expensive Typical replication factor is 3
Replicas are placed: one on a node in a local rack, one on a different node in the local rack and one on a node in a different rack. 1/3 of the replica on a node, 2/3 on a rack and 1/3 distributed evenly across remaining racks.
Bina Ramamurthy 2010
6/23/2010 33
Application Local file system Master node Name Nodes HDFS Client HDFS Server Block size: 2K Block size: 128M Replicated
heartbeat blockmap Bina Ramamurthy 2010
6/23/2010 Bina Ramamurthy 2010 34
6/23/2010 35 Bina Ramamurthy 2010
6/23/2010 Bina Ramamurthy 2010 36
6/23/2010 37 Bina Ramamurthy 2010
6/23/2010 38 Bina Ramamurthy 2010