CS 455: Introduction to Distributed Systems [Spring 2020]
Dept. of Computer Science, Colorado State University
[HDFS] Why data writes matter: a write is performed once, but reads happen many times


Slides created by: Shrideep Pallickara


Computer Science Department
CS455: Introduction to Distributed Systems, http://www.cs.colostate.edu/~cs455

CS 455: INTRODUCTION TO DISTRIBUTED SYSTEMS

[HDFS]

Shrideep Pallickara
Computer Science, Colorado State University

Why data writes matter …

A write is performed once, but reads happen many times (over and over). The writes are a harbinger not just of subsequent resource utilization, but also of how fast analytics lead to insights.


Topics covered in this lecture

- Hadoop Distributed File System
  - Writing data
  - Replication
  - Data integrity
  - Parallel copying
  - Coherency model


WRITING DATA


File writes

- We will look at creating a new file and writing data to it
- File creation is done using create() on DistributedFileSystem
- DistributedFileSystem does an RPC to the namenode
  - The namenode checks the existence of the file and the client's permissions
  - It creates the file in the filesystem's namespace, with no blocks in it


Data flow in HDFS [writes]

[Figure: an HDFS client, running in the client JVM on the client node, calls create on DistributedFileSystem (1), which issues a create RPC to the namenode (2); the client then writes (3) through an FSDataOutputStream, which sends write packets (4) down a pipeline of three datanodes and receives ack packets (5) back along the pipeline; finally the client calls close (6).]


Anatomy of a file write

- DistributedFileSystem returns an FSDataOutputStream for the client to write data to
- FSDataOutputStream wraps a DFSOutputStream
  - DFSOutputStream handles communications with the datanodes and the namenode


As the client writes data …

- DFSOutputStream splits it into packets
  - Written to an internal queue, the data queue
- The data queue is consumed by the DataStreamer
- The DataStreamer asks the namenode to allocate new blocks
  - Picks a list of suitable datanodes to store the replicas
  - The list of datanodes forms a pipeline


Assuming a replication level of 3

- The DataStreamer streams packets to the first datanode in the pipeline
  - The 1st datanode stores the packet and forwards it to the 2nd datanode in the pipeline
- The second datanode stores the packet and forwards it to the 3rd (and last) datanode in the pipeline


Managing acknowledgements

- DFSOutputStream maintains an internal queue of packets waiting to be ACKed by datanodes
  - This is the ack queue
- When is a packet removed from the ack queue?
  - Only when it has been acknowledged by all the datanodes in the pipeline
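The interplay between the data queue and the ack queue can be sketched with a toy, single-threaded model. This is only an illustration of the bookkeeping, not HDFS's actual DFSOutputStream; the class and method names here are invented:

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;
import java.util.Queue;

// Toy model of DFSOutputStream's two queues: a packet moves from the data
// queue to the ack queue when streamed, and leaves the ack queue only after
// every datanode in the pipeline has acknowledged it.
public class AckQueueSketch {
    static final int PIPELINE_SIZE = 3; // replication level of 3

    static List<String> streamAndAck(int numPackets) {
        Queue<String> dataQueue = new ArrayDeque<>();
        Queue<String> ackQueue = new ArrayDeque<>();
        List<String> acked = new ArrayList<>();
        for (int i = 0; i < numPackets; i++) dataQueue.add("packet-" + i);

        while (!dataQueue.isEmpty()) {
            String packet = dataQueue.remove();  // DataStreamer consumes the data queue
            ackQueue.add(packet);                // packet now awaits acknowledgements
            int acks = 0;
            for (int node = 1; node <= PIPELINE_SIZE; node++) {
                acks++;  // each datanode stores the packet, forwards it, then acks
            }
            if (acks == PIPELINE_SIZE) {         // all nodes acked: safe to drop
                acked.add(ackQueue.remove());
            }
        }
        return acked;
    }

    public static void main(String[] args) {
        System.out.println(streamAndAck(4)); // [packet-0, packet-1, packet-2, packet-3]
    }
}
```

In the real implementation the two queues are consumed by separate threads, and a packet stranded on the ack queue after a failure is pushed back onto the data queue for retransmission.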


Handling datanode failures during writes [1/2]

- The pipeline is closed
- The current block on the good datanodes is given a new identity
  - Allows the partial block on the failed node to be deleted if that datanode recovers later on


Handling datanode failures during writes [2/2]

- The failed datanode is removed from the pipeline
- The remainder of the block's data is written to the two good datanodes in the pipeline
- The namenode notices that the block is under-replicated
  - Arranges for a further replica to be created on another node
- Subsequent blocks are treated as normal


It is possible that multiple datanodes fail while a block is being written

- As long as dfs.replication.min (default 1) replicas are written, the write will succeed
- The block is asynchronously replicated across the cluster until its target replication factor is reached
  - dfs.replication (default 3)
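Both settings live in the HDFS configuration file; a minimal hdfs-site.xml sketch, using the values quoted above (the defaults) and the older property naming this lecture uses:

```xml
<configuration>
  <!-- minimum number of replicas that must be written for a write to succeed -->
  <property>
    <name>dfs.replication.min</name>
    <value>1</value>
  </property>
  <!-- target replication factor, reached asynchronously after the write -->
  <property>
    <name>dfs.replication</name>
    <value>3</value>
  </property>
</configuration>
```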


When a client has finished writing data

- It calls close() on the stream
- This flushes all remaining packets to the datanode pipeline
  - Waits for acknowledgements before contacting the namenode to signal that the file is complete
- The namenode already knows which blocks comprise the file
  - The DataStreamer requested the block allocations
  - The client only waits for the blocks to be minimally replicated


REPLICA PLACEMENTS


Replica placement [1/2]

- Trade-off between reliability, read bandwidth, and write bandwidth
- Placing all replicas on a single node?
  - Lowest write-bandwidth penalty, since the replication pipeline runs on a single node
  - Offers no redundancy


Replica placement [2/2]

- Read bandwidth is high for off-rack reads
- Placing replicas in different data centers?
  - Maximizes redundancy, at the cost of bandwidth


Default replication strategy in Hadoop

- Place the first replica on the same node as the client
  - If the client runs outside the cluster, the 1st node is chosen at random
- The second replica is placed on a different rack from the first
  - Chosen at random
- The third replica is placed on the same rack as the second
  - A different node is chosen at random
- Further replicas are placed on random nodes in the cluster
  - Avoiding placing too many replicas on the same rack
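The first three placement choices can be sketched as follows. This is a simplified illustration, not Hadoop's actual BlockPlacementPolicy; the Node record and the rack layout are invented for the example, and it assumes the client runs on a cluster node:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Sketch of HDFS's default placement: first replica on the client's node,
// second on a different rack, third on the second replica's rack but on a
// different node.
public class PlacementSketch {
    record Node(String name, String rack) {}

    static List<Node> place(Node client, List<Node> cluster, Random rnd) {
        List<Node> replicas = new ArrayList<>();
        replicas.add(client);                                   // 1st: client's own node
        List<Node> offRack = cluster.stream()
                .filter(n -> !n.rack().equals(client.rack())).toList();
        Node second = offRack.get(rnd.nextInt(offRack.size())); // 2nd: different rack
        replicas.add(second);
        List<Node> sameRack = cluster.stream()
                .filter(n -> n.rack().equals(second.rack()) && !n.equals(second))
                .toList();
        replicas.add(sameRack.get(rnd.nextInt(sameRack.size()))); // 3rd: same rack, new node
        return replicas;
    }

    public static void main(String[] args) {
        List<Node> cluster = List.of(
                new Node("n1", "rackA"), new Node("n2", "rackA"),
                new Node("n3", "rackB"), new Node("n4", "rackB"));
        System.out.println(place(cluster.get(0), cluster, new Random()));
    }
}
```

Whatever the random choices, the result spans exactly two racks, which is what gives the balance described on the next slide.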


Default strategy balances

- Reliability
  - Blocks are stored on different racks
- Write bandwidth
  - Writes traverse a single network switch
- Read bandwidth
  - Choice of two racks to read from
- Block distribution across the cluster
  - Clients write only a single block on the local rack


Once the replica locations have been chosen

- A pipeline is built
- The pipeline takes network topology into account


COHERENCY MODEL


A quick look at assertThat in JUnit

- Format

  assertThat([value], [matcher statement]);

- Examples

  assertThat(x, is(3));
  assertThat(x, is(not(4)));
  assertThat(responseString,
      either(containsString("color")).or(containsString("colour")));
  assertThat(myList, hasItem("3"));


Assertion syntax

- Readable
- Think in terms of subject, verb, and object
  - Assert "x is 3"
- Matcher statements can be negated, combined, or mapped to a collection


Coherency Model

- For a filesystem, the coherency model describes the data visibility of reads and writes to a file
- HDFS trades off some POSIX requirements for performance


Creation of a file

- After creation, it is visible in the filesystem namespace

  Path p = new Path("p");
  fs.create(p);
  assertThat(fs.exists(p), is(true));


Contents written to the newly created file

- Not guaranteed to be visible
- Even if the stream is flushed
  - The file may appear to have a length of 0

  Path p = new Path("p");
  OutputStream out = fs.create(p);
  out.write("content".getBytes("UTF-8"));
  out.flush();
  assertThat(fs.getFileStatus(p).getLen(), is(0L));


Visibility of blocks during writes

- Once more than a block of data has been written?
  - The first block is visible
- In general, the current block being written is not visible to other readers

The HDFS sync method

- Forces all buffers to be synchronized to the datanodes
- After sync() returns successfully?
  - All data written up to that point in the file is persisted and visible to all clients


When to call sync()

- With no calls to sync()
  - It is possible to lose up to a block of data due to client or system failure
- However, invocations of sync() do have overheads
  - Trade-off between data robustness and throughput
- The right frequency of sync() calls is application dependent


PARALLEL COPYING


Parallel copying with distcp

- Enables copying large amounts of data to and from Hadoop filesystems in parallel

  % hadoop distcp hdfs://namenode1/foo hdfs://namenode2/bar


distcp is implemented as a MapReduce job

- Copying is done by maps that run in parallel across the cluster
  - There are no reducers
- Deciding the number of maps
  - Give each map sufficient data to minimize overheads during task setup
  - This is specified using the -m argument to distcp


Keeping an HDFS cluster balanced

- HDFS works best when file blocks are evenly spread across the cluster
- We need to ensure that distcp does not disrupt this
- If we are transferring 1000 GB?
  - Specifying -m 1 would mean that a single map would do the copy
    - This will be slow
    - The first replica of each block would reside on the node running the map (until its disks fill up)


DATA INTEGRITY


Data Integrity

- I/O operations on disk or network carry a small chance of introducing errors
- With voluminous data movements, the chances of data corruption become high
- Checksums
  - Data is corrupt if there is a mismatch between the original and the newly computed checksum
  - There is also a small chance that the checksum itself is corrupt
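The checksum idea can be demonstrated with the JDK's CRC32. HDFS itself uses a CRC-32 variant computed over fixed-size chunks; this standalone sketch just shows how a mismatch exposes corruption:

```java
import java.util.zip.CRC32;

// Compute a checksum when data is stored, recompute it on read, and flag
// corruption when the two disagree.
public class ChecksumDemo {
    static long checksum(byte[] data) {
        CRC32 crc = new CRC32();
        crc.update(data);
        return crc.getValue();
    }

    public static void main(String[] args) {
        byte[] original = "some block of data".getBytes();
        long stored = checksum(original);     // checksum kept alongside the data

        byte[] corrupted = original.clone();
        corrupted[3] ^= 0x01;                 // flip one bit to simulate bit rot

        System.out.println("clean read ok:  " + (checksum(original) == stored));
        System.out.println("corrupt caught: " + (checksum(corrupted) != stored));
    }
}
```

A CRC is guaranteed to detect any single-bit error, which is why flipping one bit above is always caught.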


Data integrity in HDFS

- Datanodes are responsible for verifying received data before storing the data and its checksum
- When clients read data from a datanode, they verify the checksum
  - Comparing it with the checksum stored at the datanode


DataBlockScanner

- Each datanode periodically runs a DataBlockScanner in the background
- Verifies all blocks stored on the datanode
- Guards against corruption due to bit rot in the physical storage media


Dealing with corrupted data blocks [1/2]

- Heal corrupted blocks
  - By copying one of the good replicas to produce a new, uncorrupted replica
- When a client detects an error while reading a block?
  - It reports both the bad block and the datanode it was reading from
  - It throws a ChecksumException


Dealing with corrupted data blocks [2/2]

- The namenode marks the block replica as corrupt
  - Does not direct clients to it
  - Does not try to copy the replica to another datanode
- Schedules a copy of the block to be replicated on another datanode
  - Restores the replication level for the block
- The corrupt replica is then deleted


Disabling checksums

- Useful if you have a corrupt file that you would like to inspect
- Pass false to setVerifyChecksum() on FileSystem before using open() to read the file
- From the shell, use the -ignoreCrc option with the -get or the -copyToLocal command


Client side checksumming

- Done by the Hadoop LocalFileSystem
- When you write a file filename
  - The filesystem client creates a hidden file, .filename.crc, in the same directory
  - It contains checksums for each chunk of the file
    - The chunk size is stored in the .crc file
- Disable checksums when the underlying filesystem supports them natively
  - Use RawLocalFileSystem instead of LocalFileSystem
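Per-chunk checksumming in the spirit of the .filename.crc side file can be sketched like this (the chunking scheme here is for illustration; the real .crc file layout is Hadoop-internal):

```java
import java.util.ArrayList;
import java.util.List;
import java.util.zip.CRC32;

// Checksum a byte stream in fixed-size chunks, the way LocalFileSystem
// records one CRC per chunk of a file in a hidden .crc side file.
public class ChunkedCrc {
    static List<Long> chunkChecksums(byte[] data, int chunkSize) {
        List<Long> sums = new ArrayList<>();
        for (int off = 0; off < data.length; off += chunkSize) {
            CRC32 crc = new CRC32();
            crc.update(data, off, Math.min(chunkSize, data.length - off));
            sums.add(crc.getValue());
        }
        return sums;
    }

    public static void main(String[] args) {
        byte[] file = new byte[1300];   // stand-in for the file's contents
        int chunkSize = 512;            // the chunk size is recorded in the .crc file
        List<Long> sums = chunkChecksums(file, chunkSize);
        System.out.println("chunks: " + sums.size()); // chunks: 3
    }
}
```

Chunking means a corrupted read only invalidates one chunk's checksum, so the damaged region can be localized rather than rereading the whole file.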


COMPRESSION


Compression

- Reduces the space needed to store files
- Speeds up data transfers
  - Across the network
  - To and from disk


Compression formats that can be used with Hadoop

  Compression format | Tool  | Algorithm | Filename extension | Splittable?
  -------------------|-------|-----------|--------------------|---------------------
  DEFLATE            | N/A   | DEFLATE   | .deflate           | No
  Gzip               | gzip  | DEFLATE   | .gz                | No
  Bzip2              | bzip2 | Bzip2     | .bz2               | Yes
  LZO                | lzop  | LZO       | .lzo               | No (yes, if indexed)
  Snappy             | N/A   | Snappy    | .snappy            | No

(Pigeonhole principle: no lossless compression scheme can shrink every possible input.)


Compression Algorithms

- Exhibit a space-time trade-off
  - Faster compression/decompression speeds usually result in smaller space savings
- Tools give some control over this trade-off at compression time
  - 9 different options
  - -1 means optimize for speed
  - -9 means optimize for space
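The JDK's Deflater exposes the same 1-9 dial, which makes the trade-off easy to observe; the exact sizes depend on the input, so the example only prints them:

```java
import java.util.zip.Deflater;

// Compress the same input at the "fast" (-1) and "best" (-9) settings and
// compare the resulting sizes.
public class LevelDemo {
    static byte[] compress(byte[] input, int level) {
        Deflater d = new Deflater(level);
        d.setInput(input);
        d.finish();
        byte[] buf = new byte[input.length * 2 + 64]; // generous output buffer
        int n = d.deflate(buf);                       // bytes actually written
        d.end();
        byte[] out = new byte[n];
        System.arraycopy(buf, 0, out, 0, n);
        return out;
    }

    public static void main(String[] args) {
        byte[] input = "to be or not to be, that is the question. "
                .repeat(100).getBytes();
        byte[] fast = compress(input, Deflater.BEST_SPEED);       // like -1
        byte[] best = compress(input, Deflater.BEST_COMPRESSION); // like -9
        System.out.println("original: " + input.length
                + ", -1: " + fast.length + ", -9: " + best.length);
    }
}
```

On repetitive input like this, both levels shrink the data dramatically; the difference between the levels only becomes pronounced on larger, less regular inputs.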


Compression characteristics

- gzip is a general-purpose compressor
  - Sits in the middle of the space/time trade-off
- bzip2 compresses more effectively than gzip
  - But it is slower
  - bzip2's decompression is faster than its compression
    - But still slower than the other formats
- LZO and Snappy optimize for speed
  - An order of magnitude faster, but less effective compression than gzip


A codec is the implementation of a compression-decompression algorithm in Hadoop

  Compression format | Hadoop CompressionCodec
  -------------------|--------------------------------------------
  DEFLATE            | org.apache.hadoop.io.compress.DefaultCodec
  gzip               | org.apache.hadoop.io.compress.GzipCodec
  bzip2              | org.apache.hadoop.io.compress.BZip2Codec
  LZO                | com.hadoop.compression.lzo.LzopCodec
  Snappy             | org.apache.hadoop.io.compress.SnappyCodec

CompressionCodec

- To compress data being written to an output stream
  - Use codec.createOutputStream(OutputStream out)
- To decompress data being read from an input stream
  - Use codec.createInputStream(InputStream in)


Using compression

Compresses data read from standard input and writes it to standard output:

  public class StreamCompressor {
      public static void main(String[] args) throws Exception {
          String codecClassname = args[0];
          Class<?> codecClass = Class.forName(codecClassname);
          Configuration conf = new Configuration();
          CompressionCodec codec =
              (CompressionCodec) ReflectionUtils.newInstance(codecClass, conf);
          CompressionOutputStream out = codec.createOutputStream(System.out);
          IOUtils.copyBytes(System.in, out, 4096, false);
          out.finish();
      }
  }


Compression and input splits

- Let's look at an uncompressed file stored in HDFS
  - With an HDFS block size of 64 MB, a 1 GB file is stored as 16 blocks
  - A MapReduce job will create 16 input splits
    - Each processed independently as a separate map task


If the gzip compressed file is 1 GB

- HDFS still stores the file as 16 blocks
- But creating a split for each block does not work
  - It is impossible to start reading at an arbitrary point in the gzip stream
  - So it is impossible for a map task to read its split independently of the others


Storing gzipped streams

- Gzip uses DEFLATE, which stores data as a series of compressed blocks
- The start of each block is not distinguished in a way that allows a reader positioned at an arbitrary point in the stream to advance to the beginning of the next block
  - There is no way to self-synchronize with the stream
- Gzip therefore does not support splitting


HDFS does not split gzip files

- A single map will process all 16 HDFS blocks
- Most of these blocks will not be local to the map
  - Loss of locality
  - The job is less granular, and takes much longer to run


The same story plays out if you were dealing with LZO files, but …

- It is possible to preprocess LZO files using an indexer tool
- This builds an index of split points


Bzip2

- Bzip2 does provide a synchronization marker between blocks
  - A 48-bit approximation of pi
- The marker is used to support splitting


Dealing with large, unbounded files [Log files]

1. Store the files uncompressed
2. Use a compression format that supports
   - Splitting: Bzip2
   - Indexing to support splitting: LZO
3. Split the file into chunks in the application and compress each chunk separately
   - Choose chunk sizes such that the compressed chunks are approximately the size of an HDFS block


Using compression in MapReduce

- To compress the output of a MapReduce job
  - In the job configuration, set the mapred.output.compress property to true
  - Use mapred.output.compression.codec to specify the codec
- Alternatively, we can do this using the FileOutputFormat


Using the FileOutputFormat

  public class MaxTemperatureWithCompression {
      public static void main(String[] args) throws Exception {
          Job job = Job.getInstance();
          job.setJarByClass(MaxTemperature.class);
          FileInputFormat.addInputPath(job, new Path(args[0]));
          FileOutputFormat.setOutputPath(job, new Path(args[1]));
          job.setOutputKeyClass(Text.class);
          job.setOutputValueClass(IntWritable.class);
          FileOutputFormat.setCompressOutput(job, true);
          FileOutputFormat.setOutputCompressorClass(job, GzipCodec.class);
          job.setMapperClass(MaxTemperatureMapper.class);
          job.setCombinerClass(MaxTemperatureReducer.class);
          job.setReducerClass(MaxTemperatureReducer.class);
          System.exit(job.waitForCompletion(true) ? 0 : 1);
      }
  }


Main reason why Hadoop does not use Java Serialization

- Deserialization creates a new instance of each object being deserialized
- Writable objects can be (and often are) reused
- Large MapReduce jobs often serialize/deserialize billions of records
  - The savings from not having to allocate new objects are significant
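The reuse pattern can be sketched without Hadoop: a Writable-style object exposes a mutator that refills the same instance for every record, instead of allocating a fresh object per record. The IntHolder class and its methods below are invented for this illustration:

```java
// Sketch of why reuse matters: Java serialization materializes a new object
// per record, while a Writable-style holder is refilled in place.
public class ReuseSketch {
    // Minimal Writable-like type: mutable, refilled via set(), which plays
    // the role of readFields() repopulating the same instance from a stream.
    static class IntHolder {
        private int value;
        void set(int v) { value = v; }
        int get() { return value; }
    }

    static int maxWithReuse(int[] records) {
        IntHolder holder = new IntHolder();   // allocated once, before the loop
        int max = Integer.MIN_VALUE;
        for (int r : records) {
            holder.set(r);                    // refilled per record, no allocation
            max = Math.max(max, holder.get());
        }
        return max;
    }

    public static void main(String[] args) {
        int[] records = {3, 1, 4, 1, 5, 9, 2, 6};
        System.out.println("max = " + maxWithReuse(records)); // max = 9
    }
}
```

One allocation versus one per record is negligible here, but over billions of records it removes substantial garbage-collection pressure, which is the savings the slide refers to.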


The contents of this slide set are based on the following references

- Tom White. Hadoop: The Definitive Guide. 3rd Edition. O'Reilly Press. ISBN: 978-1-449-31152-0. Chapters 3 and 4.
- JUnit release notes for version 4.4, available at http://junit.sourceforge.net/doc/ReleaseNotes4.4.html