SLIDES CREATED BY: SHRIDEEP PALLICKARA L17.1
CS455: Introduction to Distributed Systems [Spring 2020]
- Dept. Of Computer Science, Colorado State University
COM
OMPUTE TER SCI CIENCE NCE DEPAR EPARTMEN ENT
CS455: Introduction to Distributed Systems ht http: p://www.cs. cs.co colost state.edu/~cs4 cs455
CS 455: INTRODUCTION TO DISTRIBUTED SYSTEMS
[HDFS]
Shrideep Pallickara Computer Science Colorado State University
Why data writes matter …
A write is performed once, But read happens many times (over) The writes are a harbinger, not just of Subsequent resource utilizations But also for how fast analytics lead to insights
COM
OMPUTE TER SCI CIENCE NCE DEPAR EPARTMEN ENT
CS455: Introduction to Distributed Systems ht http: p://www.cs. cs.co colost state.edu/~cs4 cs455 Professor: SHRIDEEP PALLICKARA
Topics covered in this lecture
¨ Hadoop Distributed File System ¤ Writing Data ¤ Replication ¤ Data integrity ¤ Parallel Copying ¤ Coherency Model