On Average Latency for File Access in Distributed Coded Storage - PowerPoint PPT Presentation

On Average Latency for File Access in Distributed Coded Storage Parimal Parag Archana Bura Jean-Fran¸ cois Chamberland Electrical Communication Engineering Indian Institute of Science Electrical and Computer Engineering Texas A&M University IEEE INFOCOM May 2, 2017 1/ 19

Dominant traffic on Internet Peak Period Traffic Composition (North America) Real-Time Entertainment 100 Web Browsing Marketplaces 80 Filesharing Tunneling 60 Social Networking Storage Communications 40 Gaming Outside Top 5 20 0 Upstream Downstream Aggregate ◮ Real-Time Entertainment: 64.54% for downstream and 36.56 % for mobile access 1 1https://www.sandvine.com/downloads/general/global-internet-phenomena/2015/global-internet- phenomena-report-latin-america-and-north-america.pdf 2/ 19

Established Solutions – Content Delivery Network File 1 File 3 File 5 File 1 Routed Requests File 2 File 1 File 4 File 6 File 3 Vault File 4 File 2 File 3 File 6 File 5 File 6 File 2 File 4 File 5 Congestion Prevention and Outage Protection ◮ Mirroring content with local servers ◮ Media file on multiple servers 3/ 19

Question: Duplication versus MDS Coding A A A B B C B D Reduction of access time ◮ How many fragments for a single message? ◮ How to encode and store at the distributed storage nodes? 4/ 19

Pertinent References (very incomplete) N. B. Shah, K. Lee, and K. Ramchandran, “When do redundant requests reduce latency?” IEEE Trans. Commun., 2016. G. Joshi, Y. Liu, and E. Soljanin, “On the delay-storage trade-off in content download from coded distributed storage systems” IEEE Journ. Spec. Areas. Commun., 2014. Dimakis, Godfrey, Wu, Wainwright, and Ramchandran, “Network Coding for Distributed Storage Systems ” IEEE Trans. Info. Theory, 2010. A. Eryilmaz, A. Ozdaglar, M. M´ edard, and E. Ahmed, “On the delay and throughput gains of coding in unreliable networks,” IEEE Trans. Info. Theory, 2008. D. Wang, D. Silva, F. R. Kschischang, “Robust Network Coding in the Presence of Untrusted Nodes”, IEEE Trans. Info. Theory, 2010. A. Dimakis, K. Ramchandran, Y. Wu, C. Suh, “A Survey on Network Codes for Distributed Storage”, Proceedings of IEEE, 2011. Karp, Luby, Meyer auf der Heide, “Efficient PRAM simulation on a distributed memory machine”, ACM symposium on Theory of computing, 1992. Adler, Chakrabarti, Mitzenmacher, Rasmussen, “Parallel randomized load balancing”, ACM symposium on Theory of computing, 1995. Gardner, Zbarsky, Velednitsky, Harchol-Balter, Scheller-Wolf, “Understanding Response Time in the Redundancy-d System”, SIGMETRICS, 2016. B. Li, A. Ramamoorthy, R. Srikant, “Mean-field-analysis of coding versus replication in cloud storage systems”, INFOCOM, 2016. 5/ 19

System Model File storage ◮ Each media file divided into k pieces ◮ Pieces encoded and stored on n servers Arrival of requests ◮ Each request wants entire media file ◮ Poisson arrival of requests with rate λ Time in the system ◮ Till the reception of whole file Service at each server ◮ IID exponential service time with rate k / n 6/ 19

Parallel Processing of Requests A B C D ◮ Service rate available to each request is proportional to number of servers processing the requests in parallel 7/ 19

State Space Structure Keeping Track of Partially Fulfilled Requests ◮ Element of state vector Y S ( t ) is number of users with given subset S of pieces Continuous-Time Markov Chain ◮ Y ( t ) = { Y S ( t ) : S ⊂ [ n ] , | S | < k } is a Markov process 8/ 19

Priority Scheduling A A B B C C D D Mean shortest remaining time processing ◮ Priority to jobs with higher number of pieces ◮ FIFO scheduling among jobs with equal number of pieces 9/ 19

State Space Collapse Theorem For duplication and coding schemes under priority scheduling and parallel processing model, collection S ( t ) = { S ⊂ [ n ] : Y S ( t ) > 0 , | S | < k } of information subsets is totally ordered in terms of set inclusion Corollary Let Y i ( t ) be number of requests with i information symbols at time t , then Y ( t ) = ( Y 0 ( t ) , Y 1 ( t ) , . . . , Y k − 1 ( t )) is Markov process 10/ 19

State Transitions of Collapsed System Arrival of Requests ◮ Unit increase in Y 0 ( t ) = Y 0 ( t − ) + 1 with rate λ Getting Additional Symbol ◮ Unit increase in Y i ( t ) = Y i ( t − ) + 1 ◮ Unit decrease in Y i − 1 ( t ) = Y i − 1 ( t − ) − 1 Getting Last Missing Symbol ◮ Unit decrease in Y k − 1 ( t ) = Y k − 1 ( t − ) − 1 11/ 19

Tandem Queue Interpretation λ γ 0 γ 1 Y 0 ( t ) Y 1 ( t ) Tandem Queue with Pooled Resources ◮ Servers with empty buffers help upstream ◮ Aggregate service at level i becomes l i ( t ) − 1 � γ j where l i ( t ) = k ∧ { l > i : Y l ( t ) > 0 } j = i ◮ No explicit description of stationary distribution for multi-dimensional Markov process 12/ 19

Bounding and Separating λ µ 0 µ 1 Theorem † When λ < min µ i , tandem queue has product form distribution k − 1 � y i � λ 1 − λ � π ( y ) = µ i µ i i =0 Uniform Bounds on Service Rate Transition rates are uniformly bounded by l i ( y ) − 1 k − 1 � � γ j � Γ i γ i ≤ γ j ≤ j = i j = i † F. P. Kelly, Reversibility and Stochastic Networks. New York, NY, USA: Cambridge University Press, 2011. 13/ 19

Bounds on Tandem Queue λ Y 0 ( t ) Γ 0 Y 1 ( t ) Γ 1 λ γ 0 γ 1 Y 0 ( t ) Y 1 ( t ) λ γ 0 γ 1 Y 0 ( t ) Y 1 ( t ) Upper Bound Lower Bound Higher values for service rates Lower values for service rate yield lower bound on queue yield upper bound on queue distribution distribution k − 1 � y i k − 1 � y i λ � 1 − λ λ � 1 − λ � � π ( y ) = π ( y ) = Γ i Γ i γ i γ i i =0 i =0 14/ 19

Approximating Pooled Tandem Queue λ ˆ µ 0 ˆ Y 0 ( t ) ˆ ˆ µ 1 Y 1 ( t ) λ γ 0 γ 1 Y 0 ( t ) Y 1 ( t ) Independence Approximation with Statistical Averaging Service rate is equal to base service rate γ i plus cascade effect, averaged over time k − 1 � y i λ � 1 − λ µ k − 1 = γ k − 1 ˆ � π ( y ) = ˆ ˆ µ i ˆ µ i µ i = γ i + ˆ ˆ µ i +1 ˆ π i +1 (0) i =0 15/ 19

Mean Sojourn Time Replication Coding (4 , 2) MDS Code 15 15 Upper Bound Upper Bound Simulation Simulation Approximation Approximation Lower Bound Lower Bound 10 10 5 5 0 0 0 . 1 0 . 2 0 . 4 0 . 6 0 . 8 0 . 95 0 . 1 0 . 2 0 . 4 0 . 6 0 . 8 0 . 95 Arrival Rate Arrival Rate ◮ MDS coding significantly outperforms replication ◮ Bounding techniques are only meaningful under light loads ◮ Approximation is accurate over range of loads 16/ 19

Comparing Replication versus MDS Coding 5 Repetition Simulation Repetition Approximation MDS Simulation 4 MDS Approximation Mean Sojourn Time 3 2 1 2 4 8 12 16 20 Number of Servers Arrival rate 0 . 3 units and coding rate n / k = 2 17/ 19

Mean Sojourn Time versus Message Length 10 Repetition Coding Simulation Repetition Coding Approximation 8 Mean Sojourn Time W MDS Coding Simulation MDS Coding Approximation 6 4 2 0 1 4 8 12 16 20 24 Message length k Figure: For rate λ = 0 . 45 and n = 24 servers. 18/ 19

Summary and Discussion Main Contributions ◮ Analytical framework for study of distributed computation and storage systems ◮ Upper and lower bounds to analyze replication and MDS codes ◮ A tight closed-form approximation to study distributed storage codes ◮ MDS codes are better suited for large distributed systems ◮ Mean access time is better for MDS codes for all code-rates 19/ 19

On Average Latency for File Access in Distributed Coded Storage - PowerPoint PPT Presentation

On Average Latency for File Access in Distributed Coded Storage Parimal Parag Archana Bura Jean-Fran cois Chamberland Electrical Communication Engineering Indian Institute of Science Electrical and Computer Engineering Texas A&M

Distributed File Systems Distributed File Systems A distributed file system (DFS) is a

Distributed File Systems Issues in Distributed File Service Case Studies: Sun

M12 X-coded 10Gb/s M12 X-Coded Field installable for Rail D4 Industrial Ethernet, Ethernet/IP

File Management What is a file? Elements of file management File organization

Click on M odel File for CAD Click on M odel File for CAD Click on Model File for CAD Click

~FILE SYSTEM~ SUNU WIBIRAMA OUTLINE FILE SYSTEM ACCESS METHODS DIRECTORY STRUCTURE FILE

Asynchronous I/O Stack: A Low-latency Kernel I/O Stack for Ultra-Low Latency SSDs Jinkyu Jeong

CPSC 410/611: File Management What is a file? Elements of file management File

Week 10: File Management What is a file? Elements of file management File

Novel Latency Bounds for Distributed Coded Storage Jean-Francois Chamberland Parimal Parag

Distributed-File Systems Background Naming and Transparency Remote File Access

Improved Computation-Communication Trade-Off for Coded Distributed Computing using Linear

Coded Computational Photography ! EE367/CS448I: Computational Imaging and Display !

Turbo Codes and Turbo-Coded Modulation Turbo Codes and Turbo-Coded Modulation in CDMA Mobile

Average Connectivity and Average Edge-connectivity in Graphs Suil O joint work with Jaehoon Kim

Hadoop Distributed File System (HDFS) 10/05/2018 1 HDFS Overview A distributed file system

Distributed Sources Via Lookup Services Tatiana Walther http://orcid.org/0000-0001-8127-2988

Adaptive Real-Time Resource Management Richard West Boston University Computer Science

Outpatient and Professional Services Eligible for Telemedicine Note: Only includes Fee-For-S

Scalable QoS for Distributed Storage Clusters using Dynamic Token Allocation Yuhan Peng 1 ,

Distributed Transaction Management Database Management Systems, 2 nd Edition. R. Ramakrishnan and

Database Management Objectives of Lecture 7 Systems Transactions Models Transactions Models

CS 839: Design the Next-Generation Database Lecture 6: Deterministic Database Xiangyao Yu

Concurrency Control II and Distributed Transactions CS 240: Computing Systems and Concurrency