FAWN FAST ARRAY OF WIMPY NODES VIRAJ SULE FAWN is a cluster - PowerPoint PPT Presentation

FAWN FAST ARRAY OF WIMPY NODES VIRAJ SULE

• FAWN is a cluster architecture for low-power data-intensive computing. • FAWN-KV is a consistent, highly available and high performance key-value storage system built over FAWN prototype.

(1) “The workloads these systems support share several characteristics: they are I/O, not computation, requiring random access over large datasets, they are massively parallel, with thousands of concurrent mostly independent operations and the size of objects stored is typically small. ” Read the above statement, indicate why workloads of these characteristics represent a challenge to the system design? • In I/O, CPU has to stall while waiting for data to be loaded or unloaded. • Random access over large datasets would be inefficient in case we need to access the data sequentially. • Size of objects is small then there will large amount of data; consequently, large metadata in terms of numbers. • Systems requiring large clusters includes DRAM which are expensive and consume large amount of power.

(2) “ The key design choice in FAWN-KV is the use of a log structured per-node datastore called FAWN-DS that provides high performance reads and writes using flash memory.” “These performance problems motivate log -structured techniques for flash filesystems and data structures” What key benefit does a log structured data organization bring to the KV store? • Log structured data organization provides with high write throughput because all the updates on data and metadata are written in sequential order in the log.

(3) “ To provide this property(Writes are sequential and Read is random access), FAWN-DS maintains an in-DRAM hash table (Hash Index) that maps keys to an offset in the append-only Data Log on flash. ” What are potential issues of the design? • Large number of key-value pairs will lead to large metadata. • As DRAM is volatile, the hash table will be lost once we turn OFF the cluster.

(4) “ It stores only a fragment of the actual key in memory to find a location in the log; ” Is there a correction concern in this design? • No • With the 15-bit key fragment, only 1 in 32,768 retrievals from the flash will be incorrect. • minor issue over drastically reduced memory requirements.

(5) “ Basic functions: Store, Lookup, Delete ” Use Figure 2(a) to explain how these basic functions are executed? • Store • It appends an entry to the log, updates the corresponding hash table to point this offset within the Data Log, and sets the valid bit to true. • Lookup • Retrieve the hash entry containing the offset, indexes into the Data Log, and returns the data blob. • Delete • Invalidates the hash entry corresponding to the key by clearing the valid flag and writing a delete entry to the end of data file.

(6) “ As an optimization, FAWN-DS periodically checkpoints the index by writing the Hash Index and a pointer to the last log entry to flash. ”. Why does this checkpointing help with the recovery efficiency? How is a KV item deleted from the store? • After a failure, FAWN-DS uses the checkpoint as a starting point to reconstruct the in-memory Hash Index quickly. • This can be done because Data Log contains all the information necessary to reconstruct the Hash Index from scratch.

References: • FAWN paper • http://muratbuffalo.blogspot.com/2011/02/chain-replication-for-supporting- high.html • Lectures Slides

FAWN FAST ARRAY OF WIMPY NODES VIRAJ SULE FAWN is a cluster - PowerPoint PPT Presentation

FAWN FAST ARRAY OF WIMPY NODES VIRAJ SULE FAWN is a cluster architecture for low-power data-intensive computing. FAWN-KV is a consistent, highly available and high performance key-value storage system built over FAWN prototype. (1)

FAWN - a Fast Array of Wimpy Nodes Tomasz Dubrownik University of Warsaw January 12, 2011

Breakfast Menu Breakfast Menu Paper: PopSet Fawn 120g Size: 594 x 420 mm Scale: 40%

CSE 6350 File and Storage System Infrastructure in Data centers Supporting Internet-wide Services

Leaves of Brass a A study of how conduc4ve materials behave on leaves Fawn Qiu | 10. 2011

FAWN - Fast Array of Wimpy Nodes David G. Andersen et al. Presented by: Ravi Kiran Boggavarapu

FAWN: A Fast Array of Wimpy Nodes David G. Andersen, Jason Franklin, Michael Kaminsky * , Amar

Transforming Organ Donation In South Africa LOVE LIFE; GIFT LIFE LLGL was established in April

U&D Corridor Advisory Committee Meeting Minutes June 2, 2015 6:30 DATE & TIME:

Planning Commission June 6, 2017 Ascent Subdivision 1 Review Process 2 Vicinity Map South

A Brief History of Chain Replication Christopher Meiklejohn // @cmeik QCon 2015, November 17th,

Aerial Survey of Mule Deer August 10, 2019 Cody McKee, Wildlife Staff Specialist Why Survey?

Hig igh-Performance Key- Carnegie Mellon Value Store University Intel Labs Presented by:

Topology-Transparent Schedules for Energy Limited Ad hoc Networks Peter J. Dukes Charles J.

(from Chapters 10/11 of the text) document.write(theArray[ii] + <br/>" ); }

Java Interfaces } An interface is more abstract than a class } Almost always completely abstract in

Objectives: Discuss arrays Syntax Multi-dimensional arrays Arrays

Does Locality imply Efficient Testability? Omri Ben-Eliezer WOLA 2019 Monotonicity testing: Yet

Storing and Retrieving Data Database Management Systems need to: Store large volumes of

CS 294-73 Software Engineering for Scientific Computing Lecture

Overview Last lecture Software engineering CS3157: Advanced Will cover most in

Processor Architecture: Current Trends A B Transfer a truckload at a time from A to B Processor

SIMD Systems Programmierung Paralleler und Verteilter Systeme (PPV) Sommer 2015 Frank Feinbube,

Graphics Processing CS418 Computer Graphics John C. Hart Graphics Processing Graphics

Parallel Models An abstract description of a real world parallel machine. Attempts to

FAWN FAST ARRAY OF WIMPY NODES VIRAJ SULE FAWN is a cluster - PowerPoint PPT Presentation

FAWN FAST ARRAY OF WIMPY NODES VIRAJ SULE FAWN is a cluster architecture for low-power data-intensive computing. FAWN-KV is a consistent, highly available and high performance key-value storage system built over FAWN prototype. (1)

FAWN - a Fast Array of Wimpy Nodes Tomasz Dubrownik University of Warsaw January 12, 2011

Breakfast Menu Breakfast Menu Paper: PopSet Fawn 120g Size: 594 x 420 mm Scale: 40%

CSE 6350 File and Storage System Infrastructure in Data centers Supporting Internet-wide Services

Leaves of Brass a A study of how conduc4ve materials behave on leaves Fawn Qiu | 10. 2011

FAWN - Fast Array of Wimpy Nodes David G. Andersen et al. Presented by: Ravi Kiran Boggavarapu

FAWN: A Fast Array of Wimpy Nodes David G. Andersen, Jason Franklin, Michael Kaminsky * , Amar

Transforming Organ Donation In South Africa LOVE LIFE; GIFT LIFE LLGL was established in April

U&amp;D Corridor Advisory Committee Meeting Minutes June 2, 2015 6:30 DATE &amp; TIME:

Planning Commission June 6, 2017 Ascent Subdivision 1 Review Process 2 Vicinity Map South

A Brief History of Chain Replication Christopher Meiklejohn // @cmeik QCon 2015, November 17th,

Aerial Survey of Mule Deer August 10, 2019 Cody McKee, Wildlife Staff Specialist Why Survey?

Hig igh-Performance Key- Carnegie Mellon Value Store University Intel Labs Presented by:

Topology-Transparent Schedules for Energy Limited Ad hoc Networks Peter J. Dukes Charles J.

(from Chapters 10/11 of the text) document.write(theArray[ii] + &lt;br/&gt;&quot; ); }

Java Interfaces } An interface is more abstract than a class } Almost always completely abstract in

Objectives: Discuss arrays Syntax Multi-dimensional arrays Arrays

Does Locality imply Efficient Testability? Omri Ben-Eliezer WOLA 2019 Monotonicity testing: Yet

Storing and Retrieving Data Database Management Systems need to: Store large volumes of

CS 294-73 Software Engineering for Scientific Computing Lecture

Overview Last lecture Software engineering CS3157: Advanced Will cover most in

Processor Architecture: Current Trends A B Transfer a truckload at a time from A to B Processor

SIMD Systems Programmierung Paralleler und Verteilter Systeme (PPV) Sommer 2015 Frank Feinbube,

Graphics Processing CS418 Computer Graphics John C. Hart Graphics Processing Graphics

Parallel Models An abstract description of a real world parallel machine. Attempts to

U&D Corridor Advisory Committee Meeting Minutes June 2, 2015 6:30 DATE & TIME:

(from Chapters 10/11 of the text) document.write(theArray[ii] + <br/>" ); }