Storing Trees on Disk Drives Medha Bhadkamkar, Fernando Farfan, - - PowerPoint PPT Presentation

storing trees on disk drives
SMART_READER_LITE
LIVE PREVIEW

Storing Trees on Disk Drives Medha Bhadkamkar, Fernando Farfan, - - PowerPoint PPT Presentation

Storing Trees on Disk Drives Medha Bhadkamkar, Fernando Farfan, Vagelis Hristidis, Raju FLORIDA INTERNATIONAL UNIVERSITY Rangaswami 15th December, 2005 FAST 2005 WiP Report 1 Introduction Tree data are becoming commonplace: Offer an


slide-1
SLIDE 1

15th December, 2005 FAST 2005 WiP Report 1

Storing Trees on Disk Drives

Medha Bhadkamkar, Fernando Farfan, Vagelis Hristidis, Raju Rangaswami

FLORIDA INTERNATIONAL UNIVERSITY

slide-2
SLIDE 2

15th December, 2005 FAST 2005 WiP Report 2

Introduction

 Tree data are becoming commonplace:

 Offer an intuitive, natural way for organizing information.  Examples: XML, multi-res video, natural sciences data (e.g.

Bioinformatics), even traditional directory-file hierarchies.

 Disk drives are ubiquitous and seem irreplaceable  Current approaches:

 Use relational databases  Use flat files

 Our contributions

 Examine the tree storage problem  Propose native data layout strategies for tree data

slide-3
SLIDE 3

15th December, 2005 FAST 2005 WiP Report 3

Tree Structured Placement

Idea: Optimize common accesses

  • Parent to child
  • Node to sibling

Assumptions:

  • Each node occupies an entire disk block
  • Semi-sequential access information available
slide-4
SLIDE 4

15th December, 2005 FAST 2005 WiP Report 4

Optimized Tree-Structured Placement

 Problems with basic

tree placement:

 Significant fragmentation.  Large random seeks

 Solution:

 Use non-free tracks  Use rotationally-optimal

track-regions

slide-5
SLIDE 5

15th December, 2005 FAST 2005 WiP Report 5

Grouping

 Sequential

 Add nodes to ‘supernode’ until its capacity allows.  Use depth-first traversal to get next node  Low fragmentation

 Tree-preserving

 Groups adjacent nodes  Avoids cycles in original tree  Preserves original tree structure in grouping  Greater fragmentation

slide-6
SLIDE 6

15th December, 2005 FAST 2005 WiP Report 6

Grouping Examples

Sequential Tree-preserving Assumption: Supernode can fit 5 nodes

slide-7
SLIDE 7

15th December, 2005 FAST 2005 WiP Report 7

Building Supernode Trees

Sequential Supernode List Tree-Preserving Supernode Tree Sequential Supernode Tree

  • Uses sequential grouping
  • Nodes linked in the order they are created
  • Uses tree-preserving grouping
  • Edges according to original tree
  • Uses sequential grouping
  • Several possibilities for edge creation
  • Avoid cycles
slide-8
SLIDE 8

15th December, 2005 FAST 2005 WiP Report 8

Performance Evaluation

slide-9
SLIDE 9

15th December, 2005 FAST 2005 WiP Report 9

Future Work

 Multiple drives  Modeling more complex data and access patterns

 Allows data and application directed layout  Requires detailed model for the disk-drive

 Storing graphs on disk drives…

 More generic than trees!  Can use directed and weighted  Can model several data-types and access patterns  Can model relational data as well!