with Top ZDDs K. Matsuda (The University of Tokyo) S. Denzumi (The - PowerPoint PPT Presentation

Storing Set Families More Compactly with Top ZDDs K. Matsuda (The University of Tokyo) S. Denzumi (The University of Tokyo) K. Sadakane (The University of Tokyo) 18 th Symposium on Experimental Algorithms June 16 — 18, 2020

Abstract • Purpose – Compress zero-suppressed binary decision diagram (ZDD) ≒ labeled binary directed acyclic graph (DAG) • Method – Expand a tree compression algorithm to DAGs • Result – Theoretic: Exponentially smaller than input – Experimental: Smaller than a related research in almost all cases 2020 2 18th Symposium on Experimental Algorithms -06-16

Contents • Preliminary – ZDD – Tree compression algorithms • Proposed data structure – Construction algorithm – Complexity analysis • Experiment • Conclusion 2020 3 18th Symposium on Experimental Algorithms -06-16

Preliminary ZDD DAG compression Top tree compression

ZDD • Zero-suppressed binary decision diagram [Minato 93] – Labeled binary directed acyclic graph – Represents a family of sets { {1, 2}, – Share equivalent subgraphs {1, 3}, {2, 3} } 1 • Terminology 0 1 – Branching nodes 2 0 2 1 ⁎ Label 0 1 ⁎ 0-edges and 1-edges 3 – Sink nodes 0 1 ⁎ Top or bottom ⊥ ⊤ 2020 5 18th Symposium on Experimental Algorithms -06-16

Tree compression methods • Tree grammar – Based on grammar compression for strings [Charikar et al. 05] – Traversing on compressed representations require linear time to the size of grammar [Busatto et al. 04,], [Lohrey et al. 13] • Succinct data structures – Labeled tree: LOUDS [Jacobson 89] BP [Munro, Raman 01] – Unlabeled tree: [Ferragina et al. 09] 2020 6 18th Symposium on Experimental Algorithms -06-16

Tree compression • Transform-based compression – Shares equivalent sub structures – DAG compression [Downey et al. 80] ⁎ Shares all equivalent subtrees – Top DAG compression [Bille et al. 13] ⁎ Shares equivalent subcomponents 2020 7 18th Symposium on Experimental Algorithms -06-16

DAG compression • Compress labeled DAGs – [Downey et al. 80] – Share all equivalent subtrees DAG compression 2020 8 18th Symposium on Experimental Algorithms -06-16

Problem of DAG comp. • Cannot compress substructures that repeats vertically • Example: DAG compression 𝑜 Simple, but not compressed 2020 9 18th Symposium on Experimental Algorithms -06-16

Top DAG compression • Compress labeled DAGs [Bille et al. 13] – Transform an input tree to top tree, and compress the top tree by DAG compression Input tree 2020 10 18th Symposium on Experimental Algorithms -06-16

Top DAG compression • In comparison to DAG compression: – [Best case] O( n / log σ n ) times smaller – [Worst case] O(log σ n ) times larger • Greedy construction [Bille et al. 13] – #node = O( 𝑜 log log 𝜏 𝑜 /log 𝜏 𝑜 ) – Proof is in [Hủbchle-Schneider and Raman 15] • Optimal construction [Lohrey et al. 17], [Dudek, Gawrychowski 18] – #node = O( 𝑜 /log 𝜏 𝑜 ) (information theoretic lowerbound) (n: #node of the input tree, σ: #label) 2020 11 18th Symposium on Experimental Algorithms -06-16

Top tree • A binary tree 𝒰 that represents the way to decompose the input tree T – Each node of the top tree corresponds to a cluster of T – The root of the top tree corresponds to whole T – A cluster is an induced subgraph of a set of connected edges – Every cluster has at most 2 boundary nodes – A cluster is made by horizontal or vertical merge of 2 clusters that have the same node as a boundary node 2020 12 18th Symposium on Experimental Algorithms -06-16

Top tree • A binary tree 𝒰 that represents the way to decompose the input tree T – A cluster is an induced subgraph of a set of connected edges – Every cluster has at most 2 boundary nodes A cluster 2020 13 18th Symposium on Experimental Algorithms -06-16

Top tree • Example: V (b) 1 V (b) 2 7 H H H (c) (e) (c) 3 6 4 5 top tree 𝒰 Input tree 𝑈 2020 14 18th Symposium on Experimental Algorithms -06-16

Top tree • Example: V 1 V 2 7 H H H 3 6 4 5 top DAG 𝒰𝐸 Input tree 𝑈 2020 15 18th Symposium on Experimental Algorithms -06-16

Advantage of top DAG • Top DAG compression allows sharing the same substructure that appear at different height DAG compression 𝑜 𝑜 top DAG compression log 𝑜 2020 16 18th Symposium on Experimental Algorithms -06-16

Two types of merging • Vertical merge: (a), (b) • Horizontal merge: (c), (d), (e) [Bille et al. 13] 2020 17 18th Symposium on Experimental Algorithms -06-16

Horizontal merge • Merge two clusters that have the same node as their top boundary nodes Corresponding top tree Example clusters H (c) left right A B A B 2020 18 18th Symposium on Experimental Algorithms -06-16

Vertical merge • Merge two clusters that have the same node as their top and bottom boundary Corresponding top tree clusters Example V (a) left right A A B B 2020 19 18th Symposium on Experimental Algorithms -06-16

Top tree construction • Top tree is not uniquely determined from the input tree • Greedy construction – Repeat 1 — 3 until the tree T become 1 edge – 1. Choose pairs of clusters that can be horizontally merged as much as possible – 2. Choose pairs of clusters that can be vertically merged from remaining nodes as much as possible – 3. Merge the all pairs chosen at 1 and 2 2020 20 18th Symposium on Experimental Algorithms -06-16

Greedy construction • Example 1 2 7 3 6 4 5 top tree 𝒰 Tree 𝑈 2020 21 18th Symposium on Experimental Algorithms -06-16

Greedy construction • Example 1 2 7 H H H 3 6 4 5 top tree 𝒰 Tree 𝑈 2020 22 18th Symposium on Experimental Algorithms -06-16

Greedy construction • Example 1 2 H H H 3 4 Tree 𝑈 top tree 𝒰 2020 23 18th Symposium on Experimental Algorithms -06-16

Greedy construction • Example 1 V 2 H H H 3 4 Tree 𝑈 top tree 𝒰 2020 24 18th Symposium on Experimental Algorithms -06-16

Greedy construction • Example 1 V 2 H H H 4 Tree 𝑈 top tree 𝒰 2020 25 18th Symposium on Experimental Algorithms -06-16

Greedy construction • Example V 1 V 2 H H H 4 Tree 𝑈 top tree 𝒰 2020 26 18th Symposium on Experimental Algorithms -06-16

Greedy construction • Example V 1 V 4 H H H Tree 𝑈 top tree 𝒰 2020 27 18th Symposium on Experimental Algorithms -06-16

Complexity: Greedy method • n : #node of an input tree, σ : #label Theorem [Bille et al. 13] The height of the top tree made by greedy construction is O(log n ) Theorem [Hủbchle-Schneider, Raman 15] The number of nodes of top DAG obtained after DAG compression to the top tree made by greedy construction is O( ( n log log σ n ) / log σ n ) 2020 28 18th Symposium on Experimental Algorithms -06-16

Operations on top DAG • Following operations are in O(log n ) time – ( x : x -th node in DFS, T ( x ) : a subtree rooted by x ) – access( x ) : label of x – parent( x ) : preorder of the parent of x – depth( x ) : depth of x – height( x ) : height of x – size( x ) : number of nodes in T ( x ) – firstchild( x ) : preorder of the first child of x – nextsibling( x ) : preorder of the next sibling of x – la( x, i ) : preorder of i -th ancestor of x – nca( x, y ) : preorder of nearest common ancestor of x and y 2020 29 18th Symposium on Experimental Algorithms -06-16

Proposed method Top ZDD Construction algorithm Experiment

Construction of top ZDD • 1. Find a spanning tree from input ZDD – The edges not included in the spanning tree is called non tree edges • 2. Transform the spanning tree to a top tree by greedy construction • 3. For each non tree edge, store the edge at the nearest common ancestor of the source node and the destination node (Edges point sink nodes are exception) • 4. Share equivalent subtrees 2020 31 18th Symposium on Experimental Algorithms -06-16

Example of construction • Step 0. Input 1 2 3 4 ⊤ ⊥ Original ZDD 2020 32 18th Symposium on Experimental Algorithms -06-16

Example of construction • Step 1. Find a spanning tree 1 1 2 2 7 3 3 6 4 4 5 ⊤ ⊥ Original ZDD 2020 33 18th Symposium on Experimental Algorithms -06-16

Example of construction • Step 2. Construct a top tree 1 1 V 2 2 7 V 3 3 6 H H H 4 4 5 ⊤ ⊥ Original ZDD 2020 34 18th Symposium on Experimental Algorithms -06-16

Example of construction • Step 3. Store non tree edges 1 1 V 2 2 7 V 3 3 6 H H H 4 4 5 ⊤ ⊥ Original ZDD 2020 35 18th Symposium on Experimental Algorithms -06-16

Example of construction • Step 4. Share equivalent subtrees 1 1 V 2 2 7 V 3 3 6 H H 4 4 5 ⊤ ⊥ Original ZDD 2020 36 18th Symposium on Experimental Algorithms -06-16

Theoretical results Theorem Memory usage of top ZDD is O(log n) in the best case • Examples Theorem Edge traversal is O(log 2 n ) time 2020 37 18th Symposium on Experimental Algorithms -06-16

with Top ZDDs K. Matsuda (The University of Tokyo) S. Denzumi (The - PowerPoint PPT Presentation

Storing Set Families More Compactly with Top ZDDs K. Matsuda (The University of Tokyo) S. Denzumi (The University of Tokyo) K. Sadakane (The University of Tokyo) 18 th Symposium on Experimental Algorithms June 16 18, 2020 Abstract Purpose

To TOP or NOT to TOP www.SAS.com To TOP or NOT to TOP Using the TOP command in Linux By Len van

Boosted Top Tagging Seung J. Lee Outline Introduction: top jets @ LHC Modern boosted top

Puncher/Squeezer Riveting Tools BEST PRACTICES 2018 Tool Uses Top Rail Punch Top Rail

Class 42: Free symmetric top Class 42: Free symmetric top Free symmetric top in body frame Assume

Top ten mental tips Number one Know your real goal Top ten mental tips Number two Get nervous

THE HOMOTOPY TYPE OF G/ TOP QAYUM KHAN 1. Definition of G/ TOP Recall TOP n is the topological

Chapter 3: Top-k Query Processing and Indexing 3.1 Top-k Algorithms 3.2 Approximate Top-k Query

Top Physics @FCCee Patrizia Azzi - INFN Padova & CERN 1 how is top physics doing now?

Colour & Precision Top Physics Peter Skands (Monash University) Perturbative aspects of top

Top polarisation at colliders Top polarisation: what physics can it probe Probes of the top

Quality Sampling Top Ten Reasons to Drill a Hole Top Ten Reasons to Drill a Hole Getting paid

The 2013 Top Ten Project Lynette Robertson & Rhonda Seamons The Top Top Ten List from the

Data Science. Credit Scoring. Digital Finance TOP-2 TOP-50 BEST ALTERNATIVE EUROPE TOP 100:

The Top Ten Project Rhonda Seamons The Top Top Ten List from the Book at Barnes & Noble 1.

Less is More Neil Heseltine Hill Top Farm Malham @hilltopfarmgirl Hill Top Farm

Founded 2009 Top selling Jira app 18 Employees Top selling Confluence app 6 Nationalities Gold Top

Complementing Bchi automata Guillaume Sadegh LRDE EPITA Research and Development Laboratory

Chapter 24 Sorting networks NEW CS 473: Theory II, Fall 2015 November 19, 2015 24.1 Model of

The role of Guidelines in EU competition law David Bailey Brick Court Chambers Kings College

The Diagonal Approach to Health System Strengthening: A Roadmap for Expanding Access in LMICs

Line segment intersection Find all pairs of intersecting line segments. Find all pairs of

Business Strategy Strategic Alternatives Strategic Alternatives Nikolaos Karanasios Assistant

INF4820 Algorithms for AI and NLP Hierarchical Clustering Erik Velldal & Stephan

Boris Klemz, Martin N ollenburg, Roman Prutkin 1) Freie Universit at Berlin 2) Vienna

with Top ZDDs K. Matsuda (The University of Tokyo) S. Denzumi (The - PowerPoint PPT Presentation

Storing Set Families More Compactly with Top ZDDs K. Matsuda (The University of Tokyo) S. Denzumi (The University of Tokyo) K. Sadakane (The University of Tokyo) 18 th Symposium on Experimental Algorithms June 16 18, 2020 Abstract Purpose

To TOP or NOT to TOP www.SAS.com To TOP or NOT to TOP Using the TOP command in Linux By Len van

Boosted Top Tagging Seung J. Lee Outline Introduction: top jets @ LHC Modern boosted top

Puncher/Squeezer Riveting Tools BEST PRACTICES 2018 Tool Uses Top Rail Punch Top Rail

Class 42: Free symmetric top Class 42: Free symmetric top Free symmetric top in body frame Assume

Top ten mental tips Number one Know your real goal Top ten mental tips Number two Get nervous

THE HOMOTOPY TYPE OF G/ TOP QAYUM KHAN 1. Definition of G/ TOP Recall TOP n is the topological

Chapter 3: Top-k Query Processing and Indexing 3.1 Top-k Algorithms 3.2 Approximate Top-k Query

Top Physics @FCCee Patrizia Azzi - INFN Padova &amp; CERN 1 how is top physics doing now?

Colour &amp; Precision Top Physics Peter Skands (Monash University) Perturbative aspects of top

Top polarisation at colliders Top polarisation: what physics can it probe Probes of the top

Quality Sampling Top Ten Reasons to Drill a Hole Top Ten Reasons to Drill a Hole Getting paid

The 2013 Top Ten Project Lynette Robertson &amp; Rhonda Seamons The Top Top Ten List from the

Data Science. Credit Scoring. Digital Finance TOP-2 TOP-50 BEST ALTERNATIVE EUROPE TOP 100:

The Top Ten Project Rhonda Seamons The Top Top Ten List from the Book at Barnes &amp; Noble 1.

Less is More Neil Heseltine Hill Top Farm Malham @hilltopfarmgirl Hill Top Farm

Founded 2009 Top selling Jira app 18 Employees Top selling Confluence app 6 Nationalities Gold Top

Complementing Bchi automata Guillaume Sadegh LRDE EPITA Research and Development Laboratory

Chapter 24 Sorting networks NEW CS 473: Theory II, Fall 2015 November 19, 2015 24.1 Model of

The role of Guidelines in EU competition law David Bailey Brick Court Chambers Kings College

The Diagonal Approach to Health System Strengthening: A Roadmap for Expanding Access in LMICs

Line segment intersection Find all pairs of intersecting line segments. Find all pairs of

Business Strategy Strategic Alternatives Strategic Alternatives Nikolaos Karanasios Assistant

INF4820 Algorithms for AI and NLP Hierarchical Clustering Erik Velldal &amp; Stephan

Boris Klemz, Martin N ollenburg, Roman Prutkin 1) Freie Universit at Berlin 2) Vienna

Top Physics @FCCee Patrizia Azzi - INFN Padova & CERN 1 how is top physics doing now?

Colour & Precision Top Physics Peter Skands (Monash University) Perturbative aspects of top

The 2013 Top Ten Project Lynette Robertson & Rhonda Seamons The Top Top Ten List from the

The Top Ten Project Rhonda Seamons The Top Top Ten List from the Book at Barnes & Noble 1.

INF4820 Algorithms for AI and NLP Hierarchical Clustering Erik Velldal & Stephan