CPSC 221: Data Structures Dictionary ADT Binary Search Trees Alan - PowerPoint PPT Presentation

CPSC 221: Data Structures Dictionary ADT Binary Search Trees Alan J. Hu (Using Steve Wolfman’s Slides)

Learning Goals After this unit, you should be able to... • Determine if a given tree is an instance of a particular type (e.g. binary search tree, heap, etc.) • Describe and use pre-, in- and post-order traversal algorithms • Describe the properties of binary trees, binary search trees, and more general trees; Implement iterative and recursive algorithms for navigating them in C++ • Compare and contrast ordered versus unordered trees in terms of complexity and scope of application • Insert and delete elements from a binary tree

Today’s Outline • Binary Trees • Dictionary ADT • Binary Search Trees • Deletion • Some troubling questions

Binary Trees • Binary tree is recursive definition! – an empty tree (NULL, in our case) A – or, a root node with two subtrees • Properties B C – max # of leaves: D E F – max # of nodes: • Representation: G H Data I J left right pointer pointer

Binary Trees • Binary tree is recursive definition! – an empty tree (NULL, in our case) A – or, a root node with two subtrees • Properties B C – max # of leaves: 2 h – max # of nodes: 2 h+1 -1 D E F • Representation: G H Data I J left right pointer pointer

Representation struct Node { A KTYPE key; left right DTYPE data; pointer pointer Node * left; Node * right; B C }; left right left right pointer pointer pointer pointer A D E F B C left right left right left right pointer pointer pointer pointer pointer pointer D E F

What We Can Do So Far • Stack • List – Push – Insert – Pop – Remove – Find • Queue • Priority Queue – Enqueue – Dequeue – Insert – DeleteMin What’s wrong with Lists?

Dictionary ADT • midterm • Dictionary operations – would be tastier with insert – create brownies • brownies • prog-project – destroy - tasty – so painful… who invented – insert templates? – find • wolf – delete find(wolf) – the perfect mix of oomph and Scrabble value • wolf - the perfect mix of oomph and Scrabble value • Stores values associated with user-specified keys – values may be any (homogenous) type – keys may be any (homogenous) comparable type

Search/Set ADT • Berner • Dictionary operations • Whippet insert – create • Alsatian • Min Pin – destroy • Sarplaninac – insert • Beardie – find • Sarloos – delete • Malamute find(Wolf) • Poodle NOT FOUND • Stores keys – keys may be any (homogenous) comparable – quickly tests for membership

A Modest Few Uses • Arrays and “Associative” Arrays • Sets • Dictionaries • Router tables • Page tables • Symbol tables • C++ Structures • Python’s __dict__ that stores fields/methods

Desiderata • Fast insertion – runtime: • Fast searching – runtime: • Fast deletion – runtime:

Naïve Implementations insert find delete • Linked list • Unsorted array • Sorted array

Naïve Implementations insert find delete • Linked list • Unsorted array • Sorted array so close!

Binary Search Tree Dictionary Data Structure • Binary tree property 8 – each node has  2 children – result: 5 11 • storage is small • operations are simple • average depth is small 2 6 10 12 • Search tree property – all keys in left subtree 4 7 9 14 smaller than root’s key – all keys in right subtree larger than root’s key 13 – result: • easy to find any given key

Example and Counter-Example 5 8 4 8 5 11 1 7 11 2 7 6 10 18 3 4 15 20 21 BINARY SEARCH TREE NOT A BINARY SEARCH TREE

In Order Listing struct Node { // constructors omitted KTYPE key; 10 DTYPE data; Node *left, *right; }; 5 15 2 9 20 17 7 30 In order listing: 2  5  7  9  10  15  17  20  30

Aside: Traversals • Pre-Order Traversal: Process the data at the node first, then process left child, then process right child. • Post-Order Traversal: Process left child, then process right child, then process data at the node. • In-Order Traversal: Process left child, then process data at the node, then process right child. Code? 19

Aside: Traversals • Pre-Order Traversal: Process the data at the node first, then process left child, then process right child. • Post-Order Traversal: Process left child, then process right child, then process data at the node. • In-Order Traversal: Process left child, then process data at the node, then process right child. Who cares? These are the most common ways in which code processes trees. 20

Finding a Node Node *& find(Comparable key, 10 Node *& root) { if (root == NULL) return root; 5 15 else if (key < root->key) return find(key, 2 9 20 root->left); else if (key > root->key) return find(key, 17 7 30 root->right); a. O(1) else b. O(lg n) return root; c. O(n) } runtime: d. O(n lg n) e. None of these

Finding a Node Node *& find(Comparable key, 10 Node *& root) { if (root == NULL) return root; 5 15 else if (key < root->key) return find(key, 2 9 20 root->left); else if (key > root->key) return find(key, 17 7 30 root->right); else WARNING: Much fancy footwork with return root; refs (&) coming. You can do all of this } without refs... just watch out for special cases.

Iterative Find Node * find(Comparable key, 10 Node * root) { while (root != NULL && root->key != key) { 5 15 if (key < root->key) root = root->left; 2 9 20 else root = root->right; } 17 7 30 return root; } Look familiar? (It’s trickier to get the ref return to work here. We won’t worry.)

Insert void insert(Comparable key, 10 Node *& root) { Node *& target(find(key, root)); 5 15 assert(target == NULL); 2 9 20 target = new Node(key); } 17 7 30 runtime: Funky game we can play with the *& version.

Reminder: Value vs. Reference Parameters • Value parameters (Object foo) – copies parameter – no side effects • Reference parameters (Object & foo) – shares parameter – can affect actual value – use when the value needs to be changed • Const reference parameters (const Object & foo) – shares parameter – cannot affect actual value – use when the value is too intricate for pass-by-value

BuildTree for BSTs • Suppose the data 1, 2, 3, 4, 5, 6, 7, 8, 9 is inserted into an initially empty BST: – in order – in reverse order – median first, then left median, right median, etc.

Analysis of BuildTree • Worst case: O(n 2 ) as we’ve seen • Average case assuming all orderings equally likely turns out to be O(n lg n).

Bonus: FindMin/FindMax • Find minimum 10 5 15 • Find maximum 2 9 20 17 7 30

Double Bonus: Successor Find the next larger node 10 in this node’s subtree. // Note: If no succ, returns (a useful) NULL. 5 15 Node *& succ(Node *& root) { if (root->right == NULL) return root->right; else 2 9 20 return min(root->right); } 17 7 30 Node *& min(Node *& root) { if (root->left == NULL) return root; else return min(root->left); }

More Double Bonus: Predecessor Find the next smaller node 10 in this node’s subtree. Node *& pred(Node *& root) { 5 15 if (root->left == NULL) return root->left; else return max(root->left); 2 9 20 } Node *& max(Node *& root) { 17 7 30 if (root->right == NULL) return root; else return max(root->right); }

Today’s Outline • Some Tree Review (here for reference, not discussed) • Binary Trees • Dictionary ADT • Binary Search Trees • Deletion • Some troubling questions

Deletion 10 5 15 2 9 20 17 7 30 Why might deletion be harder than insertion?

Lazy Deletion (“Tombstones”) • Instead of physically deleting nodes, just mark them as 10 deleted + simpler 5 15 + physical deletions done in batches + some adds just flip deleted flag 2 9 20 – extra memory for “tombstone” – many lazy deletions slow finds 17 7 30 – some operations may have to be modified (e.g., min and max)

Lazy Deletion Delete(17) 10 Delete(15) 5 15 Delete(5) 2 9 20 Find(9) Find(16) 17 7 30 Insert(5) Find(17)

Real Deletion - Leaf Case 30 20 17 15 10 9 7 5 2 Delete(17)

Real Deletion - One Child Case 10 Delete(15) 5 15 2 9 20 7 30

Real Deletion - Two Child Case 30 20 10 9 7 5 2 Delete(5)

30 20 Finally… 10 9 7 2

CPSC 221: Data Structures Dictionary ADT Binary Search Trees Alan - PowerPoint PPT Presentation

CPSC 221: Data Structures Dictionary ADT Binary Search Trees Alan J. Hu (Using Steve Wolfmans Slides) Learning Goals After this unit, you should be able to... Determine if a given tree is an instance of a particular type (e.g. binary

The Dictionary ADT The dictionary ADT models a searchable collection findElement(k): if the

Trees Tiziana Ligorio 1 Todays Plan Trees Binary Tree ADT Binary Search Tree ADT 2

CPSC 221: Data Structures Dictionary ADT Hashing Alan J. Hu (Using mainly Steve Wolfmans Old

Balanced Search Trees Binary Search Trees Binary Search Tree Binary Search Tree A binary tree is

Binary search trees Dictionary ADT Binary search tree properties Search Insertion February 04,

Binary search trees Traversals (continued) Dictionary ADT Binary search tree February 12, 2020

Binary Search Trees and Balanced Binary Search Trees using AVL Trees Mark Redekopp David Kempe

Dictionary ADT Todays announcements: MT1 tonight, 7-9:00p WOOD 2 HW2 out, due Feb 5,

CMSC 206 Binary Search Trees 1 Binary Search Tree n A Binary Search Tree is a Binary Tree in

Binary Search Trees A binary search tree is a binary tree T such that - each internal node

CMSC 206 Dictionaries and Hashing The Dictionary ADT n a dictionary (table) is an abstract

Abstract Data Types EECS 214, Fall 2017 What is an ADT? An ADT defjnes: An ADT omits: How the

Trees Linear Vs non-linear data structures Types of binary trees Binary tree traversals

Stack and Queue ADT Stack Queue 2 ADT Example All main programs rely on concept of

ECE 2574: Data Structures and Algorithms - Queue ADT C. L. Wyatt Today we will look at the Queue

Binary Numbers Binary numbers look like this Binary Numbers or Binary Code Binary numbers or

File System Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Farnaz Farahanipad 1001134035

Management of Superfund Remedies in Post Construction Tracy Hopkins, EPA HQ Amanda Van Epps,

Data structures Organize your data to support various queries using little time an space

Lock-free Concurrent Level Hashing for Persistent Memory Zhangyu Chen , Yu Hua, Bo Ding, Pengfei

Minors under Structural Parameterizations Bart M.P. Jansen and Astrid Pieterse Problem is a

ECE 2574: Data Structures and Algorithms - Linked-Based Implementations C. L. Wyatt Fall 2017

CS 10: Problem solving via Object Oriented Programming Balance Agenda 1. Balanced Binary Trees

Today Amortized analysis Dynamic tables Amortized Analysis and Splay Trees Splay