7 Hashing: chaining Summer Term 2010 Robert Elssser Robert - PowerPoint PPT Presentation

7 Hashing: chaining Summer Term 2010 Robert Elsässer Robert Elsässer

Possible ways of treating collisions Treatment of collisions: � Collisions are treated differently in different methods. � A data set with key s is called a colliding element if bucket B h ( s) is already taken by another data set. � What can we do with colliding elements? 1. Chaining: Implement the buckets as linked lists. Colliding elements are stored in these lists. 2. Open Addressing: Colliding elements are stored in other vacant buckets. During storage and lookup, these are found through so-called probing. 05.05.2010 Theory 1 - Hashing: Chaining 2

Chaining (1) � The hash table is an array (length m ) of lists. y ( g ) Each bucket is implemented by a list. class hashTable { List[] ht; // an array of lists hashTable (int m){ // Construktor ht = new List[m]; for (int i = 0; i < m; i++) ht[i] = new List(); // Construct a list } ... } � � Two different ways of using lists: Two different ways of using lists: 1. Direct chaining: Hash table only contains list headers; the data sets are stored in the lists lists. 2. Separate chaining: Hash table contains at most one data set in each bucket as well as a list header Colliding elements are stored in the list header. Colliding elements are stored in the list. 05.05.2010 Theory 1 - Hashing: Chaining 3

Haching by chaining Keys are stored in overflow lists y h ( k ) = k mod 7 4 6 0 1 2 3 5 hash table T pointer 53 12 15 5 2 43 5 colliding elements 19 This type of chaining is also known as direct chaining. Thi t f h i i i l k di t h i i 05.05.2010 Theory 1 - Hashing: Chaining 4

Chaining Lookup key k p y - Compute h ( k ) and overflow list T [ h ( k )] - Look for k in the overflow list Insert a key k - Lookup k (fails) - Insert k in the overflow list Insert k in the overflow list Remove a key k - Lookup k (successfully) Lookup k (successfully) - Remove k from the overflow list only list operations 05.05.2010 Theory 1 - Hashing: Chaining 5

Implementation in Java class TableEntry { private Object key,value; i t Obj t k l } abstract class HashTable { private TableEntry[] tableEntry; p private int capacity; ate t capac ty; // Constructor HashTable (int capacity) { this.capacity = capacity; tableEntry = new TableEntry [capacity]; for (int i = 0; i <= capacity-1; i++) f (i i 0 i i 1 i ) tableEntry[i] = null; } // the hash function protected abstract int h (Object key); // insert element with given key and value (if not there already) public abstract void insert (Object key Object value); // delete element with given key (if there) public abstract void delete (Object key); // lookup element with given key public abstract Object search (Object key); } // class hashTable 05.05.2010 Theory 1 - Hashing: Chaining 6

Implementation in Java class ChainedTableEntry extends TableEntry { // C // Constructor t t ChainedTableEntry(Object key, Object value) { super(key, value); this.next = null; } private ChainedTableEntry next; } class ChainedHashTable extends HashTable { // the hash function public int h(Object key) { public int h(Object key) { return key.hashCode() % capacity ; } // lookup key in the hash table public Object search (Object key) { ChainedTableEntry p; p = (ChainedTableEntry) tableEntry[h(key)]; // Go through the liste until end reached or key found while (p != null && !p.key.equals(key)) { p p = p.next; p next; } // Return result if (p != null) return p.value; else return null; } 05.05.2010 Theory 1 - Hashing: Chaining 7

Implementation in Java /* Insert an element with given key and value (if not there) */ public void insert (Object key, Object value) { bli id i t (Obj t k Obj t l ) { ChainedTableEntry entry = new ChainedTableEntry(key, value); // Get table entry for key int k = h (key); ChainedTableEntry p; C a ed ab e t y p; p = (ChainedTableEntry) tableEntry [k]; if (p == null){ tableEntry[k] = entry; return ; } // Lookup key while (!p.key.equals(key) && p.next != null) { p = p.next; } // Insert the element (if not there) if (!p.key.equals(key)) p.next = entry; } 05.05.2010 Theory 1 - Hashing: Chaining 8

Implementation in Java // Delete element with given key (if there) public void delete (Object key) { int k = h (key); ChainedTableEntry p; p = (ChainedTableEntry) TableEntry [k]; TableEntry[k] = recDelete(p, key); TableEntry[k] recDelete(p, key); } // Delete element with key recursively (if there) public ChainedTableEntry recDelete (ChainedTableEntry p, Object key) { /* recDelete returns a pointer to the start of the list that p points to, in which key was deleted */ if (p == null) return null; if (p.key.equals(key)) return p.getNext(); p g (); // otherwise: p.next = recDelete(p.next, key); return p; } public void printTable () {...} bli id i tT bl () { } } // class ChainedHashTable 05.05.2010 Theory 1 - Hashing: Chaining 9

Test program public class ChainedHashingTest { public static void main(String args[]){ bli t ti id i (St i []){ Integer[] t= new Integer[args.length]; for (int i = 0; i < args.length; i++) t[i] = Integer.valueOf(args[i]); ChainedHashTable h = new ChainedHashTable(7); ChainedHashTable h = new ChainedHashTable(7); for (int i = 0; i <= t.length - 1; i++) h.insert(t[i], null); h.printTable (); h.delete(t[0]); h.delete(t[1]); h.delete(t[6]); h.printTable(); } } Call : Call : java ChainedHashingTest 12 53 5 15 2 19 43 Output : 0: -| 0: -| 1: 15 -> 43 -| 1: 15 43 | 1: 15 -| 1: 15 | 2: 2 -| 2: 2 -| 3: -| 3: -| 4: 53 -| 4: -| 5: 12 -> 5 -> 19 -| 5: 5 -> 19 -| 6: | 6: -| 6: | 6: -| 05.05.2010 Theory 1 - Hashing: Chaining 10

Analysis of direct chaining Uniform hashing assumption: Uniform hashing assumption: � All hash addresses are chosen with the same probability, i.e.: Pr ( h ( k i ) = j ) = 1/ m � independent from operation to operation Average chain length for n entries: n / m = Definition: C´ n = Expected number of entries inspected during a failed search C´ E t d b f t i i t d d i f il d h C n = Expected number of entries inspected during a successful search Analysis: Analysis: 05.05.2010 Theory 1 - Hashing: Chaining 11

Chaining Advantages: g + C n and C´ n are small + > 1 possible + real distances + suitable for secondary memory + suitable for secondary memory Efficiency of lookup C n (successful) C ( f l) C´ ( C´ n (unsuccessful) f l) 0.50 1.250 0.50 0.90 1.450 0.90 0.95 1.457 0.95 1 00 1.00 1 500 1.500 1 00 1.00 2.00 2.000 2.00 3.00 2.500 3.00 Disad antages Disadvantages: - Additional space for pointers - Colliding elements are outside the hash table 05.05.2010 Theory 1 - Hashing: Chaining 12

Summary Analysis of hashing with chaining: Analysis of hashing with chaining: � worst case: h ( s ) always yields the same value, all data sets are in a list. Behavior as in linear lists. � average case: – Successful lookup & delete: – Successful lookup & delete: complexity (in inspections) ≈ 1 + 0.5 × load factor – Failed lookup & insert: complexity ≈ load factor This holds for direct chaining, with separate chaining the complexity is a bit hi h bit higher. � best case: lookup is an immediate success: complexity ∈ O (1). lookup is an immediate success: complexity ∈ O (1). 05.05.2010 Theory 1 - Hashing: Chaining 13

7 Hashing: chaining Summer Term 2010 Robert Elssser Robert - PowerPoint PPT Presentation

7 Hashing: chaining Summer Term 2010 Robert Elssser Robert Elssser Possible ways of treating collisions Treatment of collisions: Collisions are treated differently in different methods. A data set with key s is called a

14. Hashing Hash Tables, Pre-Hashing, Hashing, Resolving Collisions using Chaining, Simple

Overview Intro to Hashing Intro to Hashing Hashing with Chaining Whats hashing?

14. Hashing Hash Tables, Pre-Hashing, Hashing, Resolving Collisions using Chaining, Simple

Today. Cuckoo hashing. Today. Cuckoo hashing. Johnson-Lindenstrass. Cuckoo hashing. Hashing

Chaining Operator in Climb Method Chaining jQuery Method Chaining Extended Climb Christopher

Hashing (Application of Probability) Ashwinee Panda Final CS 70 Lecture! 9 Aug 2018 Overview

Hashing Algorithms Hash functions Separate Chaining Linear Probing Double Hashing Symbol-Table

Using first order logic (Ch. 9) Backward chaining Backward chaining is almost the opposite of

Lecture 8: Hashing I Lecture Overview Dictionaries and Python Motivation Prehashing

Database Systems Index: Hashing Based on slides by Feifei Li, University of Utah Hashing n

Hashing Connections 2-Universal Hash Function Perfect Hashing Anil Maheshwari Proofs

Union-Find [10] In the last class Hashing Collision Handling for Hashing Closed

Hashing Chapter 5 1 Objectives Understand the idea of hashing Compare hashing to sorting

Advanced Algorithms COMS31900 Hashing part one Chaining, true randomness and universal

Week 9 Oliver Kullmann Generalising arrays Hash tables Direct addressing Hashing in

Week 9 Oliver Kullmann Generalising arrays Hash tables Direct addressing Hashing in

Overflow Handling Linear Probing Get And Put An overflow occurs when the home bucket for

CSE 6350 File and Storage System Infrastructure in Data centers Supporting Internet-wide Services

Data Structures in Java Session 14 Instructor: Bert Huang

Hashing CptS 223 Advanced Data Structures Larry Holder School of Electrical Engineering and

EECS 3401 AI and Logic Prog. Lecture 8 Adapted from slides of Brachman & Levesque

Symbol Table ALSU Textbook Chapter 2.7 and 6.5 Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

Searching Binary search (log n ) Better Can we achieve (1) search time? Chapter 9

H a s h t a b l e s H a s h t a b l e s n a i v e l y A h a s h t

7 Hashing: chaining Summer Term 2010 Robert Elssser Robert - PowerPoint PPT Presentation

7 Hashing: chaining Summer Term 2010 Robert Elssser Robert Elssser Possible ways of treating collisions Treatment of collisions: Collisions are treated differently in different methods. A data set with key s is called a

14. Hashing Hash Tables, Pre-Hashing, Hashing, Resolving Collisions using Chaining, Simple

Overview Intro to Hashing Intro to Hashing Hashing with Chaining Whats hashing?

14. Hashing Hash Tables, Pre-Hashing, Hashing, Resolving Collisions using Chaining, Simple

Today. Cuckoo hashing. Today. Cuckoo hashing. Johnson-Lindenstrass. Cuckoo hashing. Hashing

Chaining Operator in Climb Method Chaining jQuery Method Chaining Extended Climb Christopher

Hashing (Application of Probability) Ashwinee Panda Final CS 70 Lecture! 9 Aug 2018 Overview

Hashing Algorithms Hash functions Separate Chaining Linear Probing Double Hashing Symbol-Table

Using first order logic (Ch. 9) Backward chaining Backward chaining is almost the opposite of

Lecture 8: Hashing I Lecture Overview Dictionaries and Python Motivation Prehashing

Database Systems Index: Hashing Based on slides by Feifei Li, University of Utah Hashing n

Hashing Connections 2-Universal Hash Function Perfect Hashing Anil Maheshwari Proofs

Union-Find [10] In the last class Hashing Collision Handling for Hashing Closed

Hashing Chapter 5 1 Objectives Understand the idea of hashing Compare hashing to sorting

Advanced Algorithms COMS31900 Hashing part one Chaining, true randomness and universal

Week 9 Oliver Kullmann Generalising arrays Hash tables Direct addressing Hashing in

Week 9 Oliver Kullmann Generalising arrays Hash tables Direct addressing Hashing in

Overflow Handling Linear Probing Get And Put An overflow occurs when the home bucket for

CSE 6350 File and Storage System Infrastructure in Data centers Supporting Internet-wide Services

Data Structures in Java Session 14 Instructor: Bert Huang

Hashing CptS 223 Advanced Data Structures Larry Holder School of Electrical Engineering and

EECS 3401 AI and Logic Prog. Lecture 8 Adapted from slides of Brachman &amp; Levesque

Symbol Table ALSU Textbook Chapter 2.7 and 6.5 Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

Searching Binary search (log n ) Better Can we achieve (1) search time? Chapter 9

H a s h t a b l e s H a s h t a b l e s n a i v e l y A h a s h t

EECS 3401 AI and Logic Prog. Lecture 8 Adapted from slides of Brachman & Levesque