CS 225 Data Structures Oc October 26 26 Ha Hashing G G Carl - PowerPoint PPT Presentation

CS 225 Data Structures Oc October 26 26 – Ha Hashing G G Carl Evans

What if 𝑃(𝑚𝑝𝑕 ! 𝑜) is not fast enough? Do you feel lucky?

A H A Hash T Table b based D Dictionary Client Code: 1 Dictionary<KeyType, ValueType> d; 2 d[k] = v; A Hash Table consists of three things: 1. A hash function, f(k) 2. An array 3. Something to handle chaos when it occurs!

A P A Perf rfect H Hash F Function (Angrave, CS 241) Key Value (Beckman, CS 421) (Challen, CS 125) Hash function (Davis, CS 101) (Evans, CS 126) (Fagen-Ulmschneider, CS 107) (Gunter, CS 422) (Herman, CS 233)

Ha Hash h Func Function Our hash function consists of two parts: • A hash : • A compression: Choosing a good hash function is tricky… • Don’t create your own (yet*) • Very smart people have created very bad hash functions

Ha Hash h Func Function Characteristics of a good hash function: 1. Computation Time: 2. Deterministic: 3. Satisfy the SUHA:

Gen Gener eral al Purpose e Has ash Fu Functi tion Keyspaces … Easy to create if: |KeySpace| N ~

Gen Gener eral al Purpose e Has ash Fu Functi tion Keyspaces … Easy to create if: |KeySpace| N ~ Difficult to Create: …

Ha Hash h Func Function In CS 225, we focus on general purpose hash functions. Other hash functions exists with different properties (eg: cryptographic hash functions)

(Example of open hashing) Collision Col on H Handling: Se : Separate Ch Chaining S = { 16, 8, 4, 13, 29, 11, 22 } |S| = n h(k) = k % 7 |Array| = N 0 1 2 3 4 5 6 Worst Case SUHA Insert Remove/Find

(Example of closed hashing) Collision Col on H Handling: P : Prob obe-ba base sed d Ha Hashi shing ng S = { 16, 8, 4, 13, 29, 11, 22 } |S| = n h(k) = k % 7 |Array| = N 0 1 2 3 4 5 6

(Example of closed hashing) Col Collision on H Handling: Li : Linear P r Prob obing S = { 16, 8, 4, 13, 29, 11, 22 } |S| = n h(k) = k % 7 |Array| = N Try h(k) = (k + 0) % 7, if full… 0 Try h(k) = (k + 1) % 7, if full… 1 Try h(k) = (k + 2) % 7, if full… 2 Try … 3 4 5 6 Worst Case SUHA Insert Remove/Find

A P A Problem w w/ L / Linear P r Probing Primary clustering: Description: Remedy:

(Example of closed hashing) Collision Col on H Handling: D : Dou ouble h hashing S = { 16, 8, 4, 13, 29, 11, 22 } |S| = n h(k) = k % 7 |Array| = N Try h(k) = (k + 0*h 2 (k)) % 7, if full… 0 Try h(k) = (k + 1*h 2 (k)) % 7, if full… 1 Try h(k) = (k + 2*h 2 (k)) % 7, if full… 2 Try … 3 4 h(k, i) = (h 1 (k) + i*h 2 (k)) % 7 5 6

Ru Running T Times The expected number of probes for find(key) under SUHA Linear Probing: (Don’t memorize these • Successful: ½(1 + 1/(1-α)) equations, no need.) • Unsuccessful: ½(1 + 1/(1-α)) 2 Instead, observe: Double Hashing: • Successful: 1/α * ln(1/(1-α)) - As α increases: • Unsuccessful: 1/(1-α) Separate Chaining: - If α is constant: • Successful: 1 + α/2 • Unsuccessful: 1 + α

Ru Running T Times The expected number of probes for find(key) under SUHA Linear Probing: • Successful: ½(1 + 1/(1-α)) • Unsuccessful: ½(1 + 1/(1-α)) 2 Double Hashing: • Successful: 1/α * ln(1/(1-α)) • Unsuccessful: 1/(1-α)

Re ReHashing What if the array fills?

Which collision resolution strategy is better? • Big Records: • Structure Speed: What structure do hash tables replace? What constraint exists on hashing that doesn’t exist with BSTs? Why talk about BSTs at all?

Ru Running T Times Hash Table AVL Linked List Amortized: Find Worst Case: Amortized: Insert Worst Case: Storage Space

st std da data struc uctur ures std::map

st std da data struc uctur ures std::map ::operator[] ::insert ::erase ::lower_bound(key) è Iterator to first element ≤ key ::upper_bound(key) è Iterator to first element > key

st std da data struc uctur ures std::unordered_map ::operator[] ::insert ::erase ::lower_bound(key) è Iterator to first element ≤ key ::upper_bound(key) è Iterator to first element > key

st std da data struc uctur ures std::unordered_map ::operator[] ::insert ::erase ::lower_bound(key) è Iterator to first element ≤ key ::upper_bound(key) è Iterator to first element > key ::load_factor() ::max_load_factor(ml) è Sets the max load factor

CS 225 Data Structures Oc October 26 26 Ha Hashing G G Carl - PowerPoint PPT Presentation

CS 225 Data Structures Oc October 26 26 Ha Hashing G G Carl Evans What if ( ! ) is not fast enough? Do you feel lucky? A H A Hash T Table b based D Dictionary Client Code: 1 Dictionary<KeyType,

Fanny ny B Bay fire p protect ctio ion s n servic ice # #225 225 Fanny B Fann y Bay f

CS 225 Data Structures Se Septembe ber 23 23 It Iterators G G Carl Evans CS 225 CS

INFCIRC/225/Rev 5 Implementation at a Facility Level: Common Issues and Best Practices Oleg

CS 225 Data Structures Se Septembe ber 23 23 St Stac acks, Que ueue ues and and

CS 225 Data Structures September 24 It Iterators Wade Fagen-Ulm lmschneid ider CS 225 So

CS 225 Data Structures Fe February 11 It Itera erators rs Wade e Fa Fagen-Ul Ulms

Sisters of the Presentation of Mary A name - A heritage - A prophecy 225 years of

1 5mm Gusset | Clip in & Supplied Flat Final size 225 x 319mm | A2 Sheet Required 2 20mm

I- PRESENTATION I- PRESENTATION I- PRESENTATION Blue Sea Marine Supply is the creation of former

Paleozoic era (570-225 mya) -first vertebrates emerged -first mammal-like reptiles -Pangaea

Using Git Hooks to Help Your 40 40 40 Joo Santos Engineering Teams Work Software Engineer

Multicast Address-Set Claim (MASC) Implemen tation P a vlin Radosla v o v (USC/ISI)

Genetic Medicines for Rare CNS Diseases OCTOBER 2019 / Company Overview $225.5M Developing

2018.2/225 MATERNITY LEARNING Presentation attached to Minutes The Chair welcomed Mr Adam

Inve In vestor stor Presentat sentation ion USD 225 25-275 75m m Senior r Secured d Bond

OJSC ESRE Efrolen (Polyisobutylene/PIB) High Molecular Weight : P-230, P-225, P-220,

Testing and Debugging Gordon Fraser and Andreas Zeller Saarland University 1 Some Bugs 2 2

M87 Avery E. Broderick Sheperd Doeleman (MIT Haystack) Avi Loeb (Harvard) Vincent Fish (MIT

CPSC 102 Debugging Z A IN R IZ V I A Little History Lesson 1946 Page from Harvard

CSE 440: Introduction to HCI User Interface Design, Prototyping, and Evaluation James Fogarty

return password return hash( password ) return hash( password, salt )

Notes on Hashing Owen Jow Last updated May 05, 2018 This document is intended to give a

DATABASE SYSTEM IMPLEMENTATION GT 4420/6422 // SPRING 2019 // @JOY_ARULRAJ LECTURE #20: PARALLEL

CS 473: Algorithms Chandra Chekuri Ruta Mehta University of Illinois, Urbana-Champaign Fall

CS 225 Data Structures Oc October 26 26 Ha Hashing G G Carl - PowerPoint PPT Presentation

CS 225 Data Structures Oc October 26 26 Ha Hashing G G Carl Evans What if ( ! ) is not fast enough? Do you feel lucky? A H A Hash T Table b based D Dictionary Client Code: 1 Dictionary<KeyType,

Fanny ny B Bay fire p protect ctio ion s n servic ice # #225 225 Fanny B Fann y Bay f

CS 225 Data Structures Se Septembe ber 23 23 It Iterators G G Carl Evans CS 225 CS

INFCIRC/225/Rev 5 Implementation at a Facility Level: Common Issues and Best Practices Oleg

CS 225 Data Structures Se Septembe ber 23 23 St Stac acks, Que ueue ues and and

CS 225 Data Structures September 24 It Iterators Wade Fagen-Ulm lmschneid ider CS 225 So

CS 225 Data Structures Fe February 11 It Itera erators rs Wade e Fa Fagen-Ul Ulms

Sisters of the Presentation of Mary A name - A heritage - A prophecy 225 years of

1 5mm Gusset | Clip in &amp; Supplied Flat Final size 225 x 319mm | A2 Sheet Required 2 20mm

I- PRESENTATION I- PRESENTATION I- PRESENTATION Blue Sea Marine Supply is the creation of former

Paleozoic era (570-225 mya) -first vertebrates emerged -first mammal-like reptiles -Pangaea

Using Git Hooks to Help Your 40 40 40 Joo Santos Engineering Teams Work Software Engineer

Multicast Address-Set Claim (MASC) Implemen tation P a vlin Radosla v o v (USC/ISI)

Genetic Medicines for Rare CNS Diseases OCTOBER 2019 / Company Overview $225.5M Developing

2018.2/225 MATERNITY LEARNING Presentation attached to Minutes The Chair welcomed Mr Adam

Inve In vestor stor Presentat sentation ion USD 225 25-275 75m m Senior r Secured d Bond

OJSC ESRE Efrolen (Polyisobutylene/PIB) High Molecular Weight : P-230, P-225, P-220,

Testing and Debugging Gordon Fraser and Andreas Zeller Saarland University 1 Some Bugs 2 2

M87 Avery E. Broderick Sheperd Doeleman (MIT Haystack) Avi Loeb (Harvard) Vincent Fish (MIT

CPSC 102 Debugging Z A IN R IZ V I A Little History Lesson 1946 Page from Harvard

CSE 440: Introduction to HCI User Interface Design, Prototyping, and Evaluation James Fogarty

return password return hash( password ) return hash( password, salt )

Notes on Hashing Owen Jow Last updated May 05, 2018 This document is intended to give a

DATABASE SYSTEM IMPLEMENTATION GT 4420/6422 // SPRING 2019 // @JOY_ARULRAJ LECTURE #20: PARALLEL

CS 473: Algorithms Chandra Chekuri Ruta Mehta University of Illinois, Urbana-Champaign Fall

1 5mm Gusset | Clip in & Supplied Flat Final size 225 x 319mm | A2 Sheet Required 2 20mm