Binary Search Searching an Array 1 Linear Search Go through the - PowerPoint PPT Presentation

Binary Search

Searching an Array 1

Linear Search  Go through the array position by position until we find x int search(int x, int[] A, int n) //@requires n == \length(A); /*@ensures (\result == -1 && !is_in(x, A, 0, n)) || (0 <= \result && \result < n && A[\result] == x); @*/ { for (int i = 0; i < n; i++) { if (A[i] == x) return i; } return -1; Loop invariants } omitted  Worst case complexity: O(n) 2

Linear Search on Sorted Arrays  Stop early if we find an element greater than x int search(int x, int[] A, int n) //@requires n == \length(A); //@requires is_sorted(A, 0, n); /*@ensures (\result == -1 && !is_in(x, A, 0, n)) || (0 <= \result && \result < n && A[\result] == x); @*/ { for (int i = 0; i < n; i++) { if (A[i] == x) return i; if (x < A[i]) return -1; //@assert A[i] < x; Loop invariants } omitted return -1; }  Worst case complexity: still O(n) o e.g., if x is larger than any element in A 3

Can we do Better on Sorted Arrays?  Look in the middle! Piece of cake! o compare midpoint element with x o if found, great! o if x is smaller, look for x in the lower half o if x is bigger, look for x in the upper half  This is Binary Search  Why better? o we are throwing out half of the array each time!  with linear search, we were throwing out just one element! o if array has length n , we can halve it only log n times 4

A Cautionary Tale Jon Bentley Only 10% of programmers can write binary search o 90% had bugs!  Binary search dates back to 1946 (at least) o First correct description in 1962  Jon Bentley wrote the Jon Bentley , definitive binary search Algorithms professor at CMU in the 1980s  proved it correct Read more at https://reprog.wordpress.com/2010/04/19/ar e-you-one-of-the-10-percent/ 5

More of a Cautionary Tale Joshua Bloch  Joshua Bloch finds a bug in Jon Joshua Bloch , Bentley’s definitive binary search! • student of Jon Bentley • works at Google o that Bentley had proved correct!!! • occasionally adjunct prof. at CMU  Went on to implementing several searching and sorting algorithms used in Android, Java and Python o e.g., TimSort Read more at https://ai.googleblog.com/2006/06/extra-extra-read-all- about-it-nearly.html 6

Even More of a Cautionary Tale  Researchers find a bug in Joshua Bloch’s code for TimSort o Implemented it in a language with contracts (JML – Java Modelling Language) o Tried to prove correctness using KeY theorem prover Some of the same contract mechanisms as C0 (and a few more) (we borrowed our contracts of them) Read more at http://www.envisage-project.eu/proving-android- java-and-python-sorting-algorithm-is-broken-and- how-to-fix-it/ 7

Piece of cake?  Implementing binary search is not as simple as it sounds o many professionals have failed!  We want to proceed carefully and methodically  Contracts will be our guide! 8

Binary Search 9

0 1 2 3 4 5 6 7 Binary A: 2 3 5 9 11 13 17 find midpoint of A[0,7) Search • index 3 • A[3] = 9 0 1 2 3 4 5 6 7 A: 2 3 5 9 11 13 17  A is sorted 4 < 9  Looking for • ignore A[4,7) • ignore also A[3] 0 1 2 3 4 5 6 7 x = 4 A: 2 3 5 9 11 13 17 find midpoint of A[0,3) • index 1 • A[1] = 3 0 1 2 3 4 5 6 7 A: 2 3 5 9 11 13 17 3 < 4 • ignore A[0,1) • ignore also A[1] 0 1 2 3 4 5 6 7 2 3 5 9 11 13 17 A: find midpoint of A[2,3) • index 2 • A[2] = 5 0 1 2 3 4 5 6 7 A: 2 3 5 9 11 13 17 4 < 5 • ignore A[3,3) • ignore also A[2] 0 1 2 3 4 5 6 7 A: 2 3 5 9 11 13 17 nothing left! • A[2,2) is empty • 4 isn’t in A 10

lo hi 0 1 2 3 4 5 6 7 Binary A: 2 3 5 9 11 13 17 find midpoint of A[lo,hi) Search • index mid = 3 mid lo hi • A[mid] = 9 0 1 2 3 4 5 6 7 A: 2 3 5 9 11 13 17  A[lo, hi) is sorted 4 < A[mid]  At each step, we • ignore A[mid+1,hi) hi lo • ignore also A[mid] 0 1 2 3 4 5 6 7 o examine a A: 2 3 5 9 11 13 17 find midpoint of A[lo,hi) segment A[lo, hi) • index mid = 1 mid lo hi o find its midpoint • A[mid] = 3 0 1 2 3 4 5 6 7 A: mid 2 3 5 9 11 13 17 A[mid] < 4 o compare x = 4 • ignore A[lo,mid) lo hi • ignore also A[mid] 0 1 2 3 4 5 6 7 with A[mid] 2 3 5 9 11 13 17 A: find midpoint of A[lo,hi) • index mid = 2 lo,mid hi • A[mid] = 5 0 1 2 3 4 5 6 7 A: 2 3 5 9 11 13 17 4 < A[mid] • ignore A[mid+1,hi) lo,hi • ignore also A[mid] 0 1 2 3 4 5 6 7 A: 2 3 5 9 11 13 17 nothing left! • A[lo,hi) is empty • 4 isn’t in A 11

lo hi 0 1 2 3 4 5 6 7 Binary A: 2 3 5 9 11 13 17 find midpoint of A[lo,hi) Search • index mid = 3 mid lo hi • A[mid] = 9 0 1 2 3 4 5 6 7 A: 2 3 5 9 11 13 17  Let’s look for A[mid] < 11 x = 11 • ignore A[lo,mid) hi lo • ignore also A[mid] 0 1 2 3 4 5 6 7  At each step, we A: 2 3 5 9 11 13 17 o examine a find midpoint of A[lo,hi) • index mid = 5 mid lo hi segment A[lo, hi) • A[mid] = 13 0 1 2 3 4 5 6 7 o find its midpoint A: 2 3 5 9 11 13 17 11 < A[mid] mid • ignore A[lo,mid) lo hi • ignore also A[mid] o compare x = 11 0 1 2 3 4 5 6 7 2 3 5 9 11 13 17 A: with A[mid] find midpoint of A[lo,hi) • index mid = 4 lo,mid hi • A[mid] = 11 0 1 2 3 4 5 6 7 A: 2 3 5 9 11 13 17 11 = A[mid] • found! • return 4 12

Implementing Binary Search 13

Setting up Binary Search Same contracts as search: different algorithm to solve int binsearch(int x, int[] A, int n) the same problem //@requires n == \length(A); //@requires is_sorted(A, 0, n); /*@ensures (\result == -1 && !is_in(x, A, 0, n)) || (0 <= \result && \result < n && A[\result] == x); @*/ { int lo = 0; lo starts at 0, int hi = n; hi at n while (lo < hi) { … bunch of } steps return -1; } returns -1 if x not found 14

What do we Know at Each Step?  At an arbitrary iteration, the picture is: Too small! If x is in A, Too big! it’s got to be here 0 lo hi n A: … … A[0, lo) < x x < A[hi, n)  These are candidate loop invariant: o gt_seg(x, A, 0, lo) : that’s A[0, lo) < x o lt_seg(x, A, hi, n): that’s x < A[hi, n) o and of course 0 <= lo && lo <= hi && hi <= n 15

0 ≤ lo ≤ hi ≤ n … … Adding Loop Invariants A[0, lo) < x x < A[hi, n) int binsearch(int x, int[] A, int n) //@requires n == \length(A); //@requires is_sorted(A, 0, n); /*@ensures (\result == -1 && !is_in(x, A, 0, n)) || (0 <= \result && \result < n && A[\result] == x); @*/ { int lo = 0; int hi = n; while (lo < hi) //@loop_invariant 0 <= lo && lo <= hi && hi <= n; //@loop_invariant gt_seg(x, A, 0, lo); //@loop_invariant lt_seg(x, A, hi, n); { … } return -1; } 16

Are these Useful Loop Invariants? 0 ≤ lo ≤ hi ≤ n Can they help prove the postcondition? … …  Is return -1 correct? A[0, lo) < x x < A[hi, n) (assuming invariants are valid)  To show : if preconditions are met, then x  A[0, n) A. lo ≥ hi by line 9 (negation of loop guard) B. lo ≤ hi by line 10 (LI 1) 1. int binsearch(int x, int[] A, int n) C. lo = hi by math on A, B 2. //@requires n == \length(A); 3. //@requires is_sorted(A, 0, n); D. A[0,lo) < x by line 11 (LI 2) 4. /*@ensures (\result == -1 && !is_in(x, A, 0, n)) E. x  A[0,lo) by math on D || (0 <= \result && \result < n && A[\result] == x); @*/ 5. 6. { F. x < A[hi,n) by line 12 (LI 3) int lo = 0; 7. G. x  A[hi,n) by math on F int hi = n; 8. while (lo < hi) 9. H. x  A[0,n) by math on C, E, G 10. //@loop_invariant 0 <= lo && lo <= hi && hi <= n;  11. //@loop_invariant gt_seg(x, A, 0, lo); 12. //@loop_invariant lt_seg(x, A, hi, n);  This is a standard EXIT 13. { … 14. argument 15. } 16. return -1; 17. } 17

Are the Loop Invariants Valid? 0 ≤ lo ≤ hi ≤ n INIT … … o lo = 0 by line 7 and hi = n by line 8 A[0, lo) < x x < A[hi, n)  To show : 0 ≤ 0 by math  To show : 0 ≤ n by line 2 (preconditions) and \length  To show : n ≤ n by math 1. int binsearch(int x, int[] A, int n)  To show : A[0, 0) < x 2. //@requires n == \length(A); 3. //@requires is_sorted(A, 0, n);  To show : x < A[n, n) 4. /*@ensures (\result == -1 && !is_in(x, A, 0, n))  by math (empty intervals) || (0 <= \result && \result < n && A[\result] == x); @*/  5. 6. { int lo = 0; 7. PRES int hi = n; 8. while (lo < hi) 9.  Trivial 10. //@loop_invariant 0 <= lo && lo <= hi && hi <= n; 11. //@loop_invariant gt_seg(x, A, 0, lo); o body is empty 12. //@loop_invariant lt_seg(x, A, hi, n); 13. { o nothing changes!!! … 14. 15. }  16. //@assert lo == hi; from correctness 17. return -1; proof 18. } 18

Binary Search Searching an Array 1 Linear Search Go through the - PowerPoint PPT Presentation

Binary Search Searching an Array 1 Linear Search Go through the array position by position until we find x int search(int x, int[] A, int n) //@requires n == \length(A); /*@ensures (\result == -1 && !is_in(x, A, 0, n)) || (0

Balanced Search Trees Binary Search Trees Binary Search Tree Binary Search Tree A binary tree is

Sorting Divide and Conquer 1 Searching an n -element Array Linear Search Binary Search

Binary Search Trees and Balanced Binary Search Trees using AVL Trees Mark Redekopp David Kempe

CMSC 206 Binary Search Trees 1 Binary Search Tree n A Binary Search Tree is a Binary Tree in

Searching Elements in an Array: Linear and Binary Search Spring Semester 2007 Programming and

Chapter 5 Searching and Binary Search Trees 5.1 Searching sequence The purpose of searching :

Binary Numbers Binary numbers look like this Binary Numbers or Binary Code Binary numbers or

A Quick Review Decimal to binary Binary to decimal Binary to hexadecimal

Hashing Searching Consider the problem of searching an array for a given value If the

singly linked lists Sept. 18, 2017 1 Recall last lecture: Java array array array array of

Outline Searching Computers Computers Computers Topic 2: Searching Topic 2: Searching Topic

Binary Search Trees A binary search tree is a binary tree T such that - each internal node

Searching Binary search (log n ) Better Can we achieve (1) search time? Chapter 9

Searching Consider the problem of searching an array for a given value Hashing If the

This is a slide Group Philsophy Binary Search Tree (BST) A binary search tree 12 5 15 3

binary search trees Oct. 30, 2017 1 (binary search) tree binary (search tree) 2 class

Trees a tree represents a hierarchy - organization structure of a corporation Electronics

Timing Sign-off for Selective Voltage Binning Vladimir Zolotov*, Eric Foreman, Jeffrey Hemmett,

CT 320: Network and System 2 Booting Administration 1. Bootstrap loaders 2. Run levels 3.

Balanced Allocation with Random Walk Based Sampling Dengwang Tang Electrical and Computer

Extended Binary Trees Recurrence relations After today, you should be able to explain what

CS171 Introduction to Computer Science II Priority Queues and Binary Heap Priority Queues and

Binarylevel program analysis: Executable File Formats Gang Tan CSE 597 Spring 2019 Penn

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Maximilian Kasy

Sambuz

Useful Links

Newsletter

Mail Us