Chapter 04: Recurrences (Divide and Conquer). The MergeSort algorithm - PDF document

Chapter 04: Recurrences (Divide and Conquer). The MergeSort algorithm . Merge( A, p, q, r ) { MergeSort( A, p, r ) { (2) n 1 = q − p + 1; n 2 = r − q ; (1) if p < r { ( n 1 + 1) for i = 1 to n 1 (1) q = ⌊ ( p + r ) / 2 ⌋ ; ( n 1 ) L [ i ] = A [ p + i − 1]; (1) MergeSort( A ( p, q )); ( n 2 + 1) for j = 1 to n 2 (1) MergeSort( A ( q + 1 , r ); ( n 2 ) R [ j ] = A [ q + j ]; (1) Merge( A, p, q, r ); (2) L [ n 1 + 1] = ∞ ; R [ n 2 + 1] = ∞ ; } (2) i = 1; j = 1; ( r − p + 2) for k = p to r (3( r − p + 1)) if L [ i ] ≤ R [ j ] { A [ k ] = L [ i ]; i + +; } else { A [ k ] = R [ j ]; j + +; } The Merge problem size is n ′ = n 1 + n 2 = r − p + 1. The Merge count, T merge ( n ′ ), and the MergeSort count, T ( n ), are then T merge ( n ′ ) = 2 n 1 + 2 n 2 + 8 + ( n ′ + 1) + 3 n ′ = 6 n ′ + 9 � 5 + 2 T � n � n � � + 6 n + 9 = 2 T + 6 n + 14 , n > 1 2 2 T ( n ) = 1 , n = 1 , The base case ( n = 1) follows because only the “if p < r ” test in the driver is executed when n − 1. 1

T merge ( n ′ ) = 2 n 1 + 2 n 2 + 8 + ( n ′ + 1) + 3 n ′ = 6 n ′ + 9 � 5 + 2 T � n � n � � + 6 n + 9 = 2 T + 6 n + 14 , n > 1 2 2 T ( n ) = 1 , n = 1 , Note: As stated, the expression for T ( n ) is approximate because the size of the two recursive calls to MergeSort may differ by one. If n is even, then the expression is exact; both calls have arguments of size n/ 2. If n is odd, the first call has an argument which is one larger than the second call. 2

The expression for T ( n ) is called a recurrence . � n � T ( n ) = 2 T + 6 n + 14 . 2 As a first attempt to solve for T ( n ), we “unwind” the recurrence, restricting our attention to the subsequence n = 2 k , for k = 0 , 1 , 2 , . . . . Why the restriction? Because the progression n → n/ 2 → n/ 2 2 → . . . → 1 never encounters any fractions. Hence the recurrence is no longer approximate, provided n → ∞ along this particular sequence. Note the pattern: For any 2 m > 1, we have T (2 m ) = 2 T (2 m − 1 ) + 6 · 2 m + 14. Starting with some n = 2 k , we apply the pattern sequentially as follows. T (2 k ) = 2 T (2 k − 1 ) + 6 · 2 k + 14 = 2[2 T (2 k − 2 ) + 6 · 2 k − 1 + 14] + 6 · 2 k + 14 = 2 2 T (2 k − 2 ) + 6 · 2 · 2 k + 14(1 + 2) = 2 2 [2 T (2 k − 3 ) + 6 · 2 k − 2 + 14] + 6 · 2 · 2 k + 14(1 + 2) = 2 3 T (2 k − 3 ) + 6 · 3 · 2 k + 14(1 + 2 + 4) = . . . = 2 k T (2 0 ) + 6 · k · 2 k + 14(1 + 2 + 4 + . . . + 2 k − 1 ) k − 1 2 j = 2 k T (1) + 6 · k · 2 k + 14 · 2 k − 1 = 2 k T (1) + 6 · k · 2 k + 14 � 2 − 1 j =0 = [ T (1) + 14]2 k + 6 k 2 k − 14 . T (1) is a constant, K 1 , and k = lg n . So, in terms of n , T ( n ) = ( K 1 + 14) n + 6 n lg n − 14 n lg n = 6 + K 1 + 14 T ( n ) 14 − n lg n → 6 lg n T ( n ) n lg n ≤ 7 , for n = 2 k sufficiently large . 5 ≤ 3

So, T ( n ) ∈ Θ( n lg n ), provided n grows through the restricted sequence 1 , 2 , 4 , 8 , . . . . Can we extend the result to the conventional sequence n = 1 , 2 , 3 , . . . ? If we maintain separate terms for the recursive calls, we have �� n � n �� T ( n ) = T + T n − + 6 n + 14 2 2 �� n �� n �� = T + T + 6 n + 14 . 2 2 Continuing with T (1) = K 1 , we compute T ( n ) for some initial values. T (1) = K 1 T (2) = T (1) + T (1) + 6(2) + 14 = 2 K 1 + 26 T (3) = T (2) + T (1) + 6(3) + 14 = K 1 + 2 K 1 + 26 + 32 = 3 K 1 + 58 T (4) = T (2) + T (2) + 6(4) + 14 = 4 K 1 + 52 + 38 = 4 K 1 + 90 . We observe that T ( n ) is monotonically increasing, and we could rigorously establish this result by induction. 4

Now consider an arbitrary n , situated in an interval [2 k , 2 k +1 ) for which the above partial result holds. That is, 2 k lg 2 k = T (2 k ) T (2 k ) 5 ≤ 2 k · k ≤ 7 T (2 k +1 ) T (2 k +1 ) 5 ≤ 2 k +1 lg 2 k +1 = 2 k +1 · ( k + 1) ≤ 7 . Starting with the monotone property of T , we reason that T ( n ) ≤ T (2 k +1 ), for n ∈ [2 k , 2 k +1 ), and continue as follows. T ( n ) ≤ T (2 k +1 ) ≤ 7 · [2 k +1 · ( k + 1)] ≤ 7 · [2 k +1 (2 k )] = 7 · 4 · k 2 k � � � � = 28 x lg x x =2 k ≤ 28 x lg x x = n = 28 n lg n, � � � � ince x lg x is also monotone increasing. So, T ( n ) ∈ O ( n lg n ) without restriction. For a lower bound, T ( n ) ≥ T (2 k ) ≥ 5 · [2 k · k ] ≥ 5 2 · 2 k +1 · k ≥ 5 2 · 2 k +1 · k + 1 2 = 5 4 · ( k + 1)2 k +1 = 5 x =2 k +1 ≥ 5 x = n = 5 � � � � x lg x x lg x 4 n lg n. � � 4 4 � � Thus, we have 5 4 ≤ T ( n ) n lg n ≤ 28 , for all sufficiently large n. So, T ( n ) ∈ Ω( n lg n ). Together with the upper bound above, T ( n ) ∈ Θ( n lg n ) without restriction. 5

Master Template for recurrences Suppose T ( n ) > 0 is a function satisfying the following recurrence, and suppose f ( n ) ≥ 0. � n � T ( n ) = aT + f ( n ) , if n ≥ b b T ( n ) ≤ K, for 1 ≤ n < b, where a ≥ 1 , b > 1, and n/b can be either ⌊ n/b ⌋ or ⌈ n/b ⌉ . Then each of the following three cases provides an immediate solution for the recurrence. (a) weak glue : If ∃ ǫ > 0 with f ( n ) = O ( n [log b a ] − ǫ ), then T ( n ) = Θ( n log b a ). (b) medium glue: If f ( n ) = Θ( n log b a ), then T ( n ) = Θ( n log b a · lg n ). (c) strong glue: If ∃ ǫ > 0 with f ( n ) = Ω( n [log b a ]+ ǫ ) and ∃ c with 0 < c < 1 , af ( n/b ) ≤ cf ( n ) for all sufficiently large n , then T ( n ) = Θ( f ( n )). Observations. 1. We will call f ( n ) the “glue function” as it represents the work required to prepare for the recursive calls and to assemble their returned results into a solution to the overall problem. 2. We will call the polynomial g ( n ) = n log b a the “reference function.” It represents the work done in establishing the tree of recursive calls. In the context of algorithm analysis, the recurrence represents a recursive algorithm that divides its input into b parts and recursively considers a of them. 3. The variations g − ( n ) = n (log b a ) − ǫ and g + ( n ) = n (log b a )+ ǫ are used in cases (a) and (c), respectively, of the template. We refer to these variations as “exponentially reduced” and “exponentially enhanced” versions of the reference. 6

Example . For MergeSort: T ( n ) = 2 T ( n/ 2) + f ( n ), where f ∈ Θ( n ). This glue function then satisfies K 1 n ≤ f ( n ) ≤ K 2 n for all sufficiently large n . The reference function is g ( n ) = n log 2 2 = n 1 = n . Case (a) of the master template reads: weak glue : If ∃ ǫ > 0 with f ( n ) = O ( n [log b a ] − ǫ ), then T ( n ) = Θ( n log b a ). To check out case (a), let g − ( n ) = n 1 − ǫ be an exponentially reduced version of the reference. For large n , we have g − ( n ) ≥ K 1 n f ( n ) n 1 − ǫ = K 1 n ǫ → ∞ , which implies f ( n ) � = O ( n 1 − ǫ ) for any ǫ > 0. Case (a) fails. Case (b) of the master template reads: medium glue: If f ( n ) = Θ( n log b a ), then T ( n ) = Θ( n log b a · lg n ). To check out case (b), we investigate the ratio of the glue function to the reference function: 0 < K 1 ≤ K 1 n ≤ f ( n ) g ( n ) ≤ K 2 n = K 2 < ∞ , n n which implies f ∈ Θ( g ). Case (b) succeeds, and therefore T ( n ) = Θ( n lg n ). 7

Case (c) of the master template reads: strong glue: If ∃ ǫ > 0 with f ( n ) = Ω( n [log b a ]+ ǫ ) and ∃ c with 0 < c < 1 , af ( n/b ) ≤ cf ( n ) for all sufficiently large n , then T ( n ) = Θ( f ( n )). In the problem at hand, if g + = n 1+ ǫ any exponentially enhanced reference, we have g + ( n ) ≤ K 2 n f ( n ) n 1+ ǫ = K 2 n ǫ → 0 . For any ǫ > 0, there is no possible positive lower bound under f/g + . Therefore f �∈ Ω( g + ) and case (c) fails. In general, you can check out the cases in the master template in any order. If you find one that works, then you can ignore all of the others. The homework asks you to prove that the three cases of the master template are mutually exclusive. That is, if one of the cases succeeds, the other two must fail. 8

Example. Suppose T ( n ) = 2 T ( n/ 3) + √ n. We have f ( n ) = n 1 / 2 , and g ( n ) = n log 3 2 . Note that 0 . 5 = log 3 3 1 / 2 = log 3 1 . 732 < log 3 2. Choose ǫ as shown above. Then n 1 / 2 f ( n ) g − ( n ) = n (log 3 2) − ǫ → 0 . Thus f ∈ o ( g − ), which implies f ∈ O ( g − ), which enables case (a), which gives T ( n ) = Θ( n log 3 2 ). 9

Example. T ( n ) = 49 T ( n/ 25) + n 3 / 2 lg n . Here a = 49 , b = 25, so log 25 49 = 1 . 209. The reference is g ( n ) = n 1 . 209 . The glue is f ( n ) = n 1 . 5 lg n . g + ( n ) = n 1 . 5 log 2 n f ( n ) = n 0 . 291 − ǫ log 2 n → ∞ n 1 . 209+ ǫ as n → ∞ for any 0 < ǫ < 0 . 291. That is, f ( n ) = ω ( n 1 . 209+ ǫ ), which implies f ( n ) = Ω( n 1 . 209+ ǫ ). This leads to a possible application of case (c) of the template. There is, however, a regularity condition that must be satisfied: af ( n/b ) ≤ cf ( n ) for some constant 0 < c < 1 and all sufficiently large n . In the problem at hand, � n � 3 / 2 · lg n af ( n/b ) = 49 f ( n/ 25) = 49 · 25 25 49 n 3 / 2 (25) 3 / 2 · [(lg n ) − (lg 25)] ≤ 49 125 n 3 / 2 lg n = 49 = 125 f ( n ) . Consequently, the value c = 49 / 125 satisfies the regularity condition. Then T ( n ) = Θ( f ( n )) = Θ( n 3 / 2 lg n ) via case (c) of the template. 10

Chapter 04: Recurrences (Divide and Conquer). The MergeSort algorithm - PDF document

Chapter 04: Recurrences (Divide and Conquer). The MergeSort algorithm . Merge( A, p, q, r ) { MergeSort( A, p, r ) { (2) n 1 = q p + 1; n 2 = r q ; (1) if p < r { ( n 1 + 1) for i = 1 to n 1 (1) q = ( p + r ) / 2 ; ( n 1 ) L [

Week 3 Oliver Kullmann Divide-and- Conquer Solving Recurrences Merge Sort Solving

Divide-Conquer-Glue Algorithms Divide-and-conquer. Divide up problem into several subproblems.

Divide-Conquer-Glue Algorithms Divide-and-conquer. Mergesort and Counting Inversions Divide

CS 240A : Divide-and-Conquer with Cilk++ Divide & Conquer Paradigm Solving recurrences

4.4 Growth Rates of Solutions to Recurrences Divide and Conquer Algorithms One of the most basic

Divide and conquer 1 The main idea for the divide and conquer is trying to divide a problem into

Divide and Conquer CSE 421 Algorithms Richard Anderson Lecture 12 Recurrences and Divide and

Week 2 Growth of Functions Divide-and- Divide and Conquer Conquer Min-Max- Problem Tutorial

Divide and Conquer Algorithm Design Techniques Greedy Divide and Conquer Dynamic Programming

Plan for Today Finish recurrences Inversion Counting Closest Pair of Points Divide and Conquer

Divide and Conquer Summary Divide Identify one or more subproblems Conquer Solve

Divide and Conquer Algorithm Theory WS 2012/13 Fabian Kuhn Divide And Conquer Principle

Divide and conquer Philip II of Macedon Divide and conquer 1) Divide your problem into

Sorting (Part II: Divide and Conquer) CSE 373 Data Structures Lecture 14 Readings Reading

28: More Sorting Mergesort review analysis Lower bound on comparison-based sorting Mergesort: A

Divide and Conquer Algorithms Divide-and-Conquer The most-well known algorithm design strategy:

Institutionalizing Lessons Learned October 25, 2006 Loren Plisco Region II Background

Recursive Algorithms Department of Computer Science University of Maryland, College Park

Periodic Solutions of Abel Equation and Signal Reconstruction from Integral Measurements D.

role of unstable periodic orbits in the pattern formation of turbulent motion Jos Surez

A Realistic Evaluation of Memory Hardware Errors and Software System Susceptibility Xin Li 1

Securing EcmaScript 5: Adventures in Strategic Compromise Mark S. Miller and the Cajadores Thanks

On the Quantitative Semantics of Regular Expressions over Real-Valued Signals Alexey Bakhirkin

1 2 3 4 5 6 Go runtime deadlock detector 7 8 9 10 11 12 13 14 15 16 17 18 19 20

Chapter 04: Recurrences (Divide and Conquer). The MergeSort algorithm - PDF document

Chapter 04: Recurrences (Divide and Conquer). The MergeSort algorithm . Merge( A, p, q, r ) { MergeSort( A, p, r ) { (2) n 1 = q p + 1; n 2 = r q ; (1) if p < r { ( n 1 + 1) for i = 1 to n 1 (1) q = ( p + r ) / 2 ; ( n 1 ) L [

Week 3 Oliver Kullmann Divide-and- Conquer Solving Recurrences Merge Sort Solving

Divide-Conquer-Glue Algorithms Divide-and-conquer. Divide up problem into several subproblems.

Divide-Conquer-Glue Algorithms Divide-and-conquer. Mergesort and Counting Inversions Divide

CS 240A : Divide-and-Conquer with Cilk++ Divide &amp; Conquer Paradigm Solving recurrences

4.4 Growth Rates of Solutions to Recurrences Divide and Conquer Algorithms One of the most basic

Divide and conquer 1 The main idea for the divide and conquer is trying to divide a problem into

Divide and Conquer CSE 421 Algorithms Richard Anderson Lecture 12 Recurrences and Divide and

Week 2 Growth of Functions Divide-and- Divide and Conquer Conquer Min-Max- Problem Tutorial

Divide and Conquer Algorithm Design Techniques Greedy Divide and Conquer Dynamic Programming

Plan for Today Finish recurrences Inversion Counting Closest Pair of Points Divide and Conquer

Divide and Conquer Summary Divide Identify one or more subproblems Conquer Solve

Divide and Conquer Algorithm Theory WS 2012/13 Fabian Kuhn Divide And Conquer Principle

Divide and conquer Philip II of Macedon Divide and conquer 1) Divide your problem into

Sorting (Part II: Divide and Conquer) CSE 373 Data Structures Lecture 14 Readings Reading

28: More Sorting Mergesort review analysis Lower bound on comparison-based sorting Mergesort: A

Divide and Conquer Algorithms Divide-and-Conquer The most-well known algorithm design strategy:

Institutionalizing Lessons Learned October 25, 2006 Loren Plisco Region II Background

Recursive Algorithms Department of Computer Science University of Maryland, College Park

Periodic Solutions of Abel Equation and Signal Reconstruction from Integral Measurements D.

role of unstable periodic orbits in the pattern formation of turbulent motion Jos Surez

A Realistic Evaluation of Memory Hardware Errors and Software System Susceptibility Xin Li 1

Securing EcmaScript 5: Adventures in Strategic Compromise Mark S. Miller and the Cajadores Thanks

On the Quantitative Semantics of Regular Expressions over Real-Valued Signals Alexey Bakhirkin

1 2 3 4 5 6 Go runtime deadlock detector 7 8 9 10 11 12 13 14 15 16 17 18 19 20

CS 240A : Divide-and-Conquer with Cilk++ Divide & Conquer Paradigm Solving recurrences