Week 2: Greedy Algorithms, CSC373 (373F19), Karan Singh


SLIDE 1

CSC373 Week 2: Greedy Algorithms

373F19 - Karan Singh 1

Karan Singh

SLIDE 2

Recap

  • Divide & Conquer
  • Master theorem
  • Counting inversions in O(n log n)
  • Finding the closest pair of points in ℝ² in O(n log n)
  • Fast integer multiplication in O(n^{log₂ 3})
  • Fast matrix multiplication in O(n^{log₂ 7})
  • Finding the kth smallest element (in particular, the median) in O(n)

SLIDE 3

Greedy Algorithms

  • Greedy (also known as myopic) algorithm outline
  • We want to find a solution x that maximizes some objective function f
  • But the space of possible solutions x is too large
  • The solution x is typically composed of several parts (e.g. x may be a set, composed of its elements)

  • Instead of directly computing x…
  • Compute it one part at a time
  • Select the next part "greedily" to get the maximum immediate benefit (this needs to be defined carefully for each problem)

  • May not be optimal because there is no foresight
  • But sometimes this can be optimal too!
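The outline above can be sketched as generic code; everything here (the function names, the toy instance) is illustrative, not from the slides:

```python
# Generic greedy skeleton: build the solution one part at a time, always
# taking the part with the best immediate benefit that keeps the partial
# solution feasible. No foresight: a taken or rejected part is never revisited.

def greedy(parts, benefit, compatible):
    solution = []
    for part in sorted(parts, key=benefit, reverse=True):
        if compatible(solution, part):
            solution.append(part)
    return solution

# Toy instance: pick numbers with total at most 10, preferring large ones.
print(greedy([4, 7, 2, 5], lambda x: x, lambda sol, x: sum(sol) + x <= 10))
```

Each problem below instantiates this template with a different ordering ("benefit") and a different compatibility check.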
SLIDE 4

Interval Scheduling

  • Problem
  • Job j starts at time s_j and finishes at time f_j
  • Two jobs are compatible if they don't overlap
  • Goal: find a maximum-size subset of mutually compatible jobs
SLIDE 5

Interval Scheduling

  • Greedy template
  • Consider jobs in some "natural" order
  • Take each job if it's compatible with the ones already chosen

  • What order?
  • Earliest start time: ascending order of s_j
  • Earliest finish time: ascending order of f_j
  • Shortest interval: ascending order of f_j − s_j
  • Fewest conflicts: ascending order of c_j, where c_j is the number of remaining jobs that conflict with j

SLIDE 6

Example

  • Earliest start time: ascending order of s_j
  • Earliest finish time: ascending order of f_j
  • Shortest interval: ascending order of f_j − s_j
  • Fewest conflicts: ascending order of c_j, where c_j is the number of remaining jobs that conflict with j

SLIDE 7

Interval Scheduling

  • Does it work?

(Figure: counterexamples for earliest start time, shortest interval, and fewest conflicts)

SLIDE 8

Interval Scheduling

  • Implementing greedy with earliest finish time (EFT)
  • Sort jobs by finish time. Say f_1 ≤ f_2 ≤ ⋯ ≤ f_n
  • When deciding whether job j should be included, we need to check whether it's compatible with all previously added jobs
  • We only need to check if s_j ≥ f_{i*}, where i* is the last added job
  • This is because for any job i added before i*, f_i ≤ f_{i*}
  • So we can simply store and maintain the finish time of the last added job
  • Running time: O(n log n)
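The EFT implementation described above fits in a few lines; a minimal sketch, assuming jobs are represented as (start, finish) pairs:

```python
# Greedy interval scheduling with earliest finish time (EFT).

def interval_schedule(jobs):
    """Return a maximum-size set of mutually compatible jobs."""
    # Sort by finish time: f_1 <= f_2 <= ... <= f_n  -- the O(n log n) step.
    jobs = sorted(jobs, key=lambda job: job[1])
    chosen = []
    last_finish = float("-inf")  # finish time of the last added job, f_{i*}
    for start, finish in jobs:
        # Job j is compatible with everything chosen so far iff s_j >= f_{i*}.
        if start >= last_finish:
            chosen.append((start, finish))
            last_finish = finish
    return chosen

# Toy instance: greedy picks three of the four jobs.
print(interval_schedule([(1, 2), (3, 4), (0, 6), (5, 7)]))
```

Note that only one finish time is kept, exactly as the slide argues; no scan over previously chosen jobs is needed.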
SLIDE 9

Interval Scheduling

  • Optimality of greedy with EFT
  • Suppose for contradiction that greedy is not optimal
  • Say greedy selects jobs i_1, i_2, …, i_k, sorted by finish time
  • Consider the optimal solution j_1, j_2, …, j_m (also sorted by finish time) which matches greedy for as long as possible
  • That is, we want i_1 = j_1, …, i_r = j_r for the greatest possible r
SLIDE 10

Interval Scheduling

  • Optimality of greedy with EFT
  • Both i_{r+1} and j_{r+1} were compatible with the previous selection (i_1 = j_1, …, i_r = j_r)
  • Consider the solution i_1, i_2, …, i_r, i_{r+1}, j_{r+2}, …, j_m
  • It is still feasible (since f_{i_{r+1}} ≤ f_{j_{r+1}})
  • It is still optimal
  • And it matches greedy for one more step (contradiction!)

Another standard proof method is induction.

SLIDE 11

Interval Partitioning

  • Problem
  • Job j starts at time s_j and finishes at time f_j
  • Two jobs are compatible if they don't overlap
  • Goal: group the jobs into the fewest partitions such that jobs in the same partition are compatible

  • One idea
  • Find a maximum compatible set using the previous greedy EFT algorithm, call it one partition, and recurse on the remaining jobs
  • Doesn't work (check for yourselves)
SLIDE 12

Interval Partitioning

  • Think of scheduling lectures for various courses into as few classrooms as possible
  • This schedule uses 4 classrooms for scheduling 10 lectures

SLIDE 13

Interval Partitioning

  • Think of scheduling lectures for various courses into as few classrooms as possible
  • This schedule uses 3 classrooms for scheduling 10 lectures

SLIDE 14

Interval Partitioning

  • Let's go back to the greedy template!
  • Go through the lectures in some "natural" order
  • Assign each lecture to a compatible classroom (which?), and create a new classroom if the lecture conflicts with every existing classroom

  • Order of lectures?
  • Earliest start time: ascending order of s_j
  • Earliest finish time: ascending order of f_j
  • Shortest interval: ascending order of f_j − s_j
  • Fewest conflicts: ascending order of c_j, where c_j is the number of remaining jobs that conflict with j

SLIDE 15

Interval Partitioning

  • At least when you assign each lecture to an arbitrary feasible classroom, three of these heuristics do not work
  • The fourth one works! (next slide)

SLIDE 16

Interval Partitioning

SLIDE 17

Interval Partitioning

  • Running time
  • Key step: check whether the next lecture can be scheduled in some classroom
  • Store the classrooms in a priority queue
  • key = finish time of the classroom's last lecture
  • Is lecture j compatible with some classroom?
  • Same as "Is s_j at least as large as the minimum key?"
  • If yes: add lecture j to the classroom with the minimum key, and increase its key to f_j
  • Otherwise: create a new classroom, add lecture j, and set its key to f_j
  • O(n) priority queue operations, O(n log n) time
SLIDE 18

Interval Partitioning

  • Proof of optimality (lower bound)
  • # classrooms needed ≥ maximum "depth" at any point in time
  • depth = number of lectures running at that time
  • We now show that our greedy algorithm uses only this many classrooms!
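The depth in the lower bound can be computed directly with a sweep over interval endpoints; a small sketch (not from the slides):

```python
# Maximum "depth" of a set of intervals: the largest number of lectures
# running at any single point in time.

def max_depth(lectures):
    events = []
    for start, finish in lectures:
        events.append((start, 1))    # a lecture begins: depth goes up
        events.append((finish, -1))  # a lecture ends: depth goes down
    # At equal times, process ends before starts, so touching intervals
    # (one ends exactly when the other begins) do not count as overlapping.
    events.sort(key=lambda e: (e[0], e[1]))
    depth = best = 0
    for _, delta in events:
        depth += delta
        best = max(best, depth)
    return best

print(max_depth([(0, 3), (1, 4), (2, 5), (4, 6)]))
```

Comparing this value with the classroom count of the greedy algorithm on any instance illustrates the equality the next slide proves.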

SLIDE 19

Interval Partitioning

  • Proof of optimality (upper bound)
  • Let d = # classrooms used by greedy
  • Classroom d was opened because there was a lecture j that was incompatible with some lecture already scheduled in each of the d − 1 other classrooms
  • All of these d − 1 lectures end after s_j
  • Since we sorted by start time, they all start at/before s_j
  • So at time s_j, we have d overlapping lectures (these d − 1 plus lecture j itself)
  • Hence, depth ≥ d
  • So all schedules use ≥ d classrooms
  • QED!
SLIDE 20

Interval Graphs

  • Interval scheduling and interval partitioning can be seen as graph problems

  • Input
  • Graph G = (V, E)
  • Vertices V = jobs/lectures
  • Edge (i, j) ∈ E if jobs i and j are incompatible

  • Interval scheduling = maximum independent set (MIS)
  • Interval partitioning = graph colouring
SLIDE 21

Interval Graphs

  • MIS and graph colouring are NP-hard for general graphs
  • But they're efficiently solvable for interval graphs
  • Interval graphs = graphs that can be obtained from incompatibility of intervals
  • In fact, this holds even when we are not given an interval representation of the graph

  • Can we extend this result further?
  • Yes! Chordal graphs
  • Every cycle with 4 or more vertices has a chord
SLIDE 22

Minimizing Lateness

  • Problem
  • We have a single machine
  • Each job j requires t_j units of processing time and is due by time d_j
  • If it's scheduled to start at time s_j, it will finish at f_j = s_j + t_j
  • Lateness: ℓ_j = max{0, f_j − d_j}
  • Goal: minimize the maximum lateness, L = max_j ℓ_j
  • Minimizing total lateness is NP-complete

  • Contrast with interval scheduling
  • We can decide the start time
  • All jobs must be scheduled on a single machine
SLIDE 23

Minimizing Lateness

  • Example

(Figure: an input instance and an example schedule)

SLIDE 24

Minimizing Lateness

  • Let's go back to the greedy template
  • Consider jobs one by one in some "natural" order
  • Schedule the jobs in this order (nothing special to do here, since we have to schedule all jobs and there is only one machine available)

  • Natural orders?
  • Shortest processing time first: ascending order of processing time t_j
  • Earliest deadline first: ascending order of due time d_j
  • Smallest slack first: ascending order of d_j − t_j
SLIDE 25

Minimizing Lateness

  • Counterexamples
  • Shortest processing time first
  • Ascending order of processing time t_j
  • Smallest slack first
  • Ascending order of d_j − t_j
SLIDE 26

Minimizing Lateness

  • By now, you should know what's coming…
  • We'll prove that earliest deadline first works!
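Earliest deadline first itself is a one-line sort; a minimal sketch, assuming jobs are (t_j, d_j) pairs and the machine starts at time 0:

```python
# Earliest deadline first (EDF) for minimizing the maximum lateness.

def max_lateness_edf(jobs):
    """Schedule jobs back-to-back by ascending deadline; return L = max_j l_j."""
    time = 0   # no idle time: each job starts when the previous one finishes
    worst = 0
    for t, d in sorted(jobs, key=lambda job: job[1]):  # ascending d_j
        time += t                     # job finishes at f_j = s_j + t_j
        worst = max(worst, time - d)  # lateness l_j = max{0, f_j - d_j}
    return max(worst, 0)

# Two jobs where shortest-processing-time-first incurs lateness 1, but EDF
# meets every deadline.
print(max_lateness_edf([(1, 100), (10, 10)]))
```

Note the schedule has no idle time by construction, matching Observation 2 on the next slides.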

SLIDE 27

Minimizing Lateness

  • Observation 1
  • There is an optimal schedule with no idle time
SLIDE 28

Minimizing Lateness

  • Observation 2
  • Earliest deadline first has no idle time

  • Let us define an "inversion"
  • (i, j) such that d_i < d_j but j is scheduled before i

  • Observation 3
  • By definition, earliest deadline first has no inversions

  • Observation 4
  • If a schedule with no idle time has an inversion, it has a pair of inverted jobs scheduled consecutively

SLIDE 29

Minimizing Lateness

  • Claim
  • Swapping adjacently scheduled inverted jobs doesn't increase the maximum lateness, but reduces #inversions by one

  • Proof
  • Let (i, j) be the adjacent inverted pair: d_i < d_j, with j scheduled immediately before i
  • Let ℓ and ℓ′ denote the lateness before/after the swap
  • Clearly, ℓ_k = ℓ′_k for all k ≠ i, j
  • Also, clearly, ℓ′_i ≤ ℓ_i (job i only moves earlier)
SLIDE 30

Minimizing Lateness

  • Claim
  • Swapping adjacently scheduled inverted jobs doesn't increase the maximum lateness, but reduces #inversions by one

  • Proof (continued)
  • ℓ′_j = f′_j − d_j = f_i − d_j ≤ f_i − d_i = ℓ_i
  • L′ = max{ℓ′_i, ℓ′_j, max_{k ≠ i,j} ℓ′_k} ≤ max{ℓ_i, ℓ_i, max_{k ≠ i,j} ℓ_k} ≤ L

SLIDE 31

Minimizing Lateness

  • Proof of optimality of earliest deadline first
  • Suppose for contradiction that it's not optimal
  • Consider an optimal schedule S* which has the fewest inversions among all optimal schedules
  • We can assume it has no idle time
  • If S* has zero inversions, it's exactly earliest deadline first
  • So assume S* has at least one inversion
  • Then it must have an adjacent inversion (i, j)
  • But swapping these jobs doesn't increase the maximum lateness (so the new schedule stays optimal) and reduces the number of inversions by 1
  • Contradiction, given that S* has the fewest inversions among all optimal schedules
  • QED!
SLIDE 32

Lossless Compression

  • Problem
  • We have a document that is written using n distinct labels
  • Naïve encoding: represent each label using k = ⌈log₂ n⌉ bits
  • If the document has length m, this uses m⌈log₂ n⌉ bits
  • Say for English documents with no punctuation etc., we have n = 26, so we can use 5 bits:
  • a = 00000
  • b = 00001
  • c = 00010
  • d = 00011
  • …
SLIDE 33

Lossless Compression

  • Is this optimal?
  • What if some letters (say a, e, r) are much more frequent in the document than others (say q, x, z)?
  • Can we assign shorter codes to more frequent letters?
  • Say we assign…
  • a = 0, b = 1, c = 01, …
  • See a problem?
  • What if we observe the encoding '01'?
  • Is it 'ab'? Or is it 'c'?
SLIDE 34

Lossless Compression

  • To avoid conflicts, we need a prefix-free encoding
  • Map each label x to a bit-string c(x) such that for all distinct labels x and y, c(x) is not a prefix of c(y)
  • Then it's impossible for one codeword c(x) to sit at the start of another codeword c(y)
  • So we can read left to right, find the first point where the bits read so far form a valid codeword, decode that label, and continue
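The left-to-right decoding rule can be sketched directly; the code table used here is the one derived on the later slides:

```python
# Decoding a prefix-free code left to right: emit a symbol at the first
# point where the buffered bits form a valid codeword.

CODE = {"a": "0", "e": "100", "f": "101", "c": "1100", "d": "1101", "b": "111"}

def decode(bits, code):
    """Prefix-freeness guarantees the first codeword match is the right one."""
    inverse = {word: sym for sym, word in code.items()}
    out, buf = [], ""
    for bit in bits:
        buf += bit
        if buf in inverse:       # first point where buf is a valid codeword
            out.append(inverse[buf])
            buf = ""
    assert buf == "", "input was not a whole sequence of codewords"
    return "".join(out)

print(decode("0100111", CODE))
```

If the code were not prefix-free (e.g. a = 0, c = 01 from the previous slide), the "first valid codeword" rule would be ambiguous, which is exactly the '01' problem.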

SLIDE 35

Lossless Compression

  • Formal problem
  • Given n symbols and their frequencies (w_1, …, w_n), find a prefix-free encoding that assigns codeword lengths (ℓ_1, …, ℓ_n) to the symbols and minimizes Σ_{i=1}^{n} w_i · ℓ_i
  • Note that Σ_{i=1}^{n} w_i · ℓ_i is the length of the compressed document

  • Example
  • (w_a, w_b, w_c, w_d, w_e, w_f) = (42, 20, 5, 10, 11, 12)
  • No need to remember the numbers 🙂
SLIDE 36

Lossless Compression

  • Observation: prefix-free encoding = binary tree

a → 0, e → 100, f → 101, c → 1100, d → 1101, b → 111

SLIDE 37

Lossless Compression

  • Huffman Coding
  • Build a priority queue by adding (x, w_x) for each symbol x
  • While |queue| ≥ 2:
  • Take the two symbols with the lowest weights, (x, w_x) and (y, w_y)
  • Merge them into one symbol with weight w_x + w_y
  • Let's see this on the previous example
SLIDE 38

Lossless Compression

SLIDE 39

Lossless Compression

SLIDE 40

Lossless Compression

SLIDE 41

Lossless Compression

SLIDE 42

Lossless Compression
SLIDE 43

Lossless Compression

  • Final outcome

a → 0, e → 100, f → 101, c → 1100, d → 1101, b → 111

SLIDE 44

Lossless Compression

  • Running time
  • O(n log n)
  • Can be made O(n) if the labels are given to you sorted by their frequencies

  • Proof of optimality
  • Induction on the number of symbols n
  • Base case: for n = 2, there are only two possible encodings; both are optimal, assigning 1 bit to each symbol
  • Hypothesis: assume Huffman returns an optimal encoding for n − 1 symbols

SLIDE 45

Lossless Compression

  • Proof of optimality
  • Consider the case of n symbols
  • Lemma 1: If w_x < w_y, then ℓ_x ≥ ℓ_y in any optimal tree
  • Proof sketch: otherwise, swapping x and y would strictly reduce the overall length (exercise!)
  • Lemma 2: There is an optimal tree T* in which the two least frequent symbols are siblings
  • Proof sketch: first prove that they must both be assigned the longest codeword length. Then, if they're not siblings, chop and rearrange the tree to make them siblings (exercise!)
  • Now we can compare the tree H produced by Huffman against such an optimal tree T*

SLIDE 46

Lossless Compression

  • Proof of optimality
  • Let x and y be the two least frequent symbols
  • In Huffman, we combine them in the first step into one symbol "xy"
  • Let H′ and T*′ be the trees obtained from H and T* by treating "xy" as one symbol with frequency w_x + w_y
  • Use the induction hypothesis: length(H′) ≤ length(T*′)
  • length(H) = length(H′) + (w_x + w_y) · 1
  • length(T*) = length(T*′) + (w_x + w_y) · 1
  • Hence length(H) ≤ length(T*)
  • QED!
SLIDE 47

Other Greedy Algorithms

  • If you aren't familiar with the following algorithms, spend some time checking them out!
  • Dijkstra's shortest path algorithm
  • Kruskal's and Prim's minimum spanning tree algorithms