[PPT] - 15-150 Fall 2020 Lecture 6 Stephen Brookes Most of the time I don't PowerPoint Presentation

SLIDE 1

15-150 Fall 2020 Lecture 6

Stephen Brookes

Most of the time I don't have much fun. The rest of the time I don't have any fun at all.

SLIDE 2

today

Sorting an integer list

Using specifications

to guide program design

“Helper functions should help!”

datatype definitions boolean connectives case expressions <> means ≠

SML features

. . . .

SLIDE 3

datatypes

ML has datatype declarations
Allow us to introduce new types,

with constructors for building values datatype order = LESS | EQUAL | GREATER datatype ’a option = NONE | SOME of ’a NONE : int option SOME 42 : int option ’a list is a built-in datatype with constructors nil and ::

SLIDE 4

comparing ints

datatype order = LESS | EQUAL | GREATER

a datatype

definition introducing the type

rder

with values LESS, EQUAL, GREATER

SLIDE 5

comparing ints

datatype order = LESS | EQUAL | GREATER

a datatype

definition introducing the type

rder

with values LESS, EQUAL, GREATER

fun compare(x:int, y:int):order = if x<y then LESS else if y<x then GREATER else EQUAL

SLIDE 6

comparing ints

datatype order = LESS | EQUAL | GREATER compare : int * int -> order compare(x,y) = LESS if x<y compare(x,y) = EQUAL if x=y compare(x,y) = GREATER if x>y

a datatype

definition introducing the type

rder

with values LESS, EQUAL, GREATER

fun compare(x:int, y:int):order = if x<y then LESS else if y<x then GREATER else EQUAL

SLIDE 7

properties

≤ is a linear ordering
< is defined by

and satisfies

If a ≤ b and b ≤ a then a = b (antisymmetric) If a ≤ b and b ≤ c then a ≤ c (transitive) Either a ≤ b or b ≤ a (connected) a < b if and only if (a ≤ b and a ≠ b) a < b or b < a or a = b (trichotomy)

f < and ≤ on integers

SLIDE 8

sorted

A list is <-sorted (or just sorted) if and only if each item in the list is ≤ all later items.

sorted : int list -> bool fun sorted [ ] = true | sorted [x] = true | sorted (x::y::L) = (x <= y) andalso sorted(y::L)

SLIDE 9

sorted

A list is <-sorted (or just sorted) if and only if each item in the list is ≤ all later items.

sorted : int list -> bool

For all L : int list, sorted(L) = true if L is sorted = false otherwise

fun sorted [ ] = true | sorted [x] = true | sorted (x::y::L) = (x <= y) andalso sorted(y::L)

SLIDE 10

sorted

A list is <-sorted (or just sorted) if and only if each item in the list is ≤ all later items.

sorted : int list -> bool

For all L : int list, sorted(L) = true if L is sorted = false otherwise

fun sorted [ ] = true | sorted [x] = true | sorted (x::y::L) = (x <= y) andalso sorted(y::L)

(Prove this, by induction on list length) (Note the relevance of transitivity etc.)

SLIDE 11

specs and code

We use sorted only in specifications.
Our sorting functions won’t use it.
But you could use it for testing...

SLIDE 12

specs and code

We use sorted only in specifications.
Our sorting functions won’t use it.
But you could use it for testing...

For every integer list L there is a unique sorted permutation of L

SLIDE 13

insertion sort

Insertion sort is a simple sorting algorithm that builds the sorted list recursively, one item at a time.

If the list is empty, do nothing.
Otherwise, each recursive call inserts an item from

the input list into its correct position in the sorted list so far.

SLIDE 14

insertion sort

Insertion sort is a simple sorting algorithm that builds the sorted list recursively, one item at a time.

If the list is empty, do nothing.
Otherwise, each recursive call inserts an item from

the input list into its correct position in the sorted list so far.

(Wikipedia doesn’t give good specs!)

SLIDE 15

insertion sort

If the list is empty, do nothing.
Otherwise, recursively sort the tail, then

insert the head item into its correct position in the (already sorted) tail.

SLIDE 16

insertion sort

If the list is empty, do nothing.
Otherwise, recursively sort the tail, then

insert the head item into its correct position in the (already sorted) tail.

… We need a helper function

SLIDE 17

insertion sort

If the list is empty, do nothing.
Otherwise, recursively sort the tail, then

insert the head item into its correct position in the (already sorted) tail.

… We need a helper function ins : int * int list -> int list REQUIRES … ENSURES …

SLIDE 18

insertion

ins : int * int list -> int list

REQUIRES L is a sorted list ENSURES ins(x, L) = a sorted permutation of x::L

per·mu·ta·tion

noun  A way, in which a list of things can be arranged:   "his thoughts raced ahead to fifty different permutations of what he must do"

Powered by Oxford Dictionaries

inserts x into its correct position in L

SLIDE 19

insertion

ins : int * int list -> int list

REQUIRES L is a sorted list ENSURES ins(x, L) = a sorted permutation of x::L

fun ins (x, [ ]) = [x]

per·mu·ta·tion

noun  A way, in which a list of things can be arranged:   "his thoughts raced ahead to fifty different permutations of what he must do"

Powered by Oxford Dictionaries

inserts x into its correct position in L

SLIDE 20

insertion

ins : int * int list -> int list

REQUIRES L is a sorted list ENSURES ins(x, L) = a sorted permutation of x::L

fun ins (x, [ ]) = [x] | ins (x, y::R) =

per·mu·ta·tion

noun  A way, in which a list of things can be arranged:   "his thoughts raced ahead to fifty different permutations of what he must do"

Powered by Oxford Dictionaries

inserts x into its correct position in L

SLIDE 21

insertion

ins : int * int list -> int list

REQUIRES L is a sorted list ENSURES ins(x, L) = a sorted permutation of x::L

fun ins (x, [ ]) = [x] | ins (x, y::R) = if x > y

per·mu·ta·tion

noun  A way, in which a list of things can be arranged:   "his thoughts raced ahead to fifty different permutations of what he must do"

Powered by Oxford Dictionaries

inserts x into its correct position in L

SLIDE 22

insertion

ins : int * int list -> int list

REQUIRES L is a sorted list ENSURES ins(x, L) = a sorted permutation of x::L

fun ins (x, [ ]) = [x] | ins (x, y::R) = if x > y then y :: ins(x, R)

per·mu·ta·tion

noun  A way, in which a list of things can be arranged:   "his thoughts raced ahead to fifty different permutations of what he must do"

Powered by Oxford Dictionaries

inserts x into its correct position in L

SLIDE 23

insertion

else x :: (y :: R) ins : int * int list -> int list

REQUIRES L is a sorted list ENSURES ins(x, L) = a sorted permutation of x::L

fun ins (x, [ ]) = [x] | ins (x, y::R) = if x > y then y :: ins(x, R)

per·mu·ta·tion

noun  A way, in which a list of things can be arranged:   "his thoughts raced ahead to fifty different permutations of what he must do"

Powered by Oxford Dictionaries

inserts x into its correct position in L

SLIDE 24

ins equations

ins (x, [ ]) = [x] ins (x, y::R) = if x > y then y::ins(x, R) else x::(y::R)

For all values x, y : int and R : int list,

ins (x, y::R) = y::ins(x, R) if x > y = x::(y::R) otherwise

SLIDE 25

Proof: By induction on length of L.

Base case: When L has length 0, L is [ ].

[ ] is sorted, and ins(x, [ ]) = [x] is a sorted perm of x::[ ].

Inductive case: Let k>0 and L be sorted, of length k.

Let y, R be the head, tail of L: so L = y::R. R is sorted, of length < k, and y ≤ all of R. Need to show: ins(x, y::R) = a sorted perm of x::(y::R)

For all sorted integer lists L, all values x:int, ins(x, L) = a sorted permutation of x::L.

IH: For all sorted lists A of length < k, all values x, ins(x, A) = a sorted perm of x::A.

correctness

SLIDE 26

inductive case

R is sorted, length < k, and y ≤ all of R.

By IH, ins(x, R) = a sorted perm of x::R

If x>y we have ins(x, y::R) = y::ins(x,R) This list is sorted because... This list is a perm of x::(y::R) because... Otherwise, x≤y and ins(x, y::R) = x::(y::R) This list is sorted because... This list is a perm of x::(y::R) because...

In all cases, ins(x, y::R) = a sorted perm of x::(y::R)

ins (x, y::R) = = x::(y::R) otherwise, i.e. if x ≤ y y::ins(x, R) if x > y

(some more details)

SLIDE 27

comments

Fill in the missing details in that proof sketch
Notice where you use basic properties of ≤
these properties are crucial
often used implicitly, without mention
that’s OK, except that you need to realize it

Now that we have ins, let’s define isort…

SLIDE 28

isort

isort : int list -> int list

ENSURES isort(L) = a sorted perm of L

SLIDE 29

isort

isort : int list -> int list

fun isort [ ] = [ ]

ENSURES isort(L) = a sorted perm of L

SLIDE 30

isort

| isort (x::R) = ins (x, isort R)

isort : int list -> int list

fun isort [ ] = [ ]

ENSURES isort(L) = a sorted perm of L

SLIDE 31

isort

| isort (x::R) = ins (x, isort R)

isort : int list -> int list

fun isort [ ] = [ ]

ENSURES isort(L) = a sorted perm of L

“isort (x::R) inserts x into its correct position in the sorted tail, isort R”

SLIDE 32

Proof: By structural induction on L.

Base case: for L = [ ].

Show that isort [ ] = a sorted perm of [ ].

Inductive case: for L = y::R.

IH: isort R = a sorted perm of R. Show: isort(y::R) = a sorted perm of y::R.

For all values L: int list, isort L = a sorted permutation of L.

By the proven ins spec, it follows that ins (y, isort R) = a sorted perm of y::R

correctness

isort (y::R) = ins (y, isort R) isort R is a sorted perm of R

SLIDE 33

comments

The proof was “by structural induction on L”
Every list value L is either [ ] (nil)
r y::R, where R is a “smaller” list value
We could just as well have said

“by induction on length of L”

[ ] has length 0
0 ≤ length R < length(y::R)

isort (y::R) calls isort R

SLIDE 34

perm facts

A perm of a perm of L is a perm of L In the correctness proof we used some obvious facts about permutations. y::(a perm of R) is a perm of (y::R)

SLIDE 35

corollaries

SLIDE 36

corollaries

isort is a total function from int list to int list

SLIDE 37

corollaries

isort is a total function from int list to int list When e evaluates to L, isort e evaluates to the sorted version of L

SLIDE 38

a variation

fun isort’ [ ] = [ ] | isort’ [x] = [x] | isort’ (x::R) = ins (x, isort’ R)

| isort (x::R) = ins (x, isort R)

fun isort [ ] = [ ]

SLIDE 39

a variation

fun isort’ [ ] = [ ] | isort’ [x] = [x] | isort’ (x::R) = ins (x, isort’ R)

is this clause redundant

| isort (x::R) = ins (x, isort R)

fun isort [ ] = [ ]

SLIDE 40

If in doubt, test, then prove

variation

fun isort’ [ ] = [ ] | isort’ [x] = [x] | isort’ (x::R) = ins (x, isort’ R)

isort’ : int list -> int list

SLIDE 41

If in doubt, test, then prove

variation

fun isort’ [ ] = [ ] | isort’ [x] = [x] | isort’ (x::R) = ins (x, isort’ R)

isort’ : int list -> int list

SLIDE 42

If in doubt, test, then prove

variation

fun isort’ [ ] = [ ] | isort’ [x] = [x] | isort’ (x::R) = ins (x, isort’ R)

isort’ : int list -> int list

SLIDE 43

equivalent

isort and isort’ are extensionally equivalent:

For all L : int list, isort L = isort’ L.

Proof? See lecture notes…

OR: Re-do the isort proof for isort’ (easy)

Hence they satisfy the same spec, so

For all L : int list, isort L = isort’ L = the sorted perm of L

SLIDE 44

equivalent

isort and isort’ are extensionally equivalent:

For all L : int list, isort L = isort’ L.

Proof? See lecture notes…

No need for extra clause but it doesn’t do any harm

OR: Re-do the isort proof for isort’ (easy)

Hence they satisfy the same spec, so

For all L : int list, isort L = isort’ L = the sorted perm of L

SLIDE 45

work

Let Wins(n) be the work for ins(x, L)

when x, L are values and L has length n

Let Wisort(n) be the work for isort(L)

when L is a list of length n

SLIDE 46

work

Let Wins(n) be the work for ins(x, L)

when x, L are values and L has length n

Let Wisort(n) be the work for isort(L)

when L is a list of length n Wins(n) is O(n)

SLIDE 47

work

Let Wins(n) be the work for ins(x, L)

when x, L are values and L has length n

Let Wisort(n) be the work for isort(L)

when L is a list of length n Wins(n) is O(n) Wisort(0) = 1 Wisort(n) = 1 + Wins(n-1) + Wisort(n-1) for n > 0

SLIDE 48

work

Let Wins(n) be the work for ins(x, L)

when x, L are values and L has length n

Let Wisort(n) be the work for isort(L)

when L is a list of length n Wins(n) is O(n)

SLIDE 49

work

Let Wins(n) be the work for ins(x, L)

when x, L are values and L has length n

Let Wisort(n) be the work for isort(L)

when L is a list of length n Wins(n) is O(n) Wisort(0) = 1 Wisort(n) = O(n) + Wisort(n-1) for n > 0

SLIDE 50

work

Let Wins(n) be the work for ins(x, L)

when x, L are values and L has length n

Let Wisort(n) be the work for isort(L)

when L is a list of length n Wins(n) is O(n) Wisort(0) = 1 Wisort(n) = O(n) + Wisort(n-1) for n > 0 Wisort(n) is O(n2)

SLIDE 51

work

Let Wins(n) be the work for ins(x, L)

when x, L are values and L has length n

Let Wisort(n) be the work for isort(L)

when L is a list of length n Wins(n) is O(n) Wisort(0) = 1 Wisort(n) = O(n) + Wisort(n-1) for n > 0 Wisort(n) is O(n2)

THIS IS SLOW! WE CAN DO BETTER!

SLIDE 52

mergesort

Conceptually, a merge sort works as follows:

1. Divide the unsorted list into n sublists,

each containing 1 element.

2. Repeatedly Merge sublists to produce new

sublists until there is only 1 sublist left.

SLIDE 53

mergesort

Conceptually, a merge sort works as follows:

1. Divide the unsorted list into n sublists,

each containing 1 element.

2. Repeatedly Merge sublists to produce new

sublists until there is only 1 sublist left.

Wrong! Wrong! Wrong!

SLIDE 54

mergesort

Conceptually, a merge sort works as follows:

1. Divide the unsorted list into n sublists,

each containing 1 element.

2. Repeatedly Merge sublists to produce new

sublists until there is only 1 sublist left.

Wrong! Wrong! Wrong! Doesn’t say “recursive”...

SLIDE 55

mergesort

Conceptually, a merge sort works as follows:

1. Divide the unsorted list into n sublists,

each containing 1 element.

2. Repeatedly Merge sublists to produce new

sublists until there is only 1 sublist left.

Wrong! Wrong! Wrong! Doesn’t say “recursive”...

… what’s n?

SLIDE 56

mergesort

Conceptually, a merge sort works as follows:

1. Divide the unsorted list into n sublists,

each containing 1 element.

2. Repeatedly Merge sublists to produce new

sublists until there is only 1 sublist left.

Wrong! Wrong! Wrong! Doesn’t say “recursive”...

… what’s n? … repeatedly????

SLIDE 57

mergesort

Conceptually, a merge sort works as follows:

1. Divide the unsorted list into n sublists,

each containing 1 element.

2. Repeatedly Merge sublists to produce new

sublists until there is only 1 sublist left.

Wrong! Wrong! Wrong! Doesn’t say “recursive”...

… what’s n? … and then? … repeatedly????

SLIDE 58

mergesort

Conceptually, a merge sort works as follows:

1. Divide the unsorted list into n sublists,

each containing 1 element.

2. Repeatedly Merge sublists to produce new

sublists until there is only 1 sublist left.

Wrong! Wrong! Wrong! Doesn’t say “recursive”...

… what’s n? … and then?

What’s the output? How does it relate to the input?

… repeatedly????

SLIDE 59

mergesort

A recursive divide-and-conquer algorithm

If list has length 0 or 1, do nothing.
Otherwise,

split the list into two shorter lists, sort these two lists, merge the (sorted) results

SLIDE 60

implementation

First, let’s design helper functions

split : int list -> int list * int list merge : int list * int list -> int list

SLIDE 61

implementation

First, let’s design helper functions

split : int list -> int list * int list merge : int list * int list -> int list (what specs should we use?)

SLIDE 62

implementation

First, let’s design helper functions

split : int list -> int list * int list merge : int list * int list -> int list (what specs should we use?) split splits a list into two sublists merge combines two sorted lists into one

SLIDE 63

implementation

First, let’s design helper functions