Slides6 Analysis PDF

15-150 Fall 2014
Lecture 6
Stephen Brookes
Announcements
Read the notes!
Do the Self-tests!
Homework 3 out...
Start Homework early,
hand it in on time,
and do your own work!
today
Sorting an integer list
specifications and proofs

asymptotic analysis
Insertion sort
Mergesort
SML features
. datatype definitions

. boolean connectives

.
.
- andalso, orelse

case expressions

<> means
comparison
compare : int * int -> order
datatype
definition
datatype order = LESS | EQUAL | GREATER

!
fun compare(x:int, y:int):order =

if x<y then LESS else

if y<x then GREATER else EQUAL
compare(x,y) = LESS
compare(x,y) = EQUAL
compare(x,y) = GREATER
introduces type

order

and values

LESS, EQUAL, GREATER

of type!order

if x<y
if x=y

if x>y
key properties
is a linear ordering

For all values a,b,c :int
If a b and b a then a = b
If a b and b c then a c
Either a b or b a
(antisymmetry)!
(transitivity)!
(totality)
< is defined by

a < b if and only if (a b and a b)
and satisfies
a < b or b < a or a = b
(trichotomy)
sorted
sorted : int list -> bool
A list of integers is <-sorted (or just sorted)

if each item in the list is all items that occur later.
the type

fun sorted [ ] = true

order

| sorted [x] = true

is an equality type
| sorted (x::y::L) =

(compare(x,y) <> GREATER) andalso sorted(y::L)

For all L : int list,

sorted(L) = true if L is sorted

= false otherwise
specs and code

We use sorted only in specifications.

Our sorting functions wont use it.

But you could use it for testing...
For every integer list L

there exists

a sorted permutation of L
insertion sort
Insertion sort is a simple sorting algorithm that

builds the sorted list recursively, one item at a time.!
If the list is empty, do nothing.!
Otherwise, each recursive call inserts an item from
the input list into its correct position in the sorted
list so far.
(Wikipedia doesnt give good specs!)
insertion sort
If the list is empty, do nothing.!

Otherwise, recursively sort the tail,
then insert the head item into its correct position
in the sorted tail.
need a helper function to do insertion

insert : int * int list -> int list

REQUIRES

ENSURES
insertion
ins : int * int list -> int list
(* REQUIRES L is a sorted list
*)
(* ENSURES ins(x, L) = a sorted permutation of x::L *)
a sorted list consisting of x and all items from L
fun ins (x, [ ]) = [x]

| ins (x, y::L) = case compare(x, y) of
GREATER => y::ins(x, L)
|
_
=> x::y::L
For all sorted integer lists L,

ins(x, L) = a sorted permutation of x::L.
insertion
ins : int * int list -> int list
(* REQUIRES L is a sorted list
*)
(* ENSURES ins(x, L) = a sorted perm of x::L *)
fun ins (x, [ ]) = [x]
| ins (x, y::L) =
if x>y then y::ins(x, L) else x::y::L
(equivalent code, using if-then-else)
proof outline
Theorem
For all sorted integer lists L, all values x:int,

ins(x, L) = a sorted permutation of x::L.
Proof: By induction on length of L.

Base case: When L has length 0, L is [ ].
[ ] is sorted, and ins(x, [ ]) = [x] is a sorted perm of x::[ ].

Inductive case: Let k>0 and assume IH:
For all sorted lists A of length < k, all values x,

ins(x, A) = a sorted perm of x::A.
Let L be a sorted list of length k.

Pick y and R such that L=y::R. So length(R) < k.

R is a sorted list with length < k, and y all of R

By IH, ins(x, R) = a sorted perm of x::R

Show: ins(x, y::R) = a sorted perm of x::(y::R)
sketch
for inductive step
Exercise:
fill in the details!
ins (x, y::R) = case compare(x, y) of

GREATER => y::ins(x, R)
|
_
=> x::y::R
R is sorted and y all of R.

By IH, ins(x, R) = a sorted perm of x::R

If x>y we have ins(x, y::R) = y::ins(x,R)

This list is sorted because...
This list is a perm of x::y::R because...

If xy we have ins(x, y::R) = x::y::R

This list is sorted because...
This list is a perm of x::y::R because...

In all cases, ins(x, y::R) = a sorted perm of x::y::L

isort
isort : int list -> int list
(* REQUIRES L is a value of type int list

*)
(* ENSURES isort(L) = a sorted perm of L *)
fun isort [ ] = [ ]
| isort (x::L) = ins (x, isort L)
For all values L: int list,

isort L = a sorted permutation of L.
For all integer lists L,

proof outline
For all L: int list,

Proof: By induction on length of L.

Base case: for L = [ ].
Show that isort [ ] = a sorted perm of [ ].

Inductive case: for L = y::R.
IH:
isort R = a sorted perm of R.
Show: isort(y::R) = a sorted perm of y::R.
Use the proven spec for ins!
a variation
fun isort [ ] = [ ]
clause for isort [x]

seems redundant
fun isort [ ] = [ ]

| isort [x] = [x]

variation
isort : int list -> int list
fun isort [ ] = [ ]

| isort [x] = [x]

If in doubt,

test it then prove it
equivalent
isort and isort are extensionally equivalent.

For all L : int list, isort L = isort L.

Proof? (See lecture notes!)

No need for extra clause

but it doesnt do any harm
mergesort
Conceptually, a merge sort works as follows:!
!
1. Divide the unsorted list into n sublists,
each containing 1 element.
!
2. Repeatedly Merge sublists to produce new
sublists until there is only 1 sublist left.
! whats n?
and then?
Whats the output?

How does it relate to the input?
mergesort
A recursive divide-and-conquer algorithm

If list has length 0 or 1, do nothing.

If list has length 2 or more,
split the list into two smaller lists,
mergesort these lists,
merge the results
(not a spec, but a pretty clear description of an algorithm)
implementation
First, design helper functions for
splitting and merging
split : int list -> int list * int list

merge : int list * int list -> int list
split [4,2,1,3] = ([4,1], [2,3])
merge([1,4], [2,3]) = [1,2,3,4]
split
split : int list -> int list * int list
(* REQUIRES true
*)

(* ENSURES split(L) = a pair of lists (A, B)
*)
(* such that length(A) and length(B) differ by at most 1, *)
(* and A@B is a permutation of L.
*)
write as
length(A)length(B)
fun split [ ] = ([ ], [ ])
| split [x] = ([x], [ ])
| split (x::y::L) =
let val (A, B) = split L in (x::A, y::B) end
split [4,2,1,3] = ([4,1], [2,3])
proof outline
For all L:int list,

split(L) = a pair of lists (A, B) such that

length(A) length(B) and A@B is a permutation of L.
Proof: by (strong) induction on length of L

Base cases: L = [ ], [x]
(i) Show that split [ ] = a pair (A, B) such that
length(A)length(B) & A@B is a perm of [ ].
(ii) Show that split [x] = a pair (A, B) such that
length(A)length(B) & A@B is a perm of [x].

Inductive case: L=x::y::R.
Key facts
split [ ] = ([ ], [ ])
[ ]@[ ] = [ ]
split [x] = ([x], [ ])
[x]@[ ] = [x]
Induction Hypothesis: split(R) = a pair (A, B) such that

length(A)length(B) & A@B is a perm of R.
(iii) Show that split(x::y::R) = a pair (A, B) such that
length(A)length(B) & A@B is a perm of x::y::R.
Key facts
split (x::y::R) = (x::A, y::B)

length(x::A) length(y::B)
(x::A)@(y::B) is a perm of x::y::R
comments
We used strong induction on length of L
rather than simple induction.

Reason: split(x::y::R) calls split(R) and
length of R is two less than length of x::y::R.
notes
If length(L)>1 and split(L) = (A, B),
then A and B have smaller length than L.

This follows from the spec,
using some fairly obvious facts:

A@B is a perm of L, so
length(A)+length(B)=length(L)
length(A) & length(B) differ by 0 or 1
if n>1 and n odd, (n div 2)+1 < n
if n>1 and n even, n div 2 < n
merge
merge : int list * int list -> int list
(* REQUIRES A and B are sorted lists
(* ENSURES merge(A, B) = a sorted perm of A@B
*)

*)
fun merge (A, [ ]) = A

| merge ([ ], B) = B
| merge (x::A, y::B) = case compare(x, y) of

LESS => x :: merge(A, y::B)
| EQUAL => x::y::merge(A, B)
| GREATER => y :: merge(x::A, B)
merge([1,4], [2,3]) = [1,2,3,4]
proof outline
For all sorted lists A and B,

merge(A, B) = a sorted permutation of A@B.
Method: strong induction on product of lengths of A, B.

Inductive case: (x::A, y::B).
Base cases: (A, [ ]) and ([ ], B).

(i) Show: if A is sorted, merge(A,[ ]) = a sorted perm of A@[ ].
(ii) Show: if B is sorted, merge([ ],B) = a sorted perm of [ ]@B.

IH: for all pairs(A, B) with smaller product of lengths,

if A & B are sorted, merge(A, B) = a sorted perm of A@B.
Show: if x::A and y::B are sorted,

merge(x::A, y::B) = a sorted perm of (x::A)@(y::B).
(Exercise: fill in the details!)
clause order
fun merge (A, [ ]) = A

| merge ([ ], B) = B
| merge (x::A, y::B) =

or
fun merge (x::A, y::B) =

| merge (A, [ ]) = A

| merge ([ ], B) = B
Does clause order matter here? NO

Exhaustive

Patterns are
Overlap of first two clauses is harmless
Each yields merge([ ], [ ]) = [ ]
(not always true!)

Could use nested if-then-else instead of case.
But we need a 3-way branch, so case is better style.
so far
We defined split and merge

We proved they meet their specs

Now lets use them to implement the
mergesort algorithm...
msort
msort : int list -> int list
(* REQUIRES true
(* ENSURES msort(L) = a sorted perm of L
*)

*)
fun msort [ ] = [ ]
msort [4,2,1,3]

| msort [x] = [x]
= merge(msort [4,1], msort [2,3])

= merge([1,4], [2,3])

| msort L =
= [1,2,3,4]
let

val (A, B) = split L

in
merge (msort A, msort B)
end
msort [4,2,1,3] =>* [1,2,3,4]
msort
(* REQUIRES true
(* ENSURES msort(L) = a sorted perm of L
fun msort [ ] = [ ]
| msort [x] = [x]
| msort L = let


val A = msort A

an

val
B
=
msort
B

alternative

in
version
merge (A, B)
end
*)

*)
proof outline
For all L:int list,

msort(L) = a sorted permutation of L.
Method: by strong induction on length of L

Base cases: L = [ ], L = [x]
(i) Show msort [ ] = a sorted perm of [ ]
(ii) Show msort [x] = a sorted perm of [x]

Inductive case: length(L)>1.
Inductive hypothesis: for all shorter lists R,

msort R = a sorted perm of R.
Show that msort L = a sorted perm of L.
msort
fun msort [ ] = [ ]
is this clause
| msort [x] = [x]
redundant?
| msort L = let


in


end
a variation
fun msort [ ] = [ ]
| msort L = let


in


end
loops forever

on non-empty lists
the problem
split [x] = ([x], [ ])

msort [x] =>* (fn ... => ...) (msort [x], msort [ ])
leads to infinite computation
principles

Every function needs a spec
Every spec needs a proof

Recursive functions require inductive proofs

Learn to pick an appropriate method...

Choose helper functions wisely!
proof of msort was easy,

because of split and merge
choose wisely
merge and split were helpers for msort
Use helper functions with truly helpful specs

merge also satisfies other specs, e.g.
For all integer lists L and R,
merge(L, R) = a perm of L@R.
This wont help prove

correctness of msort!
the joy of specs

The proof for msort relied only on the
specifications of split and merge
Can replace split by any other function

with the same specification, and
the new msort will still meet its spec!
the old proof

for msort

still works!
example
fun split [ ] = ([ ], [ ])

| split [x] = ([ ], [x])

| split (x::y::L) = let val (A, B) = split L in (x::A, y::B) end

!
fun msort [ ] = [ ]

| msort [x] = [x]

| msort L = let


in

merge(msort A, msort B)

end;
example
split and split are not extensionally equivalent,
but they both satisfy the specification used in
the correctness proof of msort

... so msort and msort are both correct
so far
Weve implemented insertion sort and
mergesort in ML, correctly

What about efficiency?

Slides6 Analysis PDF

Diunggah oleh

Informasi Dokumen

Deskripsi Asli:

Judul Asli

Hak Cipta

Format Tersedia

Bagikan dokumen Ini

Bagikan atau Tanam Dokumen

Opsi Berbagi

Apakah menurut Anda dokumen ini bermanfaat?

Apakah konten ini tidak pantas?

Hak Cipta:

Format Tersedia

Slides6 Analysis PDF

Diunggah oleh

Hak Cipta:

Format Tersedia

15-150 Fall 2014

specifications and proofs

datatype order = LESS | EQUAL | GREATER

fun compare(x:int, y:int):order =

a < b if and only if (a b and a b)

A list of integers is <-sorted (or just sorted)

For all L : int list,

specs and code

Insertion sort is a simple sorting algorithm that

(Wikipedia doesnt give good specs!)

If the list is empty, do nothing.!

need a helper function to do insertion

fun ins (x, [ ]) = [x]

For all sorted integer lists L, all values x:int,

Proof: By induction on length of L.

Inductive case: Let k>0 and assume IH:

For all sorted lists A of length < k, all values x,

Let L be a sorted list of length k.

ins (x, y::R) = case compare(x, y) of

If x>y we have ins(x, y::R) = y::ins(x,R)

If xy we have ins(x, y::R) = x::y::R

In all cases, ins(x, y::R) = a sorted perm of x::y::L

(* REQUIRES L is a value of type int list

Proof: By induction on length of L.

Show that isort [ ] = a sorted perm of [ ].

Inductive case: for L = y::R.

Proof? (See lecture notes!)

split : int list -> int list * int list

split [4,2,1,3] = ([4,1], [2,3])

For all L:int list,

Proof: by (strong) induction on length of L

Inductive case: L=x::y::R.

Induction Hypothesis: split(R) = a pair (A, B) such that

split (x::y::R) = (x::A, y::B)

(x::A)@(y::B) is a perm of x::y::R

Reason: split(x::y::R) calls split(R) and

length of R is two less than length of x::y::R.

then A and B have smaller length than L.

This follows from the spec,

using some fairly obvious facts:

fun merge (A, [ ]) = A

merge([1,4], [2,3]) = [1,2,3,4]

Method: strong induction on product of lengths of A, B.

Inductive case: (x::A, y::B).

Base cases: (A, [ ]) and ([ ], B).

IH: for all pairs(A, B) with smaller product of lengths,

Show: if x::A and y::B are sorted,

(Exercise: fill in the details!)

fun merge (x::A, y::B) =

Does clause order matter here? NO

Each yields merge([ ], [ ]) = [ ]

(not always true!)

msort [4,2,1,3] =>* [1,2,3,4]

Method: by strong induction on length of L

Inductive case: length(L)>1.

Inductive hypothesis: for all shorter lists R,

Use helper functions with truly helpful specs

This wont help prove

the joy of specs

Can replace split by any other function

... so msort and msort are both correct

What about efficiency?