Homework due Tues 11/9 CLRS 16-2 (scheduling) CLRS 17-2 (binary - PDF document

Homework due Tues 11/9 • CLRS 16-2 (scheduling) • CLRS 17-2 (binary search) 1

Matroids A matroid is a pair ( S , I ) such that • S is a finite nonempty set and • I is a nonempty family of subsets of S with the following properties: 1. heredity If A ∈ I , then every subset of A is in I . 2. exchange property If A, B ∈ I and �A� < �B� , then there exists some x ∈ B \ A such that A ∪ { x } ∈ I . An element x is called an extension of A ∈ I if A ∪ { x } ∈ I . A maximal element of I is one with no extension. 2

Note that, assuming heredity, the exchange property is equivalent to: • If A, B ∈ I and � A � < � B � , then there is some C ⊆ B \ A such that A ∪ C ∈ I and � C � = � B � − � A � . By the exchange property, • all maximal elements in a matroid have the same size. 3

Graphic Matroids For an undirected graph G = ( V, E ), the graphic matroid M G = ( S G , I G ) of G is defined as follows: • S G = E . • A ⊆ E belongs to I G if and only if A is acyclic (( V, A ) is a forest). 4

Properties of Graphic Matroids Let G be a connected undirected Lemma A graph. Then the maximal elements of M G are the spanning trees of G . Proof Let n = � V � . Let T be an arbitrary spanning tree of G and let A be the edge set of T . Since T is acyclic, A belongs to G . There cannot be elements in I having size n , since they induce subgraphs of G with cycles. So, A is maximal. On the other hand, if A is an element in I having size n − 1, then since A induces an acyclic graph, it induces a spanning tree. 5

Let G be an undirected graph. Lemma B Let C 1 , · · · , C k the connected components of G . For each i , 1 ≤ i ≤ k , Z i be the set of all edges of the spanning trees of C i . Then the maximal elements of M G is Z 1 × · · · × Z k , that is, the cartesian product of Z 1 , · · · , Z k . 6

Let G be an undirected graph. Theorem C Then M G is a matroid. Proof Let G = ( V, E ) be an arbitrary undirected graph and let n = � V � . Let A ⊂ E be such that ( V, A ) is Heredity acyclic. Then, for all B ⊆ A , ( V, B ) is acyclic. so, M G is hereditary. 7

Let C 1 , . . . , C k be the Exchange Property connected components of G . Let A ∈ I . Then A can be expressed as the disjoint union of some A 1 , . . . , A k such that for each i , 1 ≤ i ≤ k , A i induces an acyclic subgraph of C i . Let B ∈ I . Then B can be similarly decomposed into the disjoint union of some B 1 , . . . , B k . Assume � A � < � B � . Then there is some i , 1 ≤ i ≤ k , such that � A i � < � B i � . Pick such an i . Since B i induces an acyclic subgraph of C i , A i induces a forest, but not a tree, of C i . Let D 1 , . . . , D m be the connected components of C i that A i induces, where m ≥ 2. Let t 1 , . . . , t m be the number of nodes in D 1 , . . . , D m . For each j , 1 ≤ j ≤ m , A i induces a tree of D j . So, the size of A i is ( t 1 − 1) + · · · ( t m − 1). 8

We claim that there exist some j, j ′ , 1 ≤ j < j ′ ≤ m , such that B i has an edge connecting a node in D j and D j ′ . To prove the claim, assume otherwise. Then each edge of B i belongs to one of the connected components D 1 , . . . , D m . Since B i induces an acyclic graph, for all j , 1 ≤ j ≤ m , the number of edges of B i within D j is at most t j − 1. So, the size of B i is at most ( t 1 − 1) + · · · ( t m − 1), and thus, does not exceed the size of A i . This is a contradiction. This proves the claim. 9

By the claim, there exist some j, j ′ , 1 ≤ j < j ′ ≤ m , such that B i has an edge connecting a node in D j and D j ′ . Pick such j and j ′ and such an edge e . Add e to A to obtain A ′ . By the way e is selected, A ′ induces an acyclic graph. So, M G has the exchange property. 10

Weighted Matroids Let w be a function from S to N + , the set of all positive integers. For each A ⊆ S , define w ( A ) as � x ∈ A w ( x ). Call w a weight function of M . An optimal subset of M with respect to w is the one having the largest weight. By definition, optimal subsets are maximal. Finding an optimal subset Greedy ( M, w ) 1 sort the elements in S in the non-increasing order of their weights; let x 1 , . . . , x m be the enumeration 2 A ← ∅ 3 for i ← 1 to m do 4 if A ∪ { x i } ∈ I then A ← A ∪ { x i } 5 6 return A 11

Why does this algorithm work? Let k be the smallest i such that Lemma D { x i } ∈ I . Then there is an optimal subset containing x k . Proof Let B be an optimal subset with x k / ∈ B . No element y of B has w ( y ) > w ( x k ) Begin with A = { x k } , add from B − A until � A � = � B � . A = B − y ∪ x for some y ∈ B w ( A ) = w ( B ) − w ( y ) + w ( x ) (1) ≥ w ( B ) 12

If an element is not an option initially, it cannot be an option later, because of heredity. 13

Application of the Matroid Theory: The task-scheduling problem Objects with deadlines and penalty. Input Integers n , d 1 , . . . , d n , w 1 , . . . , w n . Output Find a permutation p 1 , . . . , p n of 1 , . . . , n that minimizes � w i . p i >d i Intuitively, think of 1 , . . . , n as n tasks to be fulfilled that require a unit-time each and of d 1 , . . . , d n as the deadlines for their tasks. Starting from time 0, the n tasks are executed in an order. If a task is not accomplished on or before its deadline, the penalty associated with it is imposed. w 1 , . . . , w n are the penalty values. The goal is to find scheduling of the tasks that minimizes the total penalty. 14

The matroid over the set of tasks Let S = { 1 , . . . , n } Assume that the tasks are enumerated so that their deadlines are non-decreasing. Let A ⊆ S . We say that A is good if there is a canonical ordering of the items in A such that for all i ∈ A , the order of i in the ordering is at most d i . Let I be the set of all subsets of S that are good and let M = ( S , I ). 15

M is a matroid. Theorem E Proof To prove the heredity, let A ∈ I . Let π be an ordering of A such that for all i ∈ A , π ( i ) ≤ d i . Let B be a proper subset of A . Let σ be the ordering defined for all i ∈ B by: σ ( i ) = 1 + �{ j ∈ B | π ( j ) < π ( i ) }� Then, for all i , σ ( i ) ≤ π ( i ), and thus, σ ( i ) ≤ p i . So, B is good. 16

Exchange Property For each t , 1 ≤ t ≤ n , A ⊆ S , let µ ( A, t ) be the number of i ∈ A such that d i ≤ t . To prove the exchange property of M , let A and B be good ones such that � A � < � B � . Since A and B are good, for all t , 1 ≤ t ≤ n , both µ ( A, t ) and µ ( B, t ) are at most t . Let t 0 = max { t | µ ( A, t ) = t } if there is at least one t such that µ ( A, t ) = t and 0 otherwise. There is some j ∈ B \ A such that Claim F d j ≥ t 0 + 1. Let A 0 = { i ∈ A | d i ≤ t 0 } and Proof A 1 = A − A 0 . Similarly, define B 0 and B 1 . Since µ ( A, t 0 ) = t 0 , � A 0 � = t 0 . Since � B 0 � ≤ t 0 , this implies that � A 0 � ≤ � B 0 � . Sicne � A � < � B � , this implies that � A 1 � < � B 1 � . By definition, every element of B 1 has a deadline greater than t 0 , so there is some j ∈ B 1 \ A 1 such that d j ≥ t 0 + 1. 17

The proof continues . . . Let j be such that j ∈ B 1 \ A 1 and d j ≥ t 0 + 1. Note that, for all i ∈ A 0 π ( i ) ≤ t 0 and for all i ∈ A 1 d i ≥ i + 1. This means that for each element i in A 1 can be delayed by one unit-time. Since d j ≥ t 0 + 1, we can add j to A and still execute all of them by their deadlines. 18

Solution to the Maximization Problem Let R = w 1 + · · · w n . For each good A , let w ( A ) = � i ∈ A w i and let Q ( A ) = R − w ( A ). Then the problem of minimizing the total penalty incurred is equivalent to the problem of maximizing Q , which can be solved by greedy. 19

Chapter 17: Amortized Analysis Efficiency averaged over time. We assume that a sequence of operations is executed on a data structure and calculate the cost per operation averaged over the sequence. Some operations are cheap and some are expensive depending on the situation. 20

Our first example is Multipop , a new operation on a stack. With this you are able to pop any number of elements from a stack. However, it is implemented by repeated execution of Pop . What are the other permissible operations? Creation of an empty stack, Push , Pop , and Empty , which tests the emptiness. 21

Ostensibly Multipop is quite expensive because elimination of k objects requires O ( k ) steps. However, for us to be able to eliminate k objects Push has to be executed at least k times prior to that... Which means a bad thing does not happen so very often... 22

Our next example is a k -bit binary counter. Suppose we will increment n times a k -bit counter that is initially set to 0 ... The number of bit operations required is high if there is a long run of 1’s at the lower bits of the counter, but that does not happen very often. 23

Homework due Tues 11/9 CLRS 16-2 (scheduling) CLRS 17-2 (binary - PDF document

Homework due Tues 11/9 CLRS 16-2 (scheduling) CLRS 17-2 (binary search) 1 Matroids A matroid is a pair ( S , I ) such that S is a finite nonempty set and I is a nonempty family of subsets of S with the following properties: 1.

Homework 5 Due Tuesday Oct 26 CLRS 14-2.2 (rb tree black height) CLRS 14-2.3 (rb tree

Homework 8 due Tues 11/16 CLRS 18.1-5 (red-black vs. BTrees) CLRS 18.2-6 (complexity in t )

Homework 9 Due Tuesday Dec 6 CLRS 19.2-4 (correctness of heap union) CLRS 22.3-4

Homework 3 Due Thursday Sept 30 CLRS 8-3 (sorting variable-length items) CLRS 9-2 (weighted

Homework 2 Due Thursday Sept 23 CLRS 6.5-8 (algorithm for merging lists) CLRS 7-5 (median

Announcements HW 6 due tomorrow HW 7 due next Tues 4/2 Midterm #2: Tues 4/2 to Tues 4/9 Office

Homework 1 Due Thursday Sept 16 CLRS 2.3-7 CLRS 2-2 Solve the following recurrence

Homework 4 Due Thursday Oct 7 CLRS 12-4 (number of binary trees) CLRS 13.3-6 (rb insert

Homework and Exams Homework Context Free Languages Return Homework #2 Homework #3

Homework Homework Context Free Languages Return Homework #2 Homework #3 Due today

Homework Homework #1 returned today Kleene Theorem Homework #2 due today Homework

Homework Homework #5 returned Turing Machines Homework #6 due today Homework #7

Homework Homework #3 returned Chomsky Normal Form Homework #4 due today Homework #5

Homework Homework #2 returned Context Free Languages Homework #3 due today Homework #4

Aperiodic Task Scheduling Radek Pel anek Preemptive Scheduling Non-preemptive Scheduling

Reminders Late Homework 5 is due Homework 6 is due Homework 7 will be released today

The Mechanics of Evolution Interaction of Natural Selection and Inheritance (Genetics) Lifeforms

Computational Materials Discovery Using the USPEX Code Artem R. Oganov Skolkovo Institute of

Bayesian Personalized Feature Interaction Selection for Factorization Machines Yifan Chen 1,2

Maximization in Massive Datasets Alina Ene Joint work with Rafael Barbosa, Huy L. Nguyen, Justin

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture 23: Pedigree and inbred

A Model Selection Approach for Genome Wide Association Studies Florian Frommlet, Piotr Twarog,

Raising a Generation of Innovators: The How and Why of Promoting Creativity in Children The

A review of genetic parameters in Eucalyptus nitens Matthew Hamilton Brad Potts School of Plant