Tightening Big M Constraints Lou Hafer DIMACS Workshop on COIN-OR - PDF document

Tightening ‘Big M’ Constraints Lou Hafer DIMACS Workshop on COIN-OR July 17 – 20, 2006 1

Various portions joint with Mikhail Bilenky, Alexander Kononov, Cheryl Petreman, and Ken Collins. 1-1

Time-Expanded Models Suppose we have jobs j = 1, . . . , J with duration d j that use the same machine. The machine can handle one job at a time. A time-expanded model breaks time into discrete steps t = 1, . . . , T . A binary variable x jt encodes job j starting at time t . To express that at most one job can use the machine in a given time step τ , I can write J τ � � x jt = 1 j =1 t = τ − d j +1 for every time slot t . 2

What’s my motivation? A long time ago, when I was a bright-eyed Ph.D. student, I was working on digital hardware design automation. Es- sentially, it’s a resource-constrained scheduling problem: I have a bunch of tasks with precedence, and a box of parts, and I want to assign tasks to parts so as to minimise an objective that balances makespan and cost of parts. When the execution time of a task is known in advance, you can write a time-expanded model. A constraint that says “one task at a time” looks like this. I have to look over a range of time indices, in order to see if a job started earlier is still active. The duration of the activity has to be a known constant to write this constraint. 2-1

Continuous Time Models Suppose I use continuous variables s i , f i to en- code the start and finish times of jobs. To express that at most one job can use the machine at a given time, I can write s q − f p + α pq M ≥ 0 s p − f q + (1 − α pq ) M ≥ 0 for all pairs p , q which could overlap. 3

My problem was that I didn’t know the duration in advance. One of the decisions in digital hardware design affects the duration of a task. So I tried using continuous variables, and ended up writing ‘big M’ constraints. M has to be big enough to satisfy the ‘wrong way’ constraint, once I’ve picked an order for p and q . In my defense, I’d only just learned about MIP, and had no idea what a bad model this was. After a while, I learned that researchers using the time- expanded model for digital hardware synthesis were get- ting much better computational results. And I found poly- hedral theory. Ever since, I’ve been interested in answering the question “What is it about the structure of the continuous time poly- hedron that makes this model behave so badly, while the time-expanded model behaves so well?” Finally, I can offer a partial answer. 3-1

A Simple Example A scheduling problem: Two activities, with no precedence, constant duration, upper and lower bounds on start times. The constraint system is: s 2 ) ≥ 0 s 2 − ( s 1 + d 1 ) + α (( ˆ s 1 + d 1 ) − ˇ s 1 ) ≥ 0 s 1 − ( s 2 + d 2 ) + (1 − α )(( ˆ s 2 + d 2 ) − ˇ s 1 ≤ s 1 ≤ ˆ ˇ s 1 s 2 ≤ s 2 ≤ ˆ ˇ s 2 0 ≤ α ≤ 1 4

Here’s a simple example. I’ve reverted to constant duration so I can stay in three dimensions. The first constraint says that if the execution order is x 1 → x 2 ( α = 0), then the start time of x 2 ( s 2 ) must come after the finish time of x 1 ( s 1 + d 1 ). But what about when x 2 goes first? Then α = 1, and the value ( ˆ s 1 + d 1 ) − ˇ s 2 is always larger than s 2 − ( s 1 + d 1 ). 4-1

α D F E I s1 J H G C B A s2 5

Here’s the polytope. On the bottom, s 1 goes first. On the top, s 2 goes first. The ramps represent intermediate values of α . The dashed lines are the upper and lower bounds on the start times. s 2 ) ≥ 0. The ramp BCD is s 2 − ( s 1 + d 1 ) + α (( ˆ s 1 + d 1 ) − ˇ s 1 ) ≥ 0. The ramp EFA is s 1 − ( s 2 + d 2 ) + (1 − α )(( ˆ s 2 + d 2 ) − ˇ We need to cut off the non-integral points, leaving only the integral points. By inspection, we can see the cutting planes which are necessary. ABE and BDE together will cut off G and J, for example. 5-1

α D F E I s1 J H G C B A s2 6

The cutting planes are shown with dark lines. In effect, they will tighten the bound on a variable as we move from one feasible region to the other. 6-1

Non-Constant Duration Given activities x 1 and x 2 , to be serialised in some order. Minimum durations d 1 and d 2 . Continuous variables s 1 , s 2 , for activity start times and f 1 , f 2 , for activity finish times, each with upper and lower bounds. The minimum durations, and the upper and lower bounds on start and finish times, are assumed to be known constants. The constraint system is s 2 ≥ f 1 ∨ s 1 ≥ f 2 f 1 ≥ s 1 + d 1 f 2 ≥ s 2 + d 2 s i ≤ s i ≤ ˆ ˇ s i f i ≤ f i ≤ ˆ ˇ f i 7

Now, let’s look at the case where we don’t know execution time in advance. Here’s the constraint system: Two activities, x 1 and x 2 , with minimum durations d i . For each activity, the start time s i and finish time f i is represented as a continuous variable, with upper and lower bounds. The minimum duration and bounds on the start and finish times are assumed to be known a priori . For brevity, I’ll refer to the 10 values as the parameters of the polytope. We want to serialise the activities, in either order, and this gives the disjunctive constraint s 2 ≥ f 1 ∨ s 1 ≥ f 2 . 7-1

Linear Constraints Rewritten in ‘big M’ form, the constraints are s 2 ) ≥ 0 s 2 − f 1 + (1 − α )( ˆ f 1 − ˇ s 1 ) ≥ 0 s 1 − f 2 + α ( ˆ f 2 − ˇ f 1 ≥ s 1 + d 1 f 2 ≥ s 2 + d 2 s i ≤ s i ≤ ˆ ˇ s i f i ≤ f i ≤ ˆ ˇ f i 8

One common transformation for linearising disjunctions is the ‘big M’ form. Here, if the variable α = 1, activity x 1 should execute first; if α = 0, activity x 2 should execute first. When α is equal to 0 or 1, one of the ordering constraints is enforced and the other is trivially satisfied. As mentioned a moment ago, it’s well known that this model has lousy computational behaviour. One oft-stated reason relates to the value of M . Many peo- ple have observed that it’s critically important that M be as small as possible — here we can see that it’s explicitly related to the upper and lower bounds on the variables. The other reason is that the constraints shown here in- clude only four of the 20 possible facets of this polytope. 8-1

Constraints for a Fixed Execution Order Suppose x 1 precedes x 2 . The fixed order polytope P 1 is described by this system of 11 constraints: s 2 − f 1 ≥ 0 f 1 − s 1 ≥ d 1 f 2 − s 2 ≥ d 2 s 1 ≥ ˇ s 1 − s 1 ≥ −min{ ˆ s 1 , ˆ s 2 − d 1 } f 1 ≥ ˇ f 1 − f 1 ≥ −min{ ˆ f 1 , ˆ s 2 } s 2 ≥ max{ ˇ s 2 , ˇ f 1 } − s 2 ≥ − ˆ s 2 f 2 ≥ max{ ˇ f 2 , ˇ f 1 + d 2 } − f 2 ≥ − ˆ f 2 9

The lower and upper bound constraints make use of min{•} s i , ˇ and max{•} because the values of the parameters ˇ s i , ˆ f i , ˆ f i , and d i determine the tighter bound. In effect, each max or min represents two constraints. All constraints are facets of P 1 for some combination of values of the upper and lower bounds and minimum durations. We can construct a similar set of constraints for the P 0 polytope where x 2 precedes x 1 . 9-1

Regions of the parameter space for P 1 ∨ ∧ f 1 +d 2 ≤ s 2 -d 1 ∧ f 2 1 A 1 B ∧ s 2 +d 2 3 A 3 B 11 6 ∨ f 1 +d 2 2 A 2 B 9 5 ∨ f 1 -d 1 8 A 4 A 10 A 7 A 10 B ∨ s 1 8 B 4 B 10 C 7 B 10 D ∨ s 2 +d 2 ∨ ∨ ∧ ∧ ∧ ∧ s f 1 -d 1 s 2 -d 1 s 2 +d 2 f f 1 -d 1 1 2 ∨ ∧ f 1 +d 2 > s 2 -d 1 ∧ f 2 1 A 1 B ∧ ∨ s 2 +d 2 f 3 A 3 B 2 11 6 ∨ f 1 +d 2 2 A 2 B 9 5 ∨ f 1 -d 1 7 A 8 A 4 A 10 A 10 B ∨ ∧ s s 1 1 7 B 8 B 4 B 10 C 10 D ∨ s 2 +d 2 ∨ ∨ ∧ ∧ ∧ ∧ s 1 f 1 -d 1 s 2 -d 1 s 2 +d 2 f f 1 -d 1 2 10

This figure illustrates the many different shapes the fixed order polytope can assume, plotted as a function of ˇ f 2 and s 1 . The parameter space can be divided into 20 regions, ˆ corresponding to 20 different sets of supporting constraints for the P 1 polytope The facial structure of the polytope changes with the values of the minimum durations and upper and lower bounds on the variables. The dividing lines show where there are changes in the supporting constraints. The max and min terms in the constraints define most of the boundaries. A few other necessary relationships between the parameters produce the remainder. The P 0 polytope is similar, as you’d expect from the sym- metry of the constraint system. 10-1

Incorporating Order of Execution Add a dimension with a binary variable α . Embed P 0 and P 1 at α = 0 and α = 1, respectively. Form the convex hull P = conv( P 0 ∪ P 1 ) 11

To generate a polytope that incorporates both execution or- ders, we embed the fixed order polytopes in a larger space, using a binary variable to specify the execution order. Then it’s a matter of lifting the facets and faces of the fixed order polytopes to find the facets of the full polytope. 11-1

Tightening Big M Constraints Lou Hafer DIMACS Workshop on COIN-OR - PDF document

Tightening Big M Constraints Lou Hafer DIMACS Workshop on COIN-OR July 17 20, 2006 1 Various portions joint with Mikhail Bilenky, Alexander Kononov, Cheryl Petreman, and Ken Collins. 1-1 Time-Expanded Models Suppose we have jobs j

Machine Learning Anders Holst SICS Big Data Analytics Analysis Big Data Big Value Big Data

Filtering Algorithms for Global Constraints Classes of Constraints Application-Based Graph-Based

Watched Literals in SAT and CP T opics in this Series Why SAT & Constraints? SAT

Big data, big research? Opportunities and constraints for computer supported social science Jrgen

M agseis ASA Pareto Oil & Offshore Conference 12 th September 2018 Tightening Market

Tortoise Race Benjamin Tal April 2012 2 0 1 1 GDP Grow th Show s I m pact of Tightening 2011

Slide 1 Today we are talking about Monetary Policy and the effects of easing and tightening the

Wishful Thinking or Effective Threat? Tightening Bank Resolution Regimes and Bank Risk-Taking

Tightening the net: a review of current and next generation spam filtering tools James Carpinter

Tightening worst-case timing analysis of Tilera-like NoC architecture Hamdi Ayed, Jrme

The effect of market structure and relationship lending on the likelihood of credit tightening

Wishful Thinking or Effective Threat? Tightening Bank Resolution Regimes and Bank Risk-Taking

The Impact of Tightening Margins on Plant Availability Amy OMahoney , Kristian Marr, David

I Prefer Pi Corey Sinnamon Febuary 3, 2015 Big Day 3/14/15 Big Day 3/14/15 Themes Big

Big Data Algorithms with Medical Applications Yixin Chen Outline Challenges to big data

CS535 Big Data 1/22/2020 Sangmi Lee Pallickara CS535 Big Data | Computer Science Department

twenty six diagonal cracking shear f c http://www.bam.de concrete construction:

Solving problems by searching Uninformed search algorithms Discussion Class CS 171 Friday,

Compression: Huffmans Algorithm Greg Plaxton Theory in Programming Practice, Spring 2004

Week 5 Video 5 Relationship Mining Network Analysis Todays Class Network Analysis Network

Common Stormwater Problems 1 11/29/2018 New Challenges and Mandates Aim To Improve Water

Play & Leisure The Power of Play 1.3m Free-standing Slide - FS13 Steel sections are all mild

Play & Leisure The Power of Play 2.1m Free-standing Slide - FS21 Steel sections are all mild

The Frequency Domain Time domain methods: regress present on past; capture dynamics in

Tightening Big M Constraints Lou Hafer DIMACS Workshop on COIN-OR - PDF document

Tightening Big M Constraints Lou Hafer DIMACS Workshop on COIN-OR July 17 20, 2006 1 Various portions joint with Mikhail Bilenky, Alexander Kononov, Cheryl Petreman, and Ken Collins. 1-1 Time-Expanded Models Suppose we have jobs j

Machine Learning Anders Holst SICS Big Data Analytics Analysis Big Data Big Value Big Data

Filtering Algorithms for Global Constraints Classes of Constraints Application-Based Graph-Based

Watched Literals in SAT and CP T opics in this Series Why SAT &amp; Constraints? SAT

Big data, big research? Opportunities and constraints for computer supported social science Jrgen

M agseis ASA Pareto Oil &amp; Offshore Conference 12 th September 2018 Tightening Market

Tortoise Race Benjamin Tal April 2012 2 0 1 1 GDP Grow th Show s I m pact of Tightening 2011

Slide 1 Today we are talking about Monetary Policy and the effects of easing and tightening the

Wishful Thinking or Effective Threat? Tightening Bank Resolution Regimes and Bank Risk-Taking

Tightening the net: a review of current and next generation spam filtering tools James Carpinter

Tightening worst-case timing analysis of Tilera-like NoC architecture Hamdi Ayed, Jrme

The effect of market structure and relationship lending on the likelihood of credit tightening

Wishful Thinking or Effective Threat? Tightening Bank Resolution Regimes and Bank Risk-Taking

The Impact of Tightening Margins on Plant Availability Amy OMahoney , Kristian Marr, David

I Prefer Pi Corey Sinnamon Febuary 3, 2015 Big Day 3/14/15 Big Day 3/14/15 Themes Big

Big Data Algorithms with Medical Applications Yixin Chen Outline Challenges to big data

CS535 Big Data 1/22/2020 Sangmi Lee Pallickara CS535 Big Data | Computer Science Department

twenty six diagonal cracking shear f c http://www.bam.de concrete construction:

Solving problems by searching Uninformed search algorithms Discussion Class CS 171 Friday,

Compression: Huffmans Algorithm Greg Plaxton Theory in Programming Practice, Spring 2004

Week 5 Video 5 Relationship Mining Network Analysis Todays Class Network Analysis Network

Common Stormwater Problems 1 11/29/2018 New Challenges and Mandates Aim To Improve Water

Play &amp; Leisure The Power of Play 1.3m Free-standing Slide - FS13 Steel sections are all mild

Play &amp; Leisure The Power of Play 2.1m Free-standing Slide - FS21 Steel sections are all mild

The Frequency Domain Time domain methods: regress present on past; capture dynamics in

Watched Literals in SAT and CP T opics in this Series Why SAT & Constraints? SAT

M agseis ASA Pareto Oil & Offshore Conference 12 th September 2018 Tightening Market

Play & Leisure The Power of Play 1.3m Free-standing Slide - FS13 Steel sections are all mild

Play & Leisure The Power of Play 2.1m Free-standing Slide - FS21 Steel sections are all mild