Topics in Computational Sustainability
CS 325
Spring 2016
Making Choices: Stochastic Optimization
Topics in Computational Sustainability CS 325 Spring 2016 Making - - PowerPoint PPT Presentation
Topics in Computational Sustainability CS 325 Spring 2016 Making Choices: Stochastic Optimization Introduction Stochastic programming is a modeling framework to deal with optimization problems that involve uncertainty. Real world
CS 325
Spring 2016
Making Choices: Stochastic Optimization
and uncertain parameters (e.g., how much energy the wind farm will actually produce, the actual costs of a project, ..)
Example
dry
All Corn All Soy All Beans Wet 100 70 80 Dry
40 35
All Corn All Soy All Beans Wet 100 70 80 Dry
40 35 Assume the probability of a wet season is p, the expected profit of planting the different crops: Corn: 100 p + (-10) (1-p) = -10+ 110p Soy: 40+ 30p Beans: 35+ 45p
Suppose p = 0.5, can anyone suggest a planting plan?
– If it is wet, corn is the most profitable plant all corn – If it is dry, soy is the most profitable plant all soy – Plant 1/2 corn, 1/2 soy ?
Expected Profit: 0.5 (-10 + 110(0.5)) + 0.5 (40 + 30(0.5))= 50 Is this optimal?
Suppose p = 0.5, can anyone suggest a planting plan? Plant all beans! Expected Profit: 35 + 45(0.5) = 57.5!
different outcomes for the future, is not equal to the “average” of the decisions that would have been best for each specific future outcome.
Discrete random variable Z is described by mass probabilities of all elementary events:
2 1 2 1 K K
2 1
K
Such that
If probability measure is discrete, the expected value of Z is the sum : Example: If Z represents the outcome of a die Similarly, given a function f
K i i i p
1
K i i i
1
X x
) 35 40 ) 10 ( )( 1 ( ) 80 70 100 ( )] , , , ( [ ) , , (
3 2 1 3 2 1 3 2 1 3 2 1
x x x p x x x p Z x x x f E x x x F
Crop yield optimization: Z is a binary random variable: wet (Z=1, with probability p) or dry season (Z=0, prob. 1-p) The expected value is
3 2 1 3 2 1
3 2 1 3 2 1
Crop yield optimization problem. Given p, subject to
3 2 1 3 2 1
maximize
X x
– Might not be possible to evaluate the integral in closed form – Computationally hard to evaluate
– The expected value is f(x)
S x
N
2 1
– 7.4 Billion people to feed; 21 Million newborns in the past year
– Increase arable land – Improve yield with technology
– Use knowledge of soil/regional data – Understand the uncertainty due to weather/climate – Around 34000 data points, 80 varieties
Li, Zhong, Lobell, Ermon: first prize out of 130 teams
– 100 dollars with probability 0.5, nothing with probability 0.5 – 50 dollars with probability 1
– Maximize expected yield minus variance – Maximize expected yield, subject to small variance – Maximize the yield that you can achieve with probability at least 95%
with application to spatial conservation planning
Daniel Sheldon, Bistra Dilkina, Adam Elmachtoub, Ryan Finseth, Ashish Sabharwal, Jon Conrad, Carla P. Gomes, David Shmoys Institute for Computational Sustainability Cornell University and Oregon State University Will Allen, Ole Amundsen, Buck Vaughan The Conservation Fund
support the recovery of the Red-Cockaded Woodpecker (RCW)?
! "# $%# "
Federally listed rare and endangered species
– small family groups – well-defined territories or patches – centered around cluster of cavity trees
– One for each bird – Live, old-growth pine (80+ years old) – 2-10 years to excavate – Extensively reused
conflict with modern land-use
– 30-year timber rotation – Development
– Habitat restoration and preservation – Artificial cavities
Given limited budget, what parcels should I conserve to maximize the expected number of occupied territories in 50 years?
! "# $%# "
Conserved parcels Available parcels Current territories Potential territories
– Territories are occupied or unoccupied in each time step – Two types of stochastic events:
Time 1 Time 2
Note: “activated” nodes are those reachable by red edges
i j k l m i j k l m i j k l m i j k l m i j k l m 1 2 3 4 5 Time: Initiall y
d territori es
Patches
i j k l m i j k l m i j k l m i j k l m i j k l m 1 2 3 4 5 Time: p(i,i) p(i,j)
Patches
i j k l m i j k l m i j k l m i j k l m i j k l m 1 2 3 4 5 Time:
Patches
i j k l m i j k l m i j k l m i j k l m i j k l m 1 2 3 4 5 Time:
Non-extinction Colonization Patches
i j k l m i j k l m i j k l m i j k l m i j k l m 1 2 3 4 5 Time:
Patches
i j k l m i j k l m i j k l m i j k l m i j k l m 1 2 3 4 5 Time:
Patches
i j k l m i j k l m i j k l m i j k l m i j k l m 1 2 3 4 5 Time:
Patches
i j k l m i j k l m i j k l m i j k l m i j k l m 1 2 3 4 5 Time:
Patches
i j k l m i j k l m i j k l m i j k l m i j k l m 1 2 3 4 5 Time:
Patches
i j k l m i j k l m i j k l m i j k l m i j k l m
1 2 3 4 5 Time: Patches
i j k l m i j k l m i j k l m i j k l m i j k l m
Key point: after simulation, occupied territories given by nodes that are reachable in the network by live edges Live edges Patches
i j k l m i j k l m i j k l m i j k l m i j k l m
Target nodes: territories at final time step Patches
Parcel 1 Parcel 2 Initial network
Parcel 1 Parcel 2 Initial network
Parcel 1 Parcel 2 Initial network
Given:
– Initially occupied territories – Colonization and extinction probabilities
– Already-conserved parcels – List of available parcels and their costs
Find set of parcels with total cost at most B that maximizes the expected number of occupied territories at time T.
– Cannot even calculate objective function exactly (#P-hard)
???
...
management actions that works well on average
reachable target nodes
design problem 1 2 N
... Stochastic Deterministic
NP hard: solve by branch and bound (CPLEX) Node v reachable in cascade k? v only reachable if some containing parcel l is purchased non-source node v only reachable if some predecessor u is reachable Budget Purchase parcel l?
! "# $%# "
RCW are much more likely than long-range colonizations
– Greedy-uc – choose action that results in biggest immediate increase in objective [Kempe et al. 2003] – Greedy-cb – use ratio of benefit to cost [Leskovec et
! "# $%# "
M = 50, N = 10, Ntest = 500 Upper bound!
! "# $%# "
M = 50, N = 10, Ntest = 500 Upper bound!
! "# $%# "
Conservation Reservoir Initial population M = 50, N = 10, Ntest = 500 Upper bound!
! "# $%# "
Move the conservation reservoir so it is more remote.
Greedy Baselin e SAA Optimum (our approach) $150M $260M $320M Build
sources Path-building (goal-setting)