Recent Progresses on the Simplex Method
Yinyu Ye www.stanford.edu/~yyye
K.T. Li Professor of Engineering Stanford University and International Center of Management Science and Engineering Nanjing University
November 2013 Yinyu Ye
Recent Progresses on the Simplex Method Yinyu Ye - - PowerPoint PPT Presentation
Recent Progresses on the Simplex Method Yinyu Ye www.stanford.edu/~yyye K.T. Li Professor of Engineering Stanford University and International Center of Management Science and Engineering Nanjing University November 2013 Yinyu Ye Outlines
K.T. Li Professor of Engineering Stanford University and International Center of Management Science and Engineering Nanjing University
November 2013 Yinyu Ye
November 2013 Yinyu Ye
November 2013 Yinyu Ye
November 2013 Yinyu Ye
November 2013 Yinyu Ye
n 2 1 m n mn 2 m2 1 m1 2 n 2n 2 22 1 21 1 n 1n 2 12 1 11 n n 2 2 1 1
n d nd n n d d d d d d
≤ ≤ ≤
2 2 1 1 2 2 2 22 1 21 1 1 2 12 1 11 2 2 1 1
The feasible region is a polyhedron defined by n inequalities in d dimensions.
November 2013 Yinyu Ye
November 2013 Yinyu Ye
November 2013 Yinyu Ye
November 2013 Yinyu Ye
November 2013 Yinyu Ye
November 2013 Yinyu Ye
November 2013 Yinyu Ye
November 2013 Yinyu Ye
1 1 1 1 1 1/2 3/4 7/8 Chosen actions in Red
November 2013 Yinyu Ye
i T j j i i T j j i
∀ ∈ + = ∀ ∈ + =
γ γ
1
i T j j i m i i
∀ ∈ ∀ + ≤
=
γ
November 2013 Yinyu Ye
1 1
j j ij n j ij n i j j
∀ ≥ ∀ = −
= =
γ
November 2013 Yinyu Ye
Chosen actions in Red
November 2013 Yinyu Ye
Chosen actions in Red
November 2013 Yinyu Ye
Chosen actions in Red
November 2013 Yinyu Ye
November 2013 Yinyu Ye
November 2013 Yinyu Ye
2
November 2013 Yinyu Ye
than n times. More precisely, at any step of the pivot process, there exists a non-optimal action j that will never re-enter future policies or bases after pivot steps
eliminate from appearance in any future policies generated by the simplex or policy-iteration method.
vector at the current policy and the optimal reduced- cost vector to provide a lower and upper bound for a non-optimal action when the greedy rule is used.
2
γ γ − −
November 2013 Yinyu Ye
2
γ γ − −
November 2013 Yinyu Ye
November 2013 Yinyu Ye
i T j j j i i T j j j i
∀ ∈ + = ∀ ∈ + =
γ γ
1
i T j j j i m i i
∀ ∈ ∀ + ≤
=
γ
November 2013 Yinyu Ye
1 1
j j ij j n j ij n i j j
∀ ≥ ∀ = −
= =
γ
November 2013 Yinyu Ye
deterministic MDP even with uniform discounts.
solving MDP with uniform discounts.
pivot rule terminates in at most pivot steps when discount factors are uniform, or in at most pivot steps with non-uniform discounts.
factor of m.
method.
2 2 3
2 3 5
November 2013 Yinyu Ye
November 2013 Yinyu Ye
November 2013 Yinyu Ye
general non-degenerate and bounded LPs:
pivot steps, when the ratio of the minimum value over the maximum value, in all basic feasible solution entries, is bounded below by σ.
1 1
j i j n j ij n i j j
∀ ≥ ∀ =
= =
2
σ σ
November 2013 Yinyu Ye
unboundedness:
pivot steps, either finds an optimal basic feasible solution or detects the unboundedness.
1 1
j i j n j ij n i j j
∀ ≥ ∀ =
= =
2
σ σ
November 2013 Yinyu Ye
z*, and consider the “shadow” LP problem
1 1 1 1
j n n i j j i j n j ij n i j j
∀ ≥ = − ∀ =
+ = = =
November 2013 Yinyu Ye
the original LP, would generate the identical solution and reduced cost sequence as it is applied to the “shadow” LP in which Xn+1 remains a basic variable before detects unboundedness.
satisfy the σ property.
. , *, ; , s.t. min
1 1 1 1
j x z x x c i b x a x c
j n n i j j i j n j ij n i j j
∀ ≥ = − ∀ =
+ = = =
2
log
σ σ
m mn
. , ; , s.t. min
1 1
j x i b x a x c
j i j n j ij n i j j
∀ ≥ ∀ =
= =
November 2013 Yinyu Ye