Finite Horizon Dynamic Programming: Getting Value from Spending - PowerPoint PPT Presentation

Finite Horizon Dynamic Programming: Getting Value from Spending Symmetry J. Michael Steele University of Pennsylvania The Wharton School Department of Statistics Stochastic Processes and Applications, Buenos Aires, August 8, 2014 J. M. Steele (UPenn, Wharton) On-line Selection August 2015 1

Introduction Some History and Motivation Famous combinatorial problems with long mathematical history on sequences of n real numbers, or permutations of the integers 1 , . . . , n ◮ Erd˝ os and Szekeres (1935): monotone subsequences ◮ Fan Chung (1980): unimodal subsequences ◮ Euler (cf. Stanley, 2010): alternating permutations J. M. Steele (UPenn, Wharton) On-line Selection August 2015 2

Introduction Some History and Motivation 1 Famous combinatorial problems with long mathematical 0.9 history on sequences of n real numbers, or permutations 0.8 0.7 of the integers 1 , . . . , n 0.6 0.5 ◮ Erd˝ os and Szekeres (1935): monotone subsequences 0.4 0.3 ◮ Fan Chung (1980): unimodal subsequences 0.2 ◮ Euler (cf. Stanley, 2010): alternating permutations 0.1 0 0 10 20 30 40 50 60 70 80 90 100 1 Probabilistic version (full-information) 0.9 0.8 ◮ Longest monotone subsequences: Hammersley (1972), 0.7 Kingman (1973), Logan and Shepp (1977), Verˇ sik and 0.6 0.5 Kerov (1977), . . . 0.4 ◮ Longest Unimodal subsequences: Steele (1981) 0.3 0.2 ◮ Longest Alternating subsequences: Widom (2006), 0.1 0 Pemantle (cf. Stanley, 2007), Stanley (2008), Houdr´ e 0 10 20 30 40 50 60 70 80 90 100 and Restrepo (2010) 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 10 20 30 40 50 60 70 80 90 100 J. M. Steele (UPenn, Wharton) On-line Selection August 2015 2

Introduction Some History and Motivation 1 Famous combinatorial problems with long mathematical 0.9 history on sequences of n real numbers, or permutations 0.8 0.7 of the integers 1 , . . . , n 0.6 0.5 ◮ Erd˝ os and Szekeres (1935): monotone subsequences 0.4 0.3 ◮ Fan Chung (1980): unimodal subsequences 0.2 ◮ Euler (cf. Stanley, 2010): alternating permutations 0.1 0 0 10 20 30 40 50 60 70 80 90 100 1 Probabilistic version (full-information) 0.9 0.8 ◮ Longest monotone subsequences: Hammersley (1972), 0.7 Kingman (1973), Logan and Shepp (1977), Verˇ sik and 0.6 0.5 Kerov (1977), . . . 0.4 ◮ Longest Unimodal subsequences: Steele (1981) 0.3 0.2 ◮ Longest Alternating subsequences: Widom (2006), 0.1 0 Pemantle (cf. Stanley, 2007), Stanley (2008), Houdr´ e 0 10 20 30 40 50 60 70 80 90 100 and Restrepo (2010) 1 0.9 0.8 Now ... Study the sequential (on-line) version of these 0.7 problems 0.6 0.5 ◮ Objective: maximize the expected length (number of 0.4 selections) of monotone, unimodal and alternating 0.3 0.2 subsequences 0.1 0 0 10 20 30 40 50 60 70 80 90 100 J. M. Steele (UPenn, Wharton) On-line Selection August 2015 2

Introduction Full-information vs. on-line — Increasing n = 100 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 10 20 30 40 50 60 70 80 90 100 J. M. Steele (UPenn, Wharton) On-line Selection August 2015 3

Introduction Full-information vs. on-line — Increasing n = 100 I n = 15 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 10 20 30 40 50 60 70 80 90 100 J. M. Steele (UPenn, Wharton) On-line Selection August 2015 3

Introduction Full-information vs. on-line — Increasing I o n ( π ∗ n = 100 I n = 15 n ) = 14 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 10 20 30 40 50 60 70 80 90 100 J. M. Steele (UPenn, Wharton) On-line Selection August 2015 3

Introduction Full-information vs. on-line — Unimodal n = 100 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 10 20 30 40 50 60 70 80 90 100 J. M. Steele (UPenn, Wharton) On-line Selection August 2015 3

Introduction Full-information vs. on-line — Unimodal n = 100 U n = 22 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 10 20 30 40 50 60 70 80 90 100 J. M. Steele (UPenn, Wharton) On-line Selection August 2015 3

Introduction Full-information vs. on-line — Unimodal U o n ( π ∗ n = 100 U n = 22 n ) = 21 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 10 20 30 40 50 60 70 80 90 100 J. M. Steele (UPenn, Wharton) On-line Selection August 2015 3

Introduction Summary View of Means in Some On-Line Selection Problems How Much Better Does a “Prophet” Do Asymptotically? Full Information Real Time Info. Only Realized Bonus Increasing Unimodal Alternating J. M. Steele (UPenn, Wharton) On-line Selection August 2015 4

Introduction Summary View of Means in Some On-Line Selection Problems How Much Better Does a “Prophet” Do Asymptotically? Full Information Real Time Info. Only Realized Bonus √ 2 √ n Increasing 2 n 29% Unimodal Alternating J. M. Steele (UPenn, Wharton) On-line Selection August 2015 4

Introduction Summary View of Means in Some On-Line Selection Problems How Much Better Does a “Prophet” Do Asymptotically? Full Information Real Time Info. Only Realized Bonus √ 2 √ n Increasing 2 n 29% √ 2 √ n Unimodal 2 2 n 29% Alternating J. M. Steele (UPenn, Wharton) On-line Selection August 2015 4

Introduction Summary View of Means in Some On-Line Selection Problems How Much Better Does a “Prophet” Do Asymptotically? Full Information Real Time Info. Only Realized Bonus √ 2 √ n Increasing 2 n 29% √ 2 √ n Unimodal 2 2 n 29% √ Alternating 2 n / 3 (2 − 2) n 12% J. M. Steele (UPenn, Wharton) On-line Selection August 2015 4

Introduction Summary View of Means in Some On-Line Selection Problems How Much Better Does a “Prophet” Do Asymptotically? Full Information Real Time Info. Only Realized Bonus √ 2 √ n Increasing 2 n 29% √ 2 √ n Unimodal 2 2 n 29% √ Alternating 2 n / 3 (2 − 2) n 12% Question: Can one get more detailed information? J. M. Steele (UPenn, Wharton) On-line Selection August 2015 4

Introduction Summary View of Means in Some On-Line Selection Problems How Much Better Does a “Prophet” Do Asymptotically? Full Information Real Time Info. Only Realized Bonus √ 2 √ n Increasing 2 n 29% √ 2 √ n Unimodal 2 2 n 29% √ Alternating 2 n / 3 (2 − 2) n 12% Question: Can one get more detailed information? More precise asymptotics of the means? J. M. Steele (UPenn, Wharton) On-line Selection August 2015 4

Introduction Summary View of Means in Some On-Line Selection Problems How Much Better Does a “Prophet” Do Asymptotically? Full Information Real Time Info. Only Realized Bonus √ 2 √ n Increasing 2 n 29% √ 2 √ n Unimodal 2 2 n 29% √ Alternating 2 n / 3 (2 − 2) n 12% Question: Can one get more detailed information? More precise asymptotics of the means? Any second-order information, i.e. what about the variances? J. M. Steele (UPenn, Wharton) On-line Selection August 2015 4

Introduction Summary View of Means in Some On-Line Selection Problems How Much Better Does a “Prophet” Do Asymptotically? Full Information Real Time Info. Only Realized Bonus √ 2 √ n Increasing 2 n 29% √ 2 √ n Unimodal 2 2 n 29% √ Alternating 2 n / 3 (2 − 2) n 12% Question: Can one get more detailed information? More precise asymptotics of the means? Any second-order information, i.e. what about the variances? Is there hope for a CLT or other distributional result? J. M. Steele (UPenn, Wharton) On-line Selection August 2015 4

Introduction Summary View of Means in Some On-Line Selection Problems How Much Better Does a “Prophet” Do Asymptotically? Full Information Real Time Info. Only Realized Bonus √ 2 √ n Increasing 2 n 29% √ 2 √ n Unimodal 2 2 n 29% √ Alternating 2 n / 3 (2 − 2) n 12% Question: Can one get more detailed information? More precise asymptotics of the means? Any second-order information, i.e. what about the variances? Is there hope for a CLT or other distributional result? There is a CLT for the On-Line Alternating Subsequence Problem (briefly noted in next frame) There has much further work on the On-Line Selection of a Monotone Increasing Subsequence , the original motivating problem. This will get most of our attention. J. M. Steele (UPenn, Wharton) On-line Selection August 2015 4

Introduction CLT for Alternating Sequentially Selected Alternating Series — A CLT Theorem (Arlotto & Steele, AAP 2014) There is a constant σ > 0 such that √ A o n ( π ∗ n ) − n (2 − 2) ⇒ N (0 , 1) . n σ J. M. Steele (UPenn, Wharton) On-line Selection August 2015 5

Introduction CLT for Alternating Sequentially Selected Alternating Series — A CLT Theorem (Arlotto & Steele, AAP 2014) There is a constant σ > 0 such that √ A o n ( π ∗ n ) − n (2 − 2) ⇒ N (0 , 1) . n σ The Mysterious σ ? Its existence is proved but the value is not yet known. J. M. Steele (UPenn, Wharton) On-line Selection August 2015 5

Introduction CLT for Alternating Sequentially Selected Alternating Series — A CLT Theorem (Arlotto & Steele, AAP 2014) There is a constant σ > 0 such that √ A o n ( π ∗ n ) − n (2 − 2) ⇒ N (0 , 1) . n σ The Mysterious σ ? Its existence is proved but the value is not yet known. A Candidate σ ? Yes, but not yet in the bag. J. M. Steele (UPenn, Wharton) On-line Selection August 2015 5

Finite Horizon Dynamic Programming: Getting Value from Spending - PowerPoint PPT Presentation

Finite Horizon Dynamic Programming: Getting Value from Spending Symmetry J. Michael Steele University of Pennsylvania The Wharton School Department of Statistics Stochastic Processes and Applications, Buenos Aires, August 8, 2014 J. M. Steele

Lecture 2: Infinite Horizon and Indefinite Horizon MDPs B9140 Dynamic Programming &

Dynamic Programming Outline and Reading Matrix Chain-Product (5.3.1) Dynamic Programming:

Dynamic Programming Prof. Kuan-Ting Lai 2020/4/10 Dynamic Programming Dynamic Programming is

Supplementary Notes to Horizon II macro 1 Horizon II macro Intro CONTENTS Page

Welcome to Horizon Honors Its a Great Day to be an Eagle! School Structure 1 Organization

Horizon Horizon Steel Steel Steel Steel

? P12 2 Getting Started/Lab Programming Lab Programming Program of Requirements PRELIMINARY

Finite A to B implies |A| = |B| Cardinality for finite A, B finite-card .1 finite-card .2

CS 170 Section 6 Dynamic Programming Owen Jow | owenjow@berkeley.edu Agenda Dynamic

Dynamic Programming Kevin Zatloukal July 18, 2011 Motivation Dynamic programming deserves

ThorneConsulting.com W E L C O M E From Getting Noticed From Getting Noticed to Getting

respy A prototypical finite-horizon discrete choice dynamic programming model Janos Gabler

Finite Horizon Life-cycle Horizon Learning Erin Cottle Hunt Department of Economics Lafayette

Receding Horizon Control Mar a M. Seron September 2004 Centre for Complex Dynamic Systems

$TITLE: M10-2.GMS: Infinite horizon dynamic model, MPS/GE formulation $ONTEXT Converts infinite

Dynamic programming 1 Dynamic programming also solve a problem by combining the solutions to

ADDAM (Atmospheric Dispersion ADDAM (Atmospheric Dispersion and Dose Analysis Method) and Dose

Catch Me if You Can: Targeting At-risk Students to Stop the Academic Slide University of Central

Lao PDR R-PP: Comments by the TAP Nov. 1-3, 2010 FCPF Participants Committee Washington DC For

Investor Day May 15, 2017 Regulation G: Non-GAAP Measures and Reconciliation of Non-GAAP Measures

WORDINAIRE Visualize While You Memorize Multi Modal Learning Our mission at Wordinaire is to

Conservation of critically endangered White-bellied Heron and fostering community livelihood in

Post-fire Forest Regeneration in the California s National Forests Kevin Welch Graduate

Regression Based on Support Vector Classification Marcin Orchel AGH University of Science and

Finite Horizon Dynamic Programming: Getting Value from Spending - PowerPoint PPT Presentation

Finite Horizon Dynamic Programming: Getting Value from Spending Symmetry J. Michael Steele University of Pennsylvania The Wharton School Department of Statistics Stochastic Processes and Applications, Buenos Aires, August 8, 2014 J. M. Steele

Lecture 2: Infinite Horizon and Indefinite Horizon MDPs B9140 Dynamic Programming &amp;

Dynamic Programming Outline and Reading Matrix Chain-Product (5.3.1) Dynamic Programming:

Dynamic Programming Prof. Kuan-Ting Lai 2020/4/10 Dynamic Programming Dynamic Programming is

Supplementary Notes to Horizon II macro 1 Horizon II macro Intro CONTENTS Page

Welcome to Horizon Honors Its a Great Day to be an Eagle! School Structure 1 Organization

Horizon Horizon Steel Steel Steel Steel

? P12 2 Getting Started/Lab Programming Lab Programming Program of Requirements PRELIMINARY

Finite A to B implies |A| = |B| Cardinality for finite A, B finite-card .1 finite-card .2

CS 170 Section 6 Dynamic Programming Owen Jow | owenjow@berkeley.edu Agenda Dynamic

Dynamic Programming Kevin Zatloukal July 18, 2011 Motivation Dynamic programming deserves

ThorneConsulting.com W E L C O M E From Getting Noticed From Getting Noticed to Getting

respy A prototypical finite-horizon discrete choice dynamic programming model Janos Gabler

Finite Horizon Life-cycle Horizon Learning Erin Cottle Hunt Department of Economics Lafayette

Receding Horizon Control Mar a M. Seron September 2004 Centre for Complex Dynamic Systems

$TITLE: M10-2.GMS: Infinite horizon dynamic model, MPS/GE formulation $ONTEXT Converts infinite

Dynamic programming 1 Dynamic programming also solve a problem by combining the solutions to

ADDAM (Atmospheric Dispersion ADDAM (Atmospheric Dispersion and Dose Analysis Method) and Dose

Catch Me if You Can: Targeting At-risk Students to Stop the Academic Slide University of Central

Lao PDR R-PP: Comments by the TAP Nov. 1-3, 2010 FCPF Participants Committee Washington DC For

Investor Day May 15, 2017 Regulation G: Non-GAAP Measures and Reconciliation of Non-GAAP Measures

WORDINAIRE Visualize While You Memorize Multi Modal Learning Our mission at Wordinaire is to

Conservation of critically endangered White-bellied Heron and fostering community livelihood in

Post-fire Forest Regeneration in the California s National Forests Kevin Welch Graduate

Regression Based on Support Vector Classification Marcin Orchel AGH University of Science and

Lecture 2: Infinite Horizon and Indefinite Horizon MDPs B9140 Dynamic Programming &