
NEW DEVELOPMENTS IN RANKING AND SELECTION: An Empirical Comparison of Three Main Approaches



  1. Overview
     NEW DEVELOPMENTS IN RANKING AND SELECTION: An Empirical Comparison of Three Main Approaches
     Jürgen Branke², Stephen E. Chick¹ (speaker), Christian Schmidt²
     ¹ Technology Management Area, INSEAD, Fontainebleau, France
     ² Institut AIFB, Universität Karlsruhe (TH), D-76128 Karlsruhe, Germany
     2005 Winter Simulation Conference

  2. Selecting the Best of a Finite Set
     - There is a plethora of ranking and selection approaches: indifference zone, VIP, OCBA, ETSS, ...
       Each approach has variations, parameters, and approximations leading to different allocation, stopping, and selection rules.
       Optimization applications are increasingly demanding of such procedures.
     - Today: which sequential selection procedure is "best" (given independent, Gaussian samples with unknown means and variances)?
       - New procedures (stopping rules, allocations)
       - New measures and mechanisms to evaluate procedures
       - Summary of observations from what is believed to be the largest numerical experiment to date
       - Strengths and weaknesses of the leading procedures
     - See also "Selecting a Selection Procedure", Branke, Chick, and Schmidt (2005): more allocations, experiments, ...

  3. Outline
     1. What are Measures of a Good Procedure?
        Problem Formulation; Evidence for Correct Selection and New Stopping Rules; Procedures Tested
     2. Empirical Evaluation
        Empirical Figures of Merit; Numerical Test Bed; Implementation
     3. Summary of Qualitative Conclusions
        Stopping Rules; Allocations; General Comments
     4. General Summary
        Which procedure to use? Discussion (time permitting)

  4. What are measures of a good procedure?
     - Utopia: always find the true best with zero effort. Fact: variability implies incorrect selections or infinite work.
     - Theoretical properties: derivations are preferred to ad hoc approximations, but reasonable people may choose different assumptions.
     - Empirical properties:
       - Efficiency: mean evidence for correct selection as a function of the mean number of samples
       - Controllability: ease of setting parameters to achieve a targeted evidence level
       - Robustness: dependence of a procedure's effectiveness on the underlying problem characteristics
       - Sensitivity: effect of parameters on the mean number of samples

  5. Problem formulation
     - Identify the best of k systems (biggest mean). Let X_ij be the output of the j-th replication of the i-th system:
       { X_ij : j = 1, 2, ... } i.i.d. ∼ Normal(w_i, σ_i²), for system i = 1, ..., k.
     - True (unknown) order of means: w_[1] ≤ w_[2] ≤ ... ≤ w_[k]. Configuration: χ = (w, σ²).
     - Sample statistics: x̄_i and σ̂_i² are updated from the n_i observations seen so far. Order statistics: x̄_(1) ≤ x̄_(2) ≤ ... ≤ x̄_(k).
     - If system (k) is selected, then { w_(k) = w_[k] } is the correct selection event.
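
A minimal bookkeeping sketch (mine, not the authors') for the sample statistics just described: keep the raw outputs per system, recompute x̄_i, σ̂_i², and n_i on demand, and let (k) denote the system whose sample mean is currently largest.

```python
import numpy as np

def summarize(data):
    """data[i] is the list of outputs seen so far for system i."""
    xbar = np.array([np.mean(d) for d in data])
    s2 = np.array([np.var(d, ddof=1) for d in data])   # unbiased sample variances
    n = np.array([len(d) for d in data])
    return xbar, s2, n

def sample_best(xbar):
    """Index (k): the system with the largest sample mean (the one selected if sampling stops now)."""
    return int(np.argmax(xbar))
```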

  6. Evidence for Correct Selection
     - Loss function if system D is chosen when the means are w:
       Zero-one: L_{0-1}(D, w) = 1{ w_D ≠ w_[k] }
       Expected opportunity cost (EOC): L_oc(D, w) = w_[k] − w_D
     - Frequentist measures (distribution of D = f(X)):
       PCS_IZ(χ) = 1 − E[ L_{0-1}(D, w) | χ ]
       EOC_IZ(χ) = E[ L_oc(D, w) | χ ]
     - Bayesian measures (given all output E, the decision D, and the posterior of W):
       PCS_Bayes = 1 − E[ L_{0-1}(D, W) | E ]
       EOC_Bayes = E[ L_oc(D, W) | E ]
     - Similarly for PGS_δ*, for "good" selections (within δ* of the best).
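
A hedged sketch (not from the slides) of estimating the frequentist measures by macro-replication: run a selection procedure many times on a known configuration χ = (w, σ²) and average the zero-one and opportunity-cost losses of the selected system D. The `procedure(w, sigma2, rng)` interface is hypothetical.

```python
import numpy as np

def loss_01(D, w):
    return 1.0 if w[D] != np.max(w) else 0.0

def loss_oc(D, w):
    return np.max(w) - w[D]

def estimate_iz_measures(procedure, w, sigma2, reps=10_000, rng=None):
    """procedure(w, sigma2, rng) -> index of the selected system (hypothetical interface)."""
    rng = rng or np.random.default_rng(0)
    pics = eoc = 0.0
    for _ in range(reps):
        D = procedure(w, sigma2, rng)
        pics += loss_01(D, w)
        eoc += loss_oc(D, w)
    return 1.0 - pics / reps, eoc / reps   # estimates of (PCS_IZ, EOC_IZ)
```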

  7. Bayesian Evidence and Stopping Rules
     Bounds (approximate) for the Bayesian measures:
     - Normalized distance: d*_jk = d_(j)(k) λ_jk^{1/2}, where d_(j)(k) = x̄_(k) − x̄_(j) and λ_jk^{-1} = σ̂_(j)²/n_(j) + σ̂_(k)²/n_(k).
     - PCS_Bayes ≥ ∏_{j:(j)≠(k)} Pr( W_(k) > W_(j) | E )   (Slepian)
                 ≈ ∏_{j:(j)≠(k)} Φ_{ν_(j)(k)}( d*_jk ) = PCS_Slep   (Welch; ν_(j)(k) is the Welch approximation to the degrees of freedom)
     - EOC_Bonf = Σ_{j:(j)≠(k)} λ_jk^{-1/2} Ψ_{ν_(j)(k)}[ d*_jk ]   ("newsvendor" loss; Ψ_ν[s] = E[(T_ν − s)⁺] for a standard-t variable T_ν)
     - PGS_Slep,δ* = ∏_{j:(j)≠(k)} Φ_{ν_(j)(k)}( λ_jk^{1/2} (δ* + d_(j)(k)) )
     - PCS_Slep,δ* = ∏_{j:(j)≠(k)} Φ_{ν_(j)(k)}( λ_jk^{1/2} max{ δ*, d_(j)(k) } )   (Chen and Kelton 2005)
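
A sketch, based on my own reading of the formulas above rather than the authors' code, of computing PCS_Slep, PGS_Slep,δ*, and EOC_Bonf from sample means `xbar`, sample variances `s2`, and sample sizes `n`, with (k) the current sample best. It uses the standard-t "newsvendor" loss Ψ_ν[s] = (ν + s²)/(ν − 1)·f_ν(s) − s·(1 − F_ν(s)).

```python
import numpy as np
from scipy.stats import t as student_t

def welch_nu(s2j, nj, s2k, nk):
    """Welch approximation to the degrees of freedom for the pairwise comparison."""
    a, b = s2j / nj, s2k / nk
    return (a + b) ** 2 / (a ** 2 / (nj - 1) + b ** 2 / (nk - 1))

def psi(nu, s):
    """Standard-t loss E[(T_nu - s)^+]."""
    return (nu + s ** 2) / (nu - 1) * student_t.pdf(s, nu) - s * student_t.sf(s, nu)

def slepian_bounds(xbar, s2, n, delta_star=0.0):
    k = int(np.argmax(xbar))                      # index of the sample best, (k)
    pcs = pgs = 1.0
    eoc = 0.0
    for j in range(len(xbar)):
        if j == k:
            continue
        lam_inv = s2[j] / n[j] + s2[k] / n[k]
        lam = 1.0 / lam_inv
        d = xbar[k] - xbar[j]
        nu = welch_nu(s2[j], n[j], s2[k], n[k])
        pcs *= student_t.cdf(np.sqrt(lam) * d, nu)                  # PCS_Slep term
        pgs *= student_t.cdf(np.sqrt(lam) * (delta_star + d), nu)   # PGS_Slep,delta* term
        eoc += np.sqrt(lam_inv) * psi(nu, np.sqrt(lam) * d)         # EOC_Bonf term
    return pcs, pgs, eoc
```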

  8. Bayesian Evidence and Stopping Rules (continued)
     New "adaptive" stopping rules provide flexibility:
     1. Sequential (S): repeat sampling while Σ_{i=1}^k n_i < B, for a given total budget B. [Default for most previous VIP and all OCBA work]
     2. Repeat if PCS_Slep,δ* < 1 − α*, for given δ*, α*.
     3. Repeat if PGS_Slep,δ* < 1 − α*, for given δ*, α*.
     4. Repeat if EOC_Bonf > β*, for an EOC target β*.
     We write PCS_Slep for PCS_Slep,0.
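
A sketch of the adaptive EOC stopping rule (rule 4 above), assuming the `slepian_bounds` helper sketched earlier is in scope. The `sample(i)` and `allocate(...)` interfaces are hypothetical placeholders; the allocation step is left abstract because it differs per procedure (OCBA, VIP, equal, ...).

```python
import numpy as np

def sequential_selection(sample, k, n0, beta_star, allocate, max_total=10_000):
    """sample(i) draws one observation from system i; allocate(...) picks the next system."""
    data = [[sample(i) for _ in range(n0)] for i in range(k)]   # n0 first-stage samples each
    while True:
        xbar = np.array([np.mean(d) for d in data])
        s2 = np.array([np.var(d, ddof=1) for d in data])
        n = np.array([len(d) for d in data])
        _, _, eoc = slepian_bounds(xbar, s2, n)
        if eoc <= beta_star or n.sum() >= max_total:   # EOC target met, or safety cap reached
            return int(np.argmax(xbar))                # select the sample best
        i = allocate(xbar, s2, n)                      # which system to sample next
        data[i].append(sample(i))
```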

  9. State-of-the-Art and New Procedures Tested
     - Indifference-zone (IZ): KN++ (Kim and Nelson 2001)
     - OCBA allocations, with all stopping rules (see the allocation sketch below):
       - Usual OCBA allocation (Chen 1996; PCS_Slep objective)
       - OCBA_LL for the EOC_Bonf objective (He, Chick, and Chen 2005)
       - OCBA_δ*: like OCBA, but with a PGS_δ* allocation
       - OCBA_max,δ*: like OCBA, with max replacing + in the PGS_δ* allocation (cf. Chen and Kelton 2005)
     - VIP allocations (Chick and Inoue 2001), with all stopping rules:
       - Sequential LL allocation (for the EOC_Bonf objective)
       - Sequential 0-1 allocation (for the PCS_Bonf objective)
     - Equal allocation, with all stopping rules
     Names: Allocation(stopping rule), e.g. LL(EOC_Bonf).
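
For concreteness, a sketch of the usual OCBA allocation ratios, my own paraphrase of the rule rather than the authors' implementation: given sample means, sample variances, and the current sample best b, compute target sampling fractions; a sequential procedure then samples whichever system is furthest below its target.

```python
import numpy as np

def ocba_fractions(xbar, s2, eps=1e-12):
    """Target sampling fractions under the usual OCBA rule (assumes all s2 > 0)."""
    b = int(np.argmax(xbar))
    delta = xbar[b] - xbar                 # gaps to the sample best
    delta[b] = np.inf                      # the best gets its own formula below
    ratio = s2 / np.maximum(delta, eps) ** 2          # N_i proportional to (sigma_i / delta_{b,i})^2
    ratio[b] = np.sqrt(s2[b] * np.sum(ratio ** 2 / s2))   # N_b = sigma_b * sqrt(sum_{i != b} (N_i / sigma_i)^2)
    return ratio / ratio.sum()
```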

  10. Comparing Procedures
     - Theoretical evaluation is hard: the procedures have different objectives, and each makes approximations. The large-sample EVI-based LL allocation can be linked with the small-sample OCBA_LL.
     - Empirical measures of effectiveness:
       - The parameters of a procedure implicitly define efficiency curves, (E[N], log PICS_IZ) or (E[N], log EOC_IZ). "More efficient" procedures have lower efficiency curves.
       - Efficiency alone ignores how to set the parameter to achieve a desired target PICS_IZ or EOC_IZ.
       - Target curves relate the procedure's parameter to the desired target, (log α*, log PICS_IZ) or (log β*, log EOC_IZ). "Conservative" procedures lie below the diagonal; "controllable" procedures let you pick parameters to hit the desired target.
       - Robust: efficient and controllable over a range of configurations.
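
One way such curves could be traced, as a hypothetical driver rather than the paper's test harness: for each stopping parameter, run many macro-replications, record the mean total number of samples E[N] and the realized PICS_IZ, and sweep the parameter to obtain the efficiency curve; plotting the realized PICS_IZ against the target α* gives the target curve.

```python
import numpy as np

def efficiency_point(run_once, param, reps=1_000, rng=None):
    """run_once(param, rng) -> (selected_index, total_samples, true_best_index) (hypothetical interface)."""
    rng = rng or np.random.default_rng(1)
    totals, incorrect = [], 0
    for _ in range(reps):
        D, N, best = run_once(param, rng)
        totals.append(N)
        incorrect += (D != best)
    # max(incorrect, 1) guards log(0) when no incorrect selections were observed
    return np.mean(totals), np.log(max(incorrect, 1) / reps)   # (E[N], log PICS_IZ)
```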

  11. Configurations: Stylized
     - Slippage configuration (SC): all the worst systems are tied for second best.
       X_1j ∼ Normal(0, 2ρ/(1+ρ))
       X_ij ∼ Normal(−δ, 2/(1+ρ)) for i = 2, ..., k
       δ* = γδ.
       The best has the largest variance if ρ > 1; Var[X_1j − X_ij] is constant for all ρ; γ allows δ* to differ from the difference in means.
     - Monotone decreasing means (MDM): equally spaced means.
       X_ij ∼ Normal(−(i−1)δ, 2ρ^{2−i}/(1+ρ))
       δ* = γδ.
     - Tested hundreds of combinations of k ∈ {2, 5, 10, 20, 50}; ρ ∈ {0.125, 0.177, 0.25, 0.354, 0.5, 0.707, 1, 1.414, 2, 2.828, 4}; n_0 ∈ {4, 6, 10}; δ ∈ {0.25, 0.354, 0.5, 0.707, 1}; δ* ∈ {0.05, 0.1, ..., 0.6}.
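
A sketch of samplers for the two stylized configurations as written above; the parameter names δ (delta) and ρ (rho) follow the slide, while the function names and 0-based indexing are mine.

```python
import numpy as np

def sample_sc(i, delta, rho, rng):
    """Slippage configuration: system 0 is best, all others are tied delta below it."""
    if i == 0:
        return rng.normal(0.0, np.sqrt(2 * rho / (1 + rho)))
    return rng.normal(-delta, np.sqrt(2 / (1 + rho)))

def sample_mdm(i, delta, rho, rng):
    """Monotone decreasing means: equally spaced means, geometrically varying variances."""
    var = 2 * rho ** (2 - (i + 1)) / (1 + rho)   # i is 0-based here; the slide indexes systems from 1
    return rng.normal(-i * delta, np.sqrt(var))
```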

  12. Configurations: Randomized
     - SC and MDM are unlikely to be found in practice; a randomized problem may be more representative.
     - Randomized problem instances (RPI1): sample χ randomly from a conjugate prior:
       p(σ_i²) ∼ InvGamma(α, β)
       p(W_i | σ_i²) ∼ Normal(μ_0, σ_i²/η).
     - We set β = α − 1 > 0 to standardize the mean of the variances to 1.
     - Increasing η makes the means more similar (the OCBA and VIP derivations correspond to η → 0); increasing α reduces the variability of the variances.
     - Tested all combinations of k ∈ {2, 5, 10}; η ∈ {0.707, 1, 1.414, 2}; α ∈ {2.5, 100}. Also tested other RPI experiments.
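
A sketch of drawing one random problem instance from the conjugate prior above; `mu0` is the common prior mean (a free parameter here), and β = α − 1 standardizes E[σ_i²] = 1 as on the slide.

```python
import numpy as np

def sample_rpi(k, alpha, eta, mu0=0.0, rng=None):
    rng = rng or np.random.default_rng(2)
    beta = alpha - 1.0                                    # so E[sigma^2] = beta / (alpha - 1) = 1
    sigma2 = 1.0 / rng.gamma(alpha, 1.0 / beta, size=k)   # InvGamma(alpha, beta) draws via 1/Gamma
    w = rng.normal(mu0, np.sqrt(sigma2 / eta))            # means drawn given the variances
    return w, sigma2                                      # one configuration chi = (w, sigma^2)
```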

  13. Summary: Numerics
     - 20,000 combinations of allocation, stopping rule, and configuration; each generates an efficiency curve and a target curve.
     - Each curve estimated with at least 100,000 macro-replications of each allocation/stopping rule combination; CRN across configurations.
     - C++, the GNU Scientific Library for cdfs, and the Mersenne twister RNG (Matsumoto and Nishimura 1998; revised 2002 seeding).
     - FILIB++ (Lerch et al. 2001) for interval arithmetic (numerical stability for LL_1, 0-1_1, and sometimes OCBA).
     - Mixed cluster of up to 120 nodes: Linux 2.4 and Windows XP; Intel P4 and AMD Athlon; 2 to 3 GHz. Jobs distributed via the JOSCHKA system (Bonn et al. 2005).
