A concept of multicriteria stratification: a definition and - PowerPoint PPT Presentation

A concept of multicriteria stratification: a definition and solution MIKHAIL ORLOV , , D E PA RT M E N T O F A P P L I E D M AT H E M AT I C S A N D I N F O R M AT I C S H S E BORIS MIRKIN I N T E R N AT I O N A L L A B O R AT O RY O F D E C I S I O N C H O I C E A N D A N A LY S I S ; D E PA R T M E N T O F A P P L I E D M AT H E M AT I C S A N D I N F O R M AT I C S

What is stratification? 2  Geology: “the arrangement of sedimentary rocks in distinct layers (strata)“;  Sociology: “the hierarchical structures of classes and statuses in any society”.

Stratification example. Food and housing prices 3 Aggregate criterion C=aH+bF : overall expensiveness; Housing and food prices (2007) Stra rata : : Values are normalized to range to 1. I cheap, II medium and III expensive. City ity Hous Ho using Foods Moscow 0.9749 0.7440 London 0.9479 0.7812 C Tokyo 1.0000 0.6764 I II III Copenhagen 0.5602 1.0000 New-York 0.9749 0.6446 0.4881 Peking 0.6924 Sydney 0.4967 0.5318 Vancouver 0.3318 0.4775 Johannesburg 0.2322 0.4483 Buenos-Aires 0.3412 0.4178

Preliminaries 4  𝑂 objects are evaluated by 𝑁 criteria to be maximized;  Criteria matrix 𝑌 = 𝑦 𝑗𝑗 , 𝑗 = 1, … , 𝑂 , 𝑘 = 1, … , 𝑁 ;  Strata are disjoint sets of objects 𝑇 = { 𝑇 1 , … , 𝑇 𝐿 };  Strata are indexed so that the more preferable, the smaller the index.

Problem 5  A set of 𝑂 objects, evaluated by 𝑁 criteria, should be assigned with an a aggregat ate c e criter erion W W and s split into to 𝑳 di disj sjoint o orde dered d subset sets ( (strata) a) so that W-values in the same group are as close to each other as possible.

Distinction between strata and clusters 6 Str Strata Clust usters

Proposed model for strata 7  If object 𝑦 𝑗 belongs to stratum 𝑇 𝑙 then: 𝑦 𝑗1 𝑥 1 + 𝑦 𝑗𝑗 𝑥 𝑗 + ⋯ + 𝑦 𝑗𝑗 𝑥 𝑗 = 𝑑 𝑙 + 𝑓 𝑗 Ag Aggregat ate e criter erion v value ue  𝑥 – vector of weights of criteria;  𝑑 𝑙 –center or level of 𝑙 -th stratum, 𝑑 𝑙 ∈ { 𝑑 1 , … , 𝑑 𝐿 };  𝑓 𝑗 - error to be minimized.

Strata in the cities example 8 𝑇 3 𝑇 𝑗 c 1 c 2 𝑇 1 c 3

Linear stratification criterion 9  The problem of stratification: 𝑗 𝐿 𝑂 𝑗 � � � 𝑦 𝑗𝑗 𝑥 − 𝑑 𝑙 𝑥 , 𝑑 , 𝑇 𝑛𝑗𝑛 𝑗 𝑙=1 𝑗𝑗𝑇 𝑙 𝑗=1 𝑗 � 𝑥 = 1, 𝑥 𝑗 ≥ 0 𝑗 𝑗=1

Related work 10  Weighted sum of criteria [Sun et al 2009], [Ng 2007; Ramanathan 2006];  Multicriteria rank aggregation [Aizerman, Aleskerov 1995; Mirkin 1979];  Multicriteria decision analysis, outranking [DeSmet, Montano, Guzman 2004], [Nemery, DeSmet 2005];

Why do we need stratification at all? 11  Expert opinion is often a scale with few grades. E. g. 3- graded: “Good”, “Medium” and “Bad”, or ABC grades;  Complete order of many items can be inconvenient to work with: choosing a university program according to some rating. What is the point to prefer 500-th item to 501-th out of a thousand?

Computational comparison: Data specification 12  A A model el f for gener erating s synthet etic d c data s a sets;  Two real d dataset asets;  Two t types es of c criter eria a normalizat ation:  st statistical ( (sc scaling t to ze zero m mean and u unity st std.)  stan andar ard ( (scal aling to to the r e range 0 e 0 to to 1 1).

13 (a) (b) (c) Synthetic d data set ets Examples of 3-strata (d) (e) (f) artificial datasets generated by our model. Parameters : (a),(b),(c) – orientation; (g) (h) (i) (d),(e),(f) – thickness; (g),(h),(i) – intensities; (j),(k),(l) – spread. (j) (k) (l)

Real dataset 1 14  Bibliometric indexes for 118 scientific journals in Artificial Intelligence, 2012 [ from SCImago Journal & Country Ranking Database ]: - Index SJR (Scientific Journal Ranking); - Hirsch index (number of documents that received at least h citations); - Impact-factor.

Real dataset 2 15  Bibliomet etric i c indexes es of 102 c count ntries at 2 2012, in n Ar Artifici cial al I Intel ellige gence: - Total number of documents published in 2012; - Number of citable documents published in in 2012; - Citations received in 2012 for documents published the same year; - Country self-citations in 2012; - Citation per document in 2012; - Country Hirsch index.

Methods under comparison 16  Algorithms for optimization the linear stratification criterion: -Evolutionary minimization [Mirkin, Orlov 2013]; -Quadratic programming [Orlov 2014].  Rankings partitioned using k-means: - Borda count; - Linear weight optimization [ Ramanathan (2006) ]; - Authority ranking [Sun et. Al 2009].  Pareto layers merged using agglomerative clustering: - Pareto stratification [Mirkin, Orlov 2013].

Evaluation criteria 17  On synthetic data. Stratification accuracy: 𝑏𝑑𝑑𝑏𝑏𝑏𝑑𝑏 = 𝑂 𝑑𝑑𝑑𝑑𝑑𝑑𝑑 𝑂  On real data. Coherence of obtained stratification with respect to stratifications over single criteria using Kemeny-Snell distance: 𝑂 1 𝑒 𝑆𝑇 = 2𝑂 ( 𝑂 − 1) � | 𝑆 𝑗𝑗 − 𝑇 𝑗𝑗 | 𝑗 , 𝑗=1 1, 𝑇 𝑦 𝑗 > 𝑇 ( 𝑦 𝑗 ) 0, 𝑇 𝑦 𝑗 = 𝑇 ( 𝑦 𝑗 ) 𝑇 𝑗𝑗 = � − 1, 𝑇 𝑦 𝑗 < 𝑇 ( 𝑦 𝑗 )

Experimental results on synthetic data 18  Accuracy of stratification with respect to the following data generation parameters:  data dimensionality,  number of objects,  strata “intensities”,  “spread”,  “thickness”.  In most cases our quadratic programming based algorithm LSQ demonstrated the best accuracy.

Real data set 1 (3 strata) 19  In the first stratum: 1. IEEE Transactions on Pattern Analysis and Machine Intelligence (United States); 2. International Journal of Computer Vision (Netherland); 3. Foundations and Trends in Machine Learning (United States); 4. ACM Transactions on Intelligent Systems and Technology (United States); 5. IEEE Transactions on Evolutionary Computation (United States); 6. IEEE Transactions on Fuzzy Systems (United States).  Criteria weights: - Impact Factor: 0.47; - Scientific Journal Ranking (SJR): 0.38; - Hirsch Index: 0.05.

Real data set 2 (3 strata) 20  The first stratum consists of two countries: China, USA.  The second stratum, 17 countries: Spain, UK, France, Taiwan, Japan, India, Germany, Canada, Italy, South Korea, Australia, Hong-Kong, Netherlands, Singapore, Switzerland, and Israel.  The other 83 countries form the 3-rd strata.  Non zero weights: - Self-citation: 0.52; - Hirsch-index : 0.41; - Average citation number: 0.07.

Conclusion 21  The problem of multicriteria stratification is formalized as an optimization task to minimize the thickness of strata;  Two algorithms are proposed;  A stratified synthetic data generating algorithm is proposed;  In most synthetic data cases our QP algorithm demonstrated superior performance;  Application of methods to real data leads to sensible results.

Future work 22  Avoiding trivial solutions: If some of criterion is k- valued then optimization task has a trivial minimum. Just assign weight 1 to this feature and get a solution;  Extensive experimental study of the developed and existing stratification methods on real world data sets;  Probabilistic formulation of strata model;  Choosing right number of strata;  Interpretation of stratification results .

A concept of multicriteria stratification: a definition and - PowerPoint PPT Presentation

A concept of multicriteria stratification: a definition and solution MIKHAIL ORLOV , , D E PA RT M E N T O F A P P L I E D M AT H E M AT I C S A N D I N F O R M AT I C S H S E BORIS MIRKIN I N T E R N AT I O N A L L A B O R AT O RY O F

Monomial Tropical Cones for Multicriteria Optimization Georg Loho joint work with Michael Joswig

Problem Definition Problem Definition Problem Definition Problem Definition Problem Definition

Network-based stratification of tumor mutations Matan Hofree Goal Tumor stratification: to

Lipschitz stratification in power-bounded o-minimal fields Yimu Yin (joint work with Immi

Unit 1: Topic and History Learning goals Unit 1 I. What is multicriteria optimization and

Superseding Traditional Indexes with Multicriteria Data Structures GIORGIO VINCIGUERRA PhD

Developing and Implementing a Risk Stratification Method in a Patient Centered Medical Home

Risk Stratification and Population Health Management Nancy Jaeckels Kamp PCMH Team PCMH

Stratification Criteria and Rewriting Techniques for Checking Chase Termination S. Greco, F .

The Power of Compromise Approximation in Multicriteria Optimization C. B using, Kai-Simon

Reference Point Methods and Approximation in Multicriteria Optimization C. B using, Kai-Simon

Compromise Solutions in Multicriteria Optimization Kai-Simon Goetzmann (Joint work with

HOSPITAL DEPARTMENT PERFORMANCE EVALUATION USING THE PROMETHEE MULTICRITERIA METHODOLOGY Prof.

Multicriteria 3D PET image segmentation Application to oncology and cancer lesion extraction

A multicriteria decision analysis as an innovative approach to managing Lyme disease Paris,

Multicriteria Optimization Some continuous and discrete dynamics Guillaume Garrigos Institut de

Decomposition Behavior in Aggregated Data Sets Sarah Berube Karl-Dieter Crisman Gordon College

BAYESIAN GLOBAL OPTIMIZATION Using Optimal Learning to Tune Deep Learning Pipelines Scott Clark

REMEDI3S-TLD: Reputation Metrics Design to Improve Intermediary Incentives for Security of TLDs A

Voting: Issues, Problems, and Systems, Continued 9 March 2012 Voting III 9 March 2012 1/1 Last

Micro, Small, and Medium Enterprises Using Analytic Network Process at PT Sarana Jatim Ventura

for Manned Space Exploration In Internship Christie Watters Crew and Thermal Systems Division

Regression Diagnostics and the Forward Search 3. A Single Multivariate Sample Anthony Atkinson,

Powerlink Customer & Consumer Panel 18 May 2017 Welcome and Introductions Agenda AER

Sambuz

Useful Links

Newsletter

Mail Us

A concept of multicriteria stratification: a definition and - PowerPoint PPT Presentation

A concept of multicriteria stratification: a definition and solution MIKHAIL ORLOV , , D E PA RT M E N T O F A P P L I E D M AT H E M AT I C S A N D I N F O R M AT I C S H S E BORIS MIRKIN I N T E R N AT I O N A L L A B O R AT O RY O F

Monomial Tropical Cones for Multicriteria Optimization Georg Loho joint work with Michael Joswig

Problem Definition Problem Definition Problem Definition Problem Definition Problem Definition

Network-based stratification of tumor mutations Matan Hofree Goal Tumor stratification: to

Lipschitz stratification in power-bounded o-minimal fields Yimu Yin (joint work with Immi

Unit 1: Topic and History Learning goals Unit 1 I. What is multicriteria optimization and

Superseding Traditional Indexes with Multicriteria Data Structures GIORGIO VINCIGUERRA PhD

Developing and Implementing a Risk Stratification Method in a Patient Centered Medical Home

Risk Stratification and Population Health Management Nancy Jaeckels Kamp PCMH Team PCMH

Stratification Criteria and Rewriting Techniques for Checking Chase Termination S. Greco, F .

The Power of Compromise Approximation in Multicriteria Optimization C. B using, Kai-Simon

Reference Point Methods and Approximation in Multicriteria Optimization C. B using, Kai-Simon

Compromise Solutions in Multicriteria Optimization Kai-Simon Goetzmann (Joint work with

HOSPITAL DEPARTMENT PERFORMANCE EVALUATION USING THE PROMETHEE MULTICRITERIA METHODOLOGY Prof.

Multicriteria 3D PET image segmentation Application to oncology and cancer lesion extraction

A multicriteria decision analysis as an innovative approach to managing Lyme disease Paris,

Multicriteria Optimization Some continuous and discrete dynamics Guillaume Garrigos Institut de

Decomposition Behavior in Aggregated Data Sets Sarah Berube Karl-Dieter Crisman Gordon College

BAYESIAN GLOBAL OPTIMIZATION Using Optimal Learning to Tune Deep Learning Pipelines Scott Clark

REMEDI3S-TLD: Reputation Metrics Design to Improve Intermediary Incentives for Security of TLDs A

Voting: Issues, Problems, and Systems, Continued 9 March 2012 Voting III 9 March 2012 1/1 Last

Micro, Small, and Medium Enterprises Using Analytic Network Process at PT Sarana Jatim Ventura

for Manned Space Exploration In Internship Christie Watters Crew and Thermal Systems Division

Regression Diagnostics and the Forward Search 3. A Single Multivariate Sample Anthony Atkinson,

Powerlink Customer &amp; Consumer Panel 18 May 2017 Welcome and Introductions Agenda AER

Sambuz

Useful Links

Newsletter

Mail Us

Powerlink Customer & Consumer Panel 18 May 2017 Welcome and Introductions Agenda AER