SLIDE 1

Software Effort Estimation as a Multi-objective Learning Problem

Leandro Minku (www.cs.bham.ac.uk/~minkull)

CERCIA, School of Computer Science, The University of Birmingham

January 31, 2013


SLIDES 2–3

Introduction – ML and ensembles

ML models for Software Effort Estimation (SEE) serve as decision-support tools.

Ensembles of learning machines have recently attracted the attention of the SEE community. Tailoring is necessary (choice of base learner or ensemble method).

  • E. Kocaguneli, T. Menzies and J. Keung. On the value of ensemble effort estimation. TSE, in press.
  • L. Minku and X. Yao. Ensembles and locality: insight on improving software effort estimation. IST, in press.

SLIDES 4–8

Introduction – diversity and performance measures

Base learners in ensembles should be diverse.

Different performance measures for evaluating SEE can behave differently: MMRE, PRED, LSD, MAE, etc.

Question: Can we use that to improve SEE?

  • L. Minku and X. Yao. Software effort estimation as a multi-objective learning problem. TOSEM (accepted).

1. How differently do these measures behave in SEE?
2. Can we use them to create good ensembles for SEE?
3. Can we emphasize a particular measure if we wish to?

SLIDE 9

SEE as a Multi-Objective Learning Problem

Learn models for SEE. Each performance measure is an objective to be optimised.

  • M. Harman and J. Clark. Metrics are fitness functions too. METRICS 2004.

A Multi-Objective Evolutionary Algorithm (MOEA) can be used to answer our research questions.

SLIDE 10

Multi-Objective Evolutionary Algorithms (MOEAs)

MOEAs are population-based optimisation algorithms.

Multiple, possibly conflicting objectives – dominance (for minimisation): x⁽¹⁾ dominates x⁽²⁾ iff f_i(x⁽¹⁾) ≤ f_i(x⁽²⁾) ∀i ∧ ∃i | f_i(x⁽¹⁾) < f_i(x⁽²⁾).

"Pareto solutions" – the nondominated solutions in the last generation, generally good at all objectives.

Solutions should be diverse, spread well over the objective space.
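As a minimal sketch (not the authors' implementation), dominance and Pareto-front extraction over minimised objective vectors could look like this:

```python
import numpy as np

def dominates(f1, f2):
    """True iff solution 1 dominates solution 2 (all objectives minimised):
    no worse on every objective and strictly better on at least one."""
    f1, f2 = np.asarray(f1), np.asarray(f2)
    return bool(np.all(f1 <= f2) and np.any(f1 < f2))

def pareto_front(objectives):
    """Return the indices of the nondominated solutions in a population,
    given one vector of objective values per solution."""
    return [i for i, fi in enumerate(objectives)
            if not any(dominates(fj, fi)
                       for j, fj in enumerate(objectives) if j != i)]
```

For example, pareto_front([[1, 2], [2, 1], [2, 2]]) returns [0, 1]; the first two solutions trade off against each other, while the third is dominated by both.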

SLIDE 11

Using MOEAs for Creating SEE Models

Performance measures used as objectives when creating the models:

Mean Magnitude of the Relative Error:

  MMRE = (1/T) Σ_{i=1}^{T} MRE_i,

where MRE_i = |ŷ_i − y_i| / y_i; ŷ_i is the predicted effort; and y_i is the actual effort.

Percentage of estimations within 25% of the actual values:

  PRED(25) = (1/T) Σ_{i=1}^{T} [1 if MRE_i ≤ 25/100; 0 otherwise].

Logarithmic Standard Deviation:

  LSD = √( Σ_{i=1}^{T} (e_i + s²/2)² / (T − 1) ),

where e_i = ln y_i − ln ŷ_i and s² is an estimator of the variance of the residuals e_i.
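A minimal sketch of these measures as Python functions (assuming numpy arrays of actual efforts y_true and predicted efforts y_pred; not the paper's code):

```python
import numpy as np

def mmre(y_true, y_pred):
    """Mean Magnitude of the Relative Error (lower is better)."""
    mre = np.abs(y_pred - y_true) / y_true
    return mre.mean()

def pred25(y_true, y_pred):
    """Fraction of estimates whose MRE is at most 0.25 (higher is better)."""
    mre = np.abs(y_pred - y_true) / y_true
    return (mre <= 0.25).mean()

def lsd(y_true, y_pred):
    """Logarithmic Standard Deviation (lower is better). Requires strictly
    positive estimates, which is why models that can output negative
    estimations do poorly on it (see the comparison slides later)."""
    e = np.log(y_true) - np.log(y_pred)
    s2 = e.var(ddof=1)  # estimator of the variance of the residuals e_i
    return np.sqrt(np.sum((e + s2 / 2) ** 2) / (len(e) - 1))
```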

SLIDE 12

Using MOEAs for Creating SEE Models

MOEA: Harmonic Distance MOEA. Objectives/performance: calculated on the training set.

SEE models: Multi-Layer Perceptrons (MLPs). Representation: vector of real values (weights and thresholds).

Crossover: w_c = w_p1 + N(0, σ²) · (w_p2 − w_p3)

Self-tuning crossover: σ² = 2 − 1 / (1 + e^(anneal_time − generation))

Mutation: w_i = w_i + N(0, 0.1)

Optional: training with Backpropagation.

  • Z. Wang, K. Tang and X. Yao. Multi-objective approaches to optimal testing resource allocation in modular software systems. TR, 2010.
  • A. Chandra and X. Yao. Ensemble learning using multi-objective evolutionary algorithms. JMMA, 2006.
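A minimal sketch of these variation operators, with assumptions the slide leaves open (weights as numpy arrays; the N(0, σ²) factor drawn per weight rather than once; 0.1 read as the mutation variance):

```python
import numpy as np

rng = np.random.default_rng()

def crossover(w_p1, w_p2, w_p3, generation, anneal_time):
    """Differential crossover with the self-tuning variance from the slide:
    sigma^2 = 2 - 1 / (1 + exp(anneal_time - generation)).
    The variance anneals from about 2 (early) towards 1 (late)."""
    sigma2 = 2.0 - 1.0 / (1.0 + np.exp(anneal_time - generation))
    noise = rng.normal(0.0, np.sqrt(sigma2), size=w_p1.shape)
    return w_p1 + noise * (w_p2 - w_p3)

def mutate(w):
    """Gaussian mutation w_i = w_i + N(0, 0.1), reading 0.1 as the variance."""
    return w + rng.normal(0.0, np.sqrt(0.1), size=w.shape)
```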

SLIDE 13

Using MOEAs for Creating SEE Models

Two different ways to use the solutions:

1. Ensemble of "best fit" Pareto solutions: the ensemble SEE is the average SEE of the base models. Gives a good trade-off among the measures (see the sketch below).

2. Use one best fit Pareto solution (to emphasise a particular measure).

SLIDES 14–18

Experiments

Data sets: cocomo81, nasa93, nasa, cocomo2, desharnais, and 7 ISBSG organization-type subsets.

The ISBSG subsets' productivity rates are statistically different. Attributes: COCOMO attributes + LOC for the PROMISE data; functional size, development type and language type for ISBSG.

30 runs for each data set, testing on a holdout set of 10 projects.

Performance measures for evaluation on the test set: MMRE, PRED(25), LSD, MdMRE, MAE, MdAE.

Effect size: |M_a − M_p| / SD_p

  • M. Shepperd and S. MacDonell. Evaluating prediction systems in software project estimation. IST 2012.
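A minimal sketch of this statistic, assuming M_a and M_p are mean performance values (e.g. MAE) of a compared approach and of the Pareto ensemble over the 30 runs, and SD_p is the Pareto ensemble's standard deviation:

```python
import numpy as np

def effect_size(measure_a, measure_p):
    """Standardised effect size |M_a - M_p| / SD_p from the slide, where
    measure_a / measure_p hold one performance value per run for the
    compared approach (a) and the Pareto ensemble (p)."""
    return abs(np.mean(measure_a) - np.mean(measure_p)) / np.std(measure_p, ddof=1)
```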

Comparing approaches: MLP, RBF; REPTree, Bagging+MLP, Bagging+REPTree, log + EBA; Bagging+RBF, Rand+MLP, NCL+MLP.

SLIDE 19

Research Question 1

Question 1: How differently do the performance measures behave in SEE? (Are they different enough to be used as a source of diversity in ensembles?) Measures considered: MMRE, PRED(25), LSD.

SLIDE 20

The Relationship Among Different Performance Measures

Example of Pareto solutions for Cocomo81: the measures behave more differently than one might at first have thought. Choosing among them may still not be easy, so we propose our ensemble approach, which automatically provides a good trade-off among the measures.

SLIDE 21

Research Question 2

Question 2: Can we use different performance measures to create good ensembles for SEE?

  • Can it improve an MLP on the performance measures used as objectives?
  • Can it improve on other approaches (mixed evaluation of MOEA and MLP)? And what about other performance measures?

SLIDE 22

Pareto Ensemble vs Backpropagation MLP

Results for large (> 60 projects) data sets:

            LSD      MMRE     PRED(25)
  Wins      6/8      5/8      7/8
  P-value   0.0000   0.0012   0.0003

Results for small (< 35 projects) data sets:

            LSD      MMRE     PRED(25)
  Wins      3/5      2/5      3/5
  P-value   0.1170   0.7166   0.0004


SLIDES 23–24

Research Question 2

Question 2: Can we use different performance measures to create good ensembles for SEE?

  • Can it improve an MLP on the performance measures used as objectives? Yes: similar or better performance was obtained across data sets on all objectives. It is worth considering the objectives explicitly.
  • Can it improve on other approaches (mixed evaluation of MOEA and MLP)? And what about other performance measures?

SLIDE 25

Comparison Against Other Approaches

Performance measures: LSD, MMRE, PRED(25), MdMRE, MAE, MdAE.

Friedman test: the models perform significantly differently across data sets (an example call is sketched below).

Top-half ranked approaches (except for LSD): Pareto ensemble, Bagging+MLP, log + EBA, RTs.

The Pareto ensemble and log + EBA have median ranking standard deviation. Models based on MLPs do not perform well on LSD – negative estimations. MOEAs could be used to evolve other types of model.
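A minimal sketch of such a test using scipy's friedmanchisquare; the numbers are made-up placeholders (one value per data set for three of the approaches), not the paper's results:

```python
from scipy.stats import friedmanchisquare

# Placeholder MAE values, one per data set, for three approaches
# (illustration only).
pareto_ens = [210.0, 95.5, 330.2, 47.8, 120.6]
bag_mlp    = [250.1, 90.3, 415.7, 52.4, 131.9]
rt         = [245.9, 110.2, 398.4, 60.1, 118.3]

stat, p = friedmanchisquare(pareto_ens, bag_mlp, rt)
print(f"Friedman chi-square = {stat:.2f}, p = {p:.4f}")  # small p => rankings differ
```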

SLIDE 26

Comparison Against Other Approaches

Best ranked approach for each data set:

Data set     LSD         MMRE        PRED(25)    MdMRE       MAE         MdAE
Cocomo81     RT          Bag+MLP     Bag+MLP     Bag+MLP     Bag+MLP     Bag+MLP
Sdr          RT          RT          Bag+RT      RT          RT          RBF
Nasa         Bag+RT      RT          Bag+MLP     Bag+MLP     Bag+RT      Bag+RT
Desharnais   Bag+RT      Bag+MLP     Pareto Ens  Pareto Ens  Pareto Ens  Pareto Ens
Nasa93       RT          RT          RT          RT          RT          RT
Org1         Bag+RBF     Pareto Ens  Pareto Ens  Pareto Ens  Pareto Ens  Pareto Ens
Org2         Bag+RT      Pareto Ens  Pareto Ens  Pareto Ens  Pareto Ens  Pareto Ens
Org3         Pareto Ens  Pareto Ens  Log+EBA     Log+EBA     Log+EBA     Log+EBA
Org4         Bag+RBF     Pareto Ens  RT          RT          Pareto Ens  Pareto Ens
Org5         Bag+RT      Log+EBA     Bag+RBF     Rand+MLP    Bag+RT      RT
Org6         Bag+RBF     Pareto Ens  Pareto Ens  Pareto Ens  Bag+RBF     Pareto Ens
Org7         Bag+RT      Log+EBA     Log+EBA     Log+EBA     Bag+RBF     Pareto Ens
OrgAll       RT          Pareto Ens  Pareto Ens  Pareto Ens  Pareto Ens  Pareto Ens

The Pareto ensemble was ranked first more often for the ISBSG data sets. Possible reason: the MOEA performs global optimisation, and more heterogeneous data sets may present several peaks.

SLIDE 27

Comparison Against Other Approaches

Number of times ranked best:

Approach     LSD  MMRE  PRED(25)  MdMRE  MAE  MdAE
Pareto Ens    1    6     5         5      5    7
RT            4    3     2         3      2    2
Bag+RT        5    0     1         0      2    1
Bag+MLP       0    2     2         2      1    1
Log+EBA       0    2     2         2      1    1
Bag+RBF       3    0     1         0      2    0
Rand+MLP      0    0     0         1      0    0
RBF           0    0     0         0      0    1
Total        13   13    13        13     13   13

The Pareto ensemble is ranked first more often than the other approaches, except on LSD.

SLIDE 28

Comparison Against Other Approaches

Number of times ranked worst:

Approach     LSD  MMRE  PRED(25)  MdMRE  MAE  MdAE
Rand+MLP      2    1     1         2      1    1
RBF           1    2     4         3      4    3
NCL+MLP       8    4     3         4      5    5

The remaining approaches were rarely ranked worst: Bag+MLP once; MLP and RT twice each; Bag+RT three times; the Pareto ensemble six times in total (1, 2, 1, 1, 1 over five measures); Bag+RBF ten times (3, 3, 2, 2 over four measures). The Pareto ensemble is never ranked worst more than twice on any measure.

SLIDE 29

Comparison Against Other Approaches

Effect size against the Pareto ensemble in terms of MAE:

Approach    # Small  # Medium  # Large  # Medium+Large
Bag+MLP        7        4        2           6
Bag+RBF        6        3        4           7
Bag+RT         7        3        3           6
Log+EBA        7        2        4           6
MLP            7        3        3           6
NCL+MLP        5        3        5           8
Rand+MLP       7        3        3           6
RBF            6        2        5           7
RT             4        6        3           9

Choosing between the Pareto ensemble and another approach results in many medium or large effect sizes, representing a considerable practical impact.

SLIDE 30

Research Question 2

Question 2: Can we use different performance measures to create good ensembles for SEE?

  • Can it improve an MLP on the performance measures used as objectives?
  • Can it improve on other approaches (mixed evaluation of MOEA and MLP)? Yes: the Pareto ensemble was frequently ranked first and rarely ranked worst, has median stability, and is especially helpful for more heterogeneous data sets.
  • And what about other performance measures? The statistics show that the Pareto ensemble is competitive on all measures except LSD.

SLIDE 31

Research Question 3

Question 3: Can we emphasize a particular measure if we wish to?

Yes. Using the best fit Pareto solution in terms of a given performance measure provides similar or better performance in terms of that measure, but similar or worse performance in terms of the other measures. The work is robust to new findings.

SLIDE 32

Conclusions

We view the creation of SEE models as a multi-objective learning problem.

We showed to what extent different performance measures behave differently.

A Pareto ensemble of MLPs improved on traditional MLPs in terms of all objectives.

The Pareto ensemble of MLPs was competitive against other approaches.

It is also possible to emphasize a particular performance measure if desired.

SLIDE 33

Future Work

The Pareto ensemble did better for more heterogeneous data sets.

Recent results show that cross-company data can improve within-company SEE. When can we learn from other companies? When should we change our models?

  • L. Minku and X. Yao. Can Cross-company Data Improve Performance in Software Effort Estimation? PROMISE 2012.

MOEAs could also be used to create other types of base model than MLPs – can we improve by creating local models?

A further study of the choice of Pareto solutions to include in the ensemble showed that there is still room for improvement. Different MOEAs could be investigated.

  • L. Minku and X. Yao. Software effort estimation as a multi-objective learning problem. TOSEM (accepted). http://www.cs.bham.ac.uk/~minkull/publications
