SLIDE 1

DM841 DISCRETE OPTIMIZATION Part 2 – Heuristics

Experimental Analysis

Marco Chiarandini

Department of Mathematics & Computer Science University of Southern Denmark

SLIDE 2

Outline

  • 1. Experimental Analysis
      Motivations and Goals
      Descriptive Statistics
          Performance Measures
          Sample Statistics
      Scenarios of Analysis
          A. Single-pass heuristics
          B. Asymptotic heuristics
      Guidelines for Presenting Data



SLIDE 7

Contents and Goals

Provide a view of issues in Experimental Algorithmics:

◮ Exploratory data analysis
◮ Presenting results in a concise way with graphs and tables
◮ Organizational issues and Experimental Design
◮ Basics of inferential statistics
◮ Sequential statistical testing: race, a methodology for tuning

The goal of Experimental Algorithmics is not only producing a sound analysis but also adding an important tool to the development of a good solver for a given problem. Experimental Algorithmics is an important part of the algorithm production cycle, which is referred to as Algorithm Engineering.

SLIDE 8

The Engineering Cycle

from http://www.algorithm-engineering.de/

SLIDE 9

Experimental Algorithmics

Mathematical Model (Algorithm) → Simulation Program → Experiment

In empirical studies we consider simulation programs, which are the implementation of a mathematical model (the algorithm).

[McGeoch, 1996]

SLIDE 10

Experimental Algorithmics

Goals

◮ Defining standard methodologies
◮ Comparing the relative performance of algorithms so as to identify the best ones for a given application
◮ Characterizing the behavior of algorithms
◮ Identifying algorithm separators, i.e., families of problem instances for which the performance differs
◮ Providing new insights in algorithm design

SLIDE 11

Fairness Principle

Fairness principle: being completely fair is perhaps impossible, but try to remove any possible bias:

◮ possibly all algorithms must be implemented in the same style, in the same language, and sharing common subprocedures and data structures
◮ the code must be optimized, e.g., using the best possible data structures
◮ running times must be comparable, e.g., by running experiments on the same computational environment (or redistributing them randomly)


SLIDE 13

Definitions

The most typical scenario considered in the analysis of search heuristics: asymptotic heuristics with a time/quality limit decided a priori. The algorithm A∞ is halted when time expires or a solution of a given quality is found.

Deterministic case: A∞ on π returns a solution of cost x. The performance of A∞ on π is the scalar y = x.

Randomized case: A∞ on π returns a solution of cost X, where X is a random variable. The performance of A∞ on π is the univariate Y = X.

[This is not the only relevant scenario: to be refined later]


SLIDE 15

Random Variables and Probability

Statistics deals with random (or stochastic) variables. A variable is called random if, prior to observation, its outcome cannot be predicted with certainty. The uncertainty is described by a probability distribution.

Discrete variables

◮ Probability distribution: pi = P[X = vi]
◮ Cumulative distribution function (CDF): F(v) = P[X ≤ v] = Σ_{vi ≤ v} pi
◮ Mean: µ = E[X] = Σ_i vi pi
◮ Variance: σ² = E[(X − µ)²] = Σ_i (vi − µ)² pi

Continuous variables

◮ Probability density function (pdf): f(v) = dF(v)/dv
◮ Cumulative distribution function (CDF): F(v) = ∫_{−∞}^{v} f(u) du
◮ Mean: µ = E[X] = ∫ x f(x) dx
◮ Variance: σ² = E[(X − µ)²] = ∫ (x − µ)² f(x) dx
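As a small illustration of the discrete-case formulas above, here is a sketch in Python (not part of the original slides, which use R; all names are ours):

```python
# Mean, variance, and CDF of a discrete distribution given as (value, probability) pairs.
def discrete_stats(values, probs):
    assert abs(sum(probs) - 1.0) < 1e-12            # probabilities must sum to 1
    mean = sum(v * p for v, p in zip(values, probs))
    var = sum((v - mean) ** 2 * p for v, p in zip(values, probs))
    def cdf(x):                                     # F(v) = P[X <= v]
        return sum(p for v, p in zip(values, probs) if v <= x)
    return mean, var, cdf

mean, var, cdf = discrete_stats([1, 2, 3], [0.2, 0.5, 0.3])
# mean = 2.1, var = 0.49, cdf(2) = 0.7
```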

SLIDE 18

Generalization

For each general problem Π (e.g., TSP, GCP) we denote by CΠ a set (or class) of instances and by π ∈ CΠ a single instance.

On a specific instance, the random variable Y that defines the performance measure of an algorithm is described by its probability distribution/density function Pr(Y = y | π).

It is often more interesting to generalize the performance on a class of instances CΠ, that is,

    Pr(Y = y, CΠ) = Σ_{π ∈ CΠ} Pr(Y = y | π) Pr(π)

SLIDE 19

Sampling

In experiments,

  • 1. we sample the population of instances and
  • 2. we sample the performance of the algorithm on each sampled instance

If on an instance π we run the algorithm r times, then we have r replicates of the performance measure Y, denoted Y1, . . . , Yr, which are independent and identically distributed (i.i.d.), i.e.

    Pr(y1, . . . , yr | π) = ∏_{j=1}^{r} Pr(yj | π)

    Pr(y1, . . . , yr) = Σ_{π ∈ CΠ} Pr(y1, . . . , yr | π) Pr(π)
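A toy simulation of this two-level sampling (a Python sketch, not from the slides; the per-instance "difficulty" model is invented purely for illustration):

```python
import random

def sample_performance(b, r, seed=0):
    """Sample b instances, then r i.i.d. runs of a randomized algorithm on each."""
    rng = random.Random(seed)
    table = {}
    for i in range(b):
        mu = rng.uniform(50, 100)                    # 1. sample an instance pi ~ Pr(pi)
        table[f"inst{i}"] = [rng.gauss(mu, 5.0)      # 2. sample Y_1..Y_r i.i.d. ~ Pr(y | pi)
                             for _ in range(r)]
    return table

runs = sample_performance(b=3, r=10)
```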


SLIDE 23

Instance Selection

In real-life applications a simulation of p(π) can be obtained from historical data. In simulation studies instances may be:

◮ real-world instances
◮ random variants of real-world instances
◮ online libraries
◮ randomly generated instances

They may be grouped in classes according to some features whose impact may be worth studying:

◮ type (for features that might impact performance)
◮ size (for scaling studies)
◮ hardness (focus on hard instances)
◮ application (e.g., CSP encodings of scheduling problems), ...

Within a class, instances are drawn with uniform probability p(π) = c.


SLIDE 26

Statistical Methods

The analysis of performance is based on finite-size sampled data. Statistics provides the methods and the mathematical basis to

◮ describe, by summarizing, the data (descriptive statistics)
◮ make inference on those data (inferential statistics)

Statistics helps to

◮ guarantee reproducibility
◮ make results reliable (are the observed results enough to justify the claims?)
◮ extract relevant results from large amounts of data

In the practical context of heuristic design and implementation (i.e., engineering), statistics helps to take correct design decisions with the least amount of experimentation.


SLIDE 28

Objectives of the Experiments

◮ Comparison: bigger/smaller, same/different; Algorithm Configuration, Component-Based Analysis
  Standard statistical methods: experimental designs, hypothesis testing and estimation

◮ Characterization: interpolation (fitting models to data) and extrapolation (building models of data, explaining phenomena)
  Standard statistical methods: linear and non-linear regression model fitting

[Figures: response densities and boxplots for five algorithms (comparison); run time versus size on uniform random graphs for p = 0, 0.1, 0.2, 0.5, 0.9 (characterization)]

SLIDE 29

Outline

  • 1. Experimental Analysis
      Motivations and Goals
      Descriptive Statistics
          Performance Measures
          Sample Statistics
      Scenarios of Analysis
      Guidelines for Presenting Data

SLIDE 30

Measures and Transformations

On a single instance

Design: several runs on one instance

            Algorithm 1   Algorithm 2   . . .   Algorithm k
    Run 1   X11           X21                   Xk1
    . . .   . . .         . . .                 . . .
    Run r   X1r           X2r                   Xkr


SLIDE 32

Measures and Transformations

On a single instance

Computational effort indicators

◮ number of elementary operations/algorithmic iterations (e.g., search steps, objective function evaluations, number of visited nodes in the search tree, consistency checks, etc.)
◮ total CPU time consumed by the process (sum of user and system times returned by getrusage)

Solution quality indicators

◮ value returned by the cost function
◮ error from optimum/reference value
◮ (optimality) gap: (UB − LB)/(LB + ǫ), or (UB − LB)/(UB + ǫ) in the maximization case; ǫ is an infinitesimal for the case LB = 0 but UB − LB ≠ 0
◮ ranks
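The gap indicator can be sketched as follows (an illustrative Python snippet, not from the slides; the eps argument plays the role of ǫ above):

```python
def optimality_gap(ub, lb, maximization=False, eps=1e-9):
    """(UB - LB)/(LB + eps) for minimization, (UB - LB)/(UB + eps) for maximization.
    eps is the infinitesimal guarding the case LB = 0 but UB - LB != 0."""
    denom = (ub if maximization else lb) + eps
    return (ub - lb) / denom

# e.g. an upper bound of 110 against a lower bound of 100: a gap of about 10%
```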

SLIDE 33

Measures and Transformations

On a class of instances

Design A: one run on various instances

                 Algorithm 1   Algorithm 2   . . .   Algorithm k
    Instance 1   X11           X12                   X1k
    . . .        . . .         . . .                 . . .
    Instance b   Xb1           Xb2                   Xbk

Design B: several runs on various instances

                 Algorithm 1          Algorithm 2          . . .   Algorithm k
    Instance 1   X111, . . . , X11r   X121, . . . , X12r           X1k1, . . . , X1kr
    Instance 2   X211, . . . , X21r   X221, . . . , X22r           X2k1, . . . , X2kr
    . . .        . . .                . . .                        . . .
    Instance b   Xb11, . . . , Xb1r   Xb21, . . . , Xb2r           Xbk1, . . . , Xbkr


SLIDE 35

Measures and Transformations

On a class of instances

Computational effort indicators

◮ no transformation if the interest is in studying scaling
◮ standardization if a fixed time limit is used
◮ geometric mean (used for a set of numbers whose values are meant to be multiplied together or are exponential in nature)
◮ otherwise, better to group the instances homogeneously

Solution quality indicators

Different instances imply different scales ⇒ need for an invariant measure. (However, many other measures can be taken, both on the algorithms and on the instances [McGeoch, 1996].)
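The geometric mean mentioned above averages run times in log space, so a 10x slowdown and a 10x speedup cancel out (illustrative Python sketch, not from the slides):

```python
import math

def geometric_mean(xs):
    # exp of the arithmetic mean of the logs; suited to multiplicative quantities
    return math.exp(sum(math.log(x) for x in xs) / len(xs))

# run times of 0.1s, 1s, 10s: arithmetic mean ~ 3.7s, geometric mean exactly 1s
```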

SLIDE 36

Measures and Transformations

On a class of instances (cont.)

Solution quality indicators

◮ Distance or error from a reference value (assume the minimization case):

    e1(x, π) = (x(π) − x̄(π)) / σ̂(π)                      standard score
    e2(x, π) = (x(π) − xopt(π)) / xopt(π)                 relative error
    e3(x, π) = (x(π) − xopt(π)) / (xworst(π) − xopt(π))   invariant [Zemel, 1981]

  with xopt an optimal value computed exactly or known by construction, or a surrogate value such as bounds or best known values

◮ Rank (no need for standardization, but loss of information)
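The three transformations can be sketched as follows (Python, minimization case, not from the slides; xs are the observed costs on one instance, xopt/xworst the reference values):

```python
import statistics

def errors(xs, xopt, xworst):
    mean, sd = statistics.mean(xs), statistics.stdev(xs)
    e1 = [(x - mean) / sd for x in xs]                 # standard score
    e2 = [(x - xopt) / xopt for x in xs]               # relative error
    e3 = [(x - xopt) / (xworst - xopt) for x in xs]    # invariant error [Zemel, 1981]
    return e1, e2, e3

e1, e2, e3 = errors([10, 12, 14], xopt=10, xworst=20)
# e1 = [-1.0, 0.0, 1.0]; e2 = [0.0, 0.2, 0.4]; e3 = [0.0, 0.2, 0.4]
```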


SLIDE 38

Sampling

◮ We work with samples (instances, solution quality) drawn from populations

[Diagram: a population P(x, θ) with parameter θ; a random sample X1, . . . , Xn; a statistical estimator θ̂ of θ]

SLIDE 39

Summary Measures

Measures to describe or characterize a population:

◮ measures of central tendency, location
◮ measures of dispersion

Such a quantity is

◮ a parameter if it refers to the population (Greek letters)
◮ a statistic if it is an estimate of a population parameter from the sample (Latin letters)

SLIDE 40

Measures of central tendency

◮ Arithmetic average (sample mean): X̄ = (Σ xi)/n
◮ Quantile: value above or below which lies a fractional part of the data (used in nonparametric statistics)
    ◮ Median: M = x_((n+1)/2)
    ◮ Quartiles: Q1 = x_((n+1)/4), Q3 = x_(3(n+1)/4)
    ◮ q-quantile: a fraction q of the data lies below it and 1 − q lies above
◮ Mode: value of relatively great concentration of data (unimodal vs. multimodal distributions)

SLIDE 41

Measures of dispersion

◮ Sample range: R = x_(n) − x_(1)
◮ Sample variance: s² = (1/(n − 1)) Σ (xi − X̄)²
◮ Standard deviation: s = √s²
◮ Inter-quartile range: IQR = Q3 − Q1
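Both groups of sample statistics on a toy data set (a Python sketch; the slides compute the R equivalents a few slides below):

```python
import statistics

xs = sorted([4.0, 8.0, 6.0, 5.0, 3.0, 7.0, 9.0, 2.0])
mean = statistics.mean(xs)                  # sample mean
med = statistics.median(xs)                 # median
s2 = statistics.variance(xs)                # 1/(n-1) * sum (x_i - mean)^2
s = statistics.stdev(xs)                    # standard deviation
rng = xs[-1] - xs[0]                        # sample range x_(n) - x_(1)
q1, _, q3 = statistics.quantiles(xs, n=4)   # quartiles (interpolation method-dependent)
iqr = q3 - q1                               # inter-quartile range
```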

SLIDE 42

[Figure: boxplot aligned with the probability density function of a Normal N(0,1) population (source: Wikipedia); see also http://informationandvisualization.de/blog/box-plot]

SLIDE 43

[Figure: histogram with density, empirical cumulative distribution function, and boxplot of the same sample, annotated with min, Q1, median, Q3, max, the IQR, the Q1 − 1.5·IQR whisker, outliers, and the average]

SLIDE 44

[Figure: density plots and boxplots of the number of colors obtained by algorithms TS1, TS2, and TS3]

SLIDE 45

In R

> x <- runif(10, 0, 1)
> mean(x); median(x); quantile(x); quantile(x, 0.25)
> range(x); var(x); sd(x); IQR(x)
> fivenum(x)   # (minimum, lower-hinge, median, upper-hinge, maximum)
[1] 0.18672 0.26682 0.28927 0.69359 0.92343
> summary(x)
> aggregate(x, list(factors), median)
> boxplot(x)

SLIDE 46

Outline

  • 1. Experimental Analysis
      Motivations and Goals
      Descriptive Statistics
      Scenarios of Analysis
          A. Single-pass heuristics
          B. Asymptotic heuristics
      Guidelines for Presenting Data

SLIDE 47

Scenarios

  • A. Single-pass heuristics
  • B. Asymptotic heuristics, with two approaches:
      1. Univariate
         1.a Time as an external parameter decided a priori
         1.b Solution quality as an external parameter decided a priori
      2. Cost dependent on running time

SLIDE 48

Scenario A: Single-pass heuristics

Deterministic case: A⊣ on class CΠ returns a solution of cost x with computational effort t (e.g., running time). The performance of A⊣ on class CΠ is the vector y = (x, t).

Randomized case: A⊣ on class CΠ returns a solution of cost X with computational effort T, where X and T are random variables. The performance of A⊣ on class CΠ is the bivariate Y = (X, T).


SLIDE 50

Example

Scenario:
⊲ 3 heuristics A⊣1, A⊣2, A⊣3 on class CΠ
⊲ homogeneous instances, or need for data transformation
⊲ 1 or r runs per instance

◮ Interest: inspecting solution cost and running time to observe and compare the level of approximation and the speed.

Tools:
◮ scatter plots of solution cost and run time

SLIDE 51

[Figure: scatter plot of run time versus solution cost for the DSATUR, RLF, and ROS heuristics]
SLIDE 52

Multi-Criteria Decision Making

Some definitions on dominance relations are needed. In the Pareto sense, for points in R²:

◮ x¹ weakly dominates x² iff x¹i ≤ x²i for all i = 1, . . . , n
◮ x¹ and x² are incomparable iff neither x¹ dominates x² nor x² dominates x¹

SLIDE 53

Scaling Analysis

[Figure: the curves y = eˣ, y = xᵉ, and y = log x plotted under the axis scalings log='', log='x', log='y', and log='xy']

Linear regression in log-log plots ⇒ polynomial growth
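The log-log observation can be turned into an estimate of the growth exponent (an illustrative Python sketch, not from the slides): fit a least-squares line to log(time) versus log(size); a slope of k suggests time ~ size^k.

```python
import math

def loglog_slope(sizes, times):
    """Least-squares slope of log(time) vs log(size)."""
    xs = [math.log(s) for s in sizes]
    ys = [math.log(t) for t in times]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx

# data generated as time = size^3 recovers an exponent of 3
k = loglog_slope([10, 20, 40, 80], [1e3, 8e3, 6.4e4, 5.12e5])
```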

SLIDE 54

Linear regression in log-log plots ⇒ polynomial growth

[Figure: log-log plots of run time versus instance size for RLF, DSATUR, ROS, and the other algorithms, one panel per algorithm]

SLIDE 55

Comparative visualization

[Figure: run time versus instance size for RLF, DSATUR, ROS, and the other algorithms, overlaid in a single log-log plot]

SLIDE 57

Scenario B: Asymptotic heuristics

There are two approaches.

1.a. Time as an external parameter decided a priori. The algorithm is halted when time expires.

Deterministic case: A∞ on class CΠ returns a solution of cost x. The performance of A∞ on class CΠ is the scalar y = x.

Randomized case: A∞ on class CΠ returns a solution of cost X, where X is a random variable. The performance of A∞ on class CΠ is the univariate Y = X.


SLIDE 59

Example

Scenario:
⊲ 3 heuristics A∞1, A∞2, A∞3 on class CΠ (or 3 heuristics on class CΠ without interest in computation time, because it is negligible or comparable)
⊲ homogeneous instances (no data transformation) or heterogeneous instances (data transformation)
⊲ 1 or r runs per instance
⊲ a priori time limit imposed

◮ Interest: inspecting solution cost.

Tools:
◮ histograms (summary measures: mean, median, or mode?)
◮ boxplots
◮ empirical cumulative distribution functions (ECDFs)

SLIDE 60

## load the data
> load("results.rda")
> levels(DATA$instance)
 [1] "queen4_4.txt"   "queen5_5.txt"   "queen6_6.txt"   "queen7_7.txt"
 [5] "queen8_8.txt"   "queen9_9.txt"   "queen10_10.txt" "queen11_11.txt"
 [9] "queen12_12.txt" "queen13_13.txt" "queen14_14.txt" "queen15_15.txt"
[13] "queen16_16.txt" "queen17_17.txt" "queen18_18.txt" "queen19_19.txt"
[17] "queen20_20.txt" "queen21_21.txt" "queen22_22.txt" "queen23_23.txt"
[21] "queen24_24.txt" "queen25_25.txt" "queen26_26.txt" "queen27_27.txt"
[25] "queen28_28.txt" "queen29_29.txt" "queen30_30.txt" "queen31_31.txt"
[29] "queen32_32.txt"
> bwplot(reorder(alg, col, median) ~ col, data=DATA)

[Figure: boxplots of the number of colors per algorithm, aggregated over the queens instances]
SLIDE 61

> bwplot(reorder(alg, col, median) ~ col | instance, data=DATA, as.table=TRUE)

[Figure: trellis of per-instance boxplots of the number of colors for each algorithm, one panel per queens instance from queen4_4.txt through queen32_32.txt]
SLIDE 62

[Figure: density plots and boxplots of the number of colors obtained by algorithms TS1, TS2, and TS3]

SLIDE 63

On a class of instances

[Figure: boxplots of the TS1, TS2, TS3 results under four transformations: standard error (x − x̄)/σ̂, relative error (x − xopt)/xopt, invariant error (x − xopt)/(xworst − xopt), and ranks]

SLIDE 64

On a class of instances

[Figure: ECDFs (proportion ≤ x) of the TS1, TS2, TS3 results under the same four transformations: standard error, relative error, invariant error, and ranks]

SLIDE 65

Stochastic Dominance

Definition: algorithm A1 probabilistically dominates algorithm A2 on a problem instance iff its CDF is always "below" that of A2, i.e., F1(x) ≤ F2(x), ∀x ∈ X.

[Figure: two examples of pairs of CDFs F(x)]
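The dominance condition can be checked empirically on two samples through their ECDFs (a Python sketch, not from the slides):

```python
import bisect

def ecdf(sample):
    """Return the empirical CDF of a sample as a function x -> F(x)."""
    s = sorted(sample)
    return lambda x: bisect.bisect_right(s, x) / len(s)

def dominates(sample1, sample2):
    """True iff F1(x) <= F2(x) at every observed value (probabilistic dominance)."""
    f1, f2 = ecdf(sample1), ecdf(sample2)
    return all(f1(x) <= f2(x) for x in sorted(set(sample1) | set(sample2)))
```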


SLIDE 67

R code behind the previous plots

We load the data and plot the comparative boxplot for each instance.

> load("TS.class-G.dataR")
> G[1:5,]
  alg                  inst run sol time.last.imp tot.iter parz.iter exit.iter exit.time opt
1 TS1 G-1000-0.5-30-1.1.col   1  59      9.900619     5955       442      5955  10.02463  30
2 TS1 G-1000-0.5-30-1.1.col   2  64      9.736608     3880       130      3958  10.00062  30
3 TS1 G-1000-0.5-30-1.1.col   3  64      9.908618     4877        49      4877  10.03263  30
4 TS1 G-1000-0.5-30-1.1.col   4  68      9.948622     6996       409      6996  10.07663  30
5 TS1 G-1000-0.5-30-1.1.col   5  63      9.912620     3986        52      3986  10.04063  30
> library(lattice)
> bwplot(alg ~ sol | inst, data=G)

If we want to make an aggregate analysis we have the following choices:

◮ maintain the raw data,
◮ transform the data into standard scores,
◮ transform the data into relative errors,
◮ transform the data into an invariant error,
◮ transform the data into ranks.
SLIDE 68

Maintain the raw data

> par(mfrow=c(3,2), las=1, font.main=1, mar=c(2,3,3,1))
> # original data
> boxplot(sol ~ alg, data=G, horizontal=TRUE, main="Original data")

SLIDE 69

Transform the data into standard scores

> # standard error
> T1 <- split(G$sol, list(G$inst))
> T2 <- lapply(T1, scale, center=TRUE, scale=TRUE)
> T3 <- unsplit(T2, list(G$inst))
> T4 <- split(T3, list(G$alg))
> T5 <- stack(T4)
> boxplot(values ~ ind, data=T5, horizontal=TRUE,
+   main=expression(paste("Standard error: ", frac(x - bar(x), sqrt(sigma)))))
> library(latticeExtra)
> ecdfplot(~values, groups=ind, data=T5,
+   main=expression(paste("Standard error: ", frac(x - bar(x), sqrt(sigma)))))
> # equivalent, writing the standardized values back into G
> G$scale <- 0
> split(G$scale, G$inst) <- lapply(split(G$sol, G$inst), scale, center=TRUE, scale=TRUE)

SLIDE 70

Transform the data into relative errors

> # relative error
> G$err2 <- (G$sol - G$opt)/G$opt
> boxplot(err2 ~ alg, data=G, horizontal=TRUE,
+   main=expression(paste("Relative error: ", frac(x - x^(opt), x^(opt)))))
> ecdfplot(G$err2, groups=G$alg,
+   main=expression(paste("Relative error: ", frac(x - x^(opt), x^(opt)))))

SLIDE 71

Transform the data into an invariant error

We use as surrogate of xworst the median solution returned by the simplest algorithm for graph coloring, the ROS heuristic.

> # error 3
> load("ROS.class-G.dataR")
> F1 <- aggregate(F$sol, list(inst=F$inst), median)
> F2 <- split(F1$x, list(F1$inst))
> G$ref <- sapply(G$inst, function(x) F2[[x]])
> G$err3 <- (G$sol - G$opt)/(G$ref - G$opt)
> boxplot(err3 ~ alg, data=G, horizontal=TRUE,
+   main=expression(paste("Invariant error: ", frac(x - x^(opt), x^(worst) - x^(opt)))))
> ecdfplot(G$err3, groups=G$alg,
+   main=expression(paste("Invariant error: ", frac(x - x^(opt), x^(worst) - x^(opt)))))

SLIDE 72

Transform the data into ranks

> # rank
> G$rank <- G$sol
> split(G$rank, G$inst) <- lapply(split(G$sol, G$inst), rank)
> bwplot(rank ~ reorder(alg, rank, median), data=G, horizontal=TRUE, main="Ranks")
> ecdfplot(~rank, groups=alg, data=G, main="Ranks")

SLIDE 73

> ## let's make the ranks of the colors
> T1 <- split(DATA["col"], DATA["instance"])
> T2 <- lapply(T1, rank, na.last="keep")
> T3 <- unsplit(T2, DATA["instance"])
> DATA$rank <- T3
> ## we plot the ranks for an aggregate analysis;
> ## reorder sorts the factor algorithm by median values
> bwplot(reorder(alg, rank, median) ~ rank, data=DATA)

[Figure: boxplots of ranks per algorithm, aggregated over the instances]


slide-75
SLIDE 75

Outline Experimental Analysis

Scenario B

Asymptotic heuristics There are two approaches: 1.b. Solution quality as an external parameter decided a priori. The algorithm is halted when quality is reached. Deterministic case: A∞ on class CΠ finds a solution in running time t. The performance of A∞ on class CΠ is the scalar y = t. Randomized case: A∞ on class CΠ finds a solution in running time T, where T is a random variable. The performance of A∞ on class CΠ is the univariate Y = T.

SLIDE 76

Dealing with Censored Data

Asymptotic heuristics, Approach 1.b

⊲ Heuristic A⊣ stopped before completion, or A∞ truncated (always the case in practice).
◮ Interest: determining whether a prefixed goal (optimal/feasible solution) has been reached.

The computational effort to attain the goal can be described by a cumulative distribution function F(t) = P(T ≤ t) with T in [0, ∞). If in run i we stop the algorithm at time Li, then we have Type I right censoring, that is, we know either

◮ Ti, if Ti ≤ Li,
◮ or only that Ti > Li.

Hence, for each run i we record min(Ti, Li) and the indicator variable for observed optimal/feasible solution attainment, δi = I(Ti ≤ Li).
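The bookkeeping above can be sketched as follows, here in Python (the slides use R); the run times and cutoffs are hypothetical:

```python
# Hypothetical data: true times-to-goal T_i and per-run time limits L_i.
T = [12.0, 45.0, 300.0, 7.5, 90.0]
L = [60.0] * 5

# For each run i, record min(T_i, L_i) and the indicator delta_i = I(T_i <= L_i).
obs = [(min(t, l), t <= l) for t, l in zip(T, L)]

def F_hat(obs, t):
    # Count only runs that provably reached the goal by time t; this estimates
    # F(t) = P(T <= t) and is reliable only for t up to the censoring times.
    return sum(1 for time, done in obs if done and time <= t) / len(obs)
```

Run 3 is censored: we only learn T3 > 60, so `obs` stores (60.0, False) for it and `F_hat` never counts it.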

SLIDE 77

Example

Asymptotic heuristics, Approach 1.b: Example

⊲ An exact vs. a heuristic algorithm for the 2-edge-connectivity augmentation problem.
◮ Interest: time to find the optimum on different instances.

[Figure: empirical CDFs of the time to find the optimum for the Heuristic and the Exact algorithm, logarithmic time axis from 10 to 2000]

Uncensored: F̂(t) = (# runs with T ≤ t) / n.
Censored: F̂(t) = (# uncensored runs with T ≤ t) / n, reliable only up to the censoring times.

SLIDE 78

Scenarios

  • A. Single-pass heuristics
  • B. Asymptotic heuristics. Two approaches:

    1. Univariate:
       1.a Time as an external parameter decided a priori
       1.b Solution quality as an external parameter decided a priori
    2. Cost dependent on running time

SLIDE 79

Scenario B

Asymptotic heuristics, approach 2: cost dependent on running time.

Deterministic case: A∞ on π returns a current best solution x at each observation time t1, . . . , tk. The performance of A∞ on π is the profile given by the vector

  • y = (x(t1), . . . , x(tk)).

Randomized case: A∞ on π produces a monotone stochastic process in solution cost X(t), with each element dependent on its predecessors. The performance of A∞ on π is the multivariate

  • Y = (X(t1), X(t2), . . . , X(tk)).
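Recording such a profile amounts to sampling the best-so-far cost at the fixed observation times t1, . . . , tk. A minimal sketch in Python (the slides use R; the improvement events below are hypothetical):

```python
# Hypothetical (time, cost) improvement events of one run of an asymptotic heuristic.
events = [(0.5, 120), (3.0, 100), (8.0, 95), (40.0, 90)]
checkpoints = [1, 5, 10, 50]   # observation times t1..tk

def profile(events, checkpoints):
    # Best-so-far cost x(t) at each checkpoint: monotone non-increasing by construction.
    best, out, i = float("inf"), [], 0
    for t in checkpoints:
        while i < len(events) and events[i][0] <= t:
            best = min(best, events[i][1])
            i += 1
        out.append(best)
    return out   # the vector y = (x(t1), ..., x(tk))
```

Repeating this over r runs gives the sampled multivariate data Yi used on the next slides.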

SLIDE 80

Example

Scenario:
⊲ 3 heuristics A∞1, A∞2, A∞3 on instance π.
⊲ Single instance, hence no data transformation.
⊲ r runs.
◮ Interest: inspecting solution cost over running time to determine whether the comparison varies over time intervals.

Tools:
◮ Quality profiles

SLIDE 82

The performance is described by multivariate random variables of the kind

  • Y = (Y(t1), Y(t2), . . . , Y(tk)).

Sampled data are of the form Yi = {Yi(t1), Yi(t2), . . . , Yi(tk)}, i = 1, . . . , 10 (10 runs per algorithm on one instance)

[Figure: cost (70–100) vs. time (200–1200) trajectories of the 10 runs, one panel for Novelty and one for Tabu Search]

SLIDE 83

[Figure: number of colors (70–100) at each of 24 observation occasions, one panel for Novelty and one for Tabu Search]

SLIDE 84

[Figure: median cost over time for Novelty and Tabu Search on the single instance]

The median behavior of the two algorithms
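The median profile is obtained by taking, at each observation time, the median cost across the r runs. A minimal sketch in Python (the slides use R; the three run profiles below are hypothetical):

```python
from statistics import median

# Hypothetical cost profiles: one row per run, one column per observation time t1..t4.
runs = [
    [120, 100, 95, 90],
    [110, 105, 96, 92],
    [130,  99, 94, 91],
]

# Median cost across runs at each observation time: the plotted median profile.
median_profile = [median(col) for col in zip(*runs)]
```

The same column-wise summary with quartiles instead of the median would yield the band around each curve.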

SLIDE 85

Summary

Visualize your data, both for your own analysis and for communication to others.

Explore your data:

◮ make plots: histograms, boxplots, empirical cumulative distribution functions, correlation/scatter plots
◮ look at the numerical data and interpret them in practical terms: computation times, distance from the optimum
◮ look for patterns

All the above both at the single-instance level and at the aggregate level.

SLIDE 86

Outline

  • 1. Experimental Analysis

Motivations and Goals Descriptive Statistics Scenarios of Analysis Guidelines for Presenting Data

SLIDE 87

Making Plots

http://algo2.iti.uni-karlsruhe.de/sanders/courses/bergen/bergenPresenting.pdf

[Sanders, 2002]

◮ Should the experimental setup from the exploratory phase be redesigned to increase conciseness or accuracy?
◮ What parameters should be varied? What variables should be measured?
◮ How are parameters chosen that cannot be varied?
◮ Can tables be converted into curves, bar charts, scatter plots or any other useful graphics?
◮ Should tables be added in an appendix?
◮ Should a 3D-plot be replaced by collections of 2D-curves?
◮ Can we reduce the number of curves to be displayed?
◮ How many figures are needed?
◮ Should the x-axis be transformed to magnify interesting subranges?

SLIDE 88

◮ Should the x-axis have a logarithmic scale? If so, do the x-values used for measuring have the same basis as the tick marks?
◮ Is the range of x-values adequate?
◮ Do we have measurements for the right x-values, i.e., nowhere too dense or too sparse?
◮ Should the y-axis be transformed to make the interesting part of the data more visible?
◮ Should the y-axis have a logarithmic scale?
◮ Is it misleading to start the y-range at the smallest measured value? (If not too much space is wasted, start from 0.)
◮ Should the range of y-values be clipped to exclude useless parts of curves?
◮ Can we use banking to 45°?
◮ Are all curves sufficiently well separated?
◮ Can noise be reduced using more accurate measurements?
◮ Are error bars needed? If so, what should they indicate? Remember that measurement errors are usually not random variables.

SLIDE 89

◮ Connect points belonging to the same curve.
◮ Only use splines for connecting points if interpolation is sensible.
◮ Do not connect points belonging to unrelated problem instances.
◮ Use different point and line styles for different curves.
◮ Use the same styles for corresponding curves in different graphs.
◮ Place labels defining point and line styles in the right order and without concealing the curves.
◮ Give axis units.
◮ Captions should make figures self-contained.
◮ Give enough information to make experiments reproducible.
◮ Golden ratio rule: make the graph wider than high [Tufte, 1983].
◮ Rule of 7: show at most 7 curves (omit those clearly irrelevant).
◮ Avoid: explaining axes, connecting unrelated points by lines, cryptic abbreviations, microscopic lettering, pie charts.

SLIDE 90

References

Birattari M., Stützle T., Paquete L., and Varrentrapp K. (2002). A racing algorithm for configuring metaheuristics. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2002), edited by L. et al., pp. 11–18. Morgan Kaufmann Publishers, New York.

Chiarandini M. (2009). Experimental analysis of optimization heuristics using R. Lecture notes, available at http://www.imada.sdu.dk/~marco/Teaching/Files/Rnotes.pdf.

Sanders P. (2002). Presenting data from experiments in algorithmics. In Experimental Algorithmics – From Algorithm Design to Robust and Efficient Software, vol. 2547 of LNCS, pp. 181–196. Springer.
