
Run-length distributions:

- RTDs based on run-times measured in terms of elementary operations of the given algorithm are also called run-length distributions (RLDs).

- Caution: RLDs should be based on elementary operations that either require constant CPU time (for the given problem instance), or on aggregate counts in which operations that require different amounts of CPU time (e.g., two types of search steps) are weighted appropriately.

- Elementary operations commonly used as the basis for RLD and other run-time measurements of SLS algorithms include search steps, objective function evaluations and updates of data structures used for implementing the step function.

Heuristic Optimization 2018 39

Some Usages of RTDs

Uses of empirical analysis through RTDs include:

- the analysis of asymptotic and stagnation behaviour,

- the use of functional approximations to mathematically characterise entire RTDs.

Such advanced analyses can facilitate improvements in the performance and run-time behaviour of a given SLS, e.g., by providing the basis for

- designing or configuring restart strategies and other diversification mechanisms,

- realising speedups through multiple independent runs parallelisation.

Heuristic Optimization 2018 40


Asymptotic run-time behaviour of SLS algorithms

- completeness: for each soluble problem instance π there is a time bound tmax(π) for the time required to find a solution.

- probabilistic approximate completeness (PAC property): for each soluble problem instance, a solution is found with probability → 1 as run-time → ∞. Note: Do not confuse with probably approximately correct (PAC) learning.

- essential incompleteness: for some soluble problem instances, the probability of finding a solution remains strictly smaller than 1 as run-time → ∞.

Heuristic Optimization 2018 41

Examples:

- Many randomised tree search algorithms are complete, e.g., Satz-Rand [Gomes et al., 1998].

- Uninformed Random Walk and Randomised Iterative Improvement are probabilistically approximately complete (PAC).

- Iterative Best Improvement is essentially incomplete.

Heuristic Optimization 2018 42


Note:

- Completeness of SLS algorithms can be achieved by using a restart mechanism that systematically initialises the search at all candidate solutions. This is typically very ineffective, due to the large size of the search space.

- Essential incompleteness of SLS algorithms is typically caused by the inability to escape from attractive local minima regions of the search space. Remedy: use diversification mechanisms such as random restart, random walk, probabilistic tabu tenure, . . . In many cases, these can render algorithms provably PAC; but their effectiveness in practice can vary widely.

Heuristic Optimization 2018 43

Asymptotic behaviour and stagnation

The three notions of SLS algorithm behaviour, completeness, the PAC property and essential incompleteness, correspond to properties of an algorithm’s theoretical RTDs. Note:

- Completeness can be empirically falsified for a given time-bound, but it cannot be empirically verified.

- Neither the PAC property nor essential incompleteness can be empirically verified or falsified.

- But: empirical RTDs can provide evidence (rather than proof) for essential incompleteness or PAC behaviour.

Heuristic Optimization 2018 44


Example of asymptotic behaviour in empirical RTDs:

[Figure: P(solve) vs. run-time [CPU sec] for MMAS and MMAS*.]

Note: MMAS is provably PAC, MMAS* is essentially incomplete.

Heuristic Optimization 2018 45

Functional characterisation of SLS behaviour (1)

- Empirical RTDs are step functions that approximate the underlying theoretical RTDs.

- For reasonably large sample sizes (numbers of runs), empirical RTDs can often be approximated well using much simpler continuous mathematical functions.

- Such functional approximations are useful for summarising and mathematically modelling empirically observed behaviour, which often provides deeper insights into SLS algorithm behaviour.

- Approximations with parameterised families of continuous distribution functions known from statistics, such as exponential or normal distributions, are particularly useful.
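As a minimal sketch of how an empirical RTD arises from run-time data (assuming numpy is available; the run-times below are invented for illustration):

```python
import numpy as np

def empirical_rtd(run_times):
    """Return the empirical RTD as (sorted run-times, solution probabilities).

    P(solve within t) is estimated as the fraction of runs that
    succeeded within run-time t, yielding a non-decreasing step function.
    """
    t = np.sort(np.asarray(run_times, dtype=float))
    p = np.arange(1, len(t) + 1) / len(t)
    return t, p

# Ten hypothetical successful runs (run-times in search steps):
t, p = empirical_rtd([120, 45, 300, 80, 95, 210, 60, 150, 400, 33])
# p rises in steps of 1/10 from 0.1 to 1.0
```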

Heuristic Optimization 2018 46


Approximation of an empirical RTD with an exponential distribution ed[m](x) := 1 − 2^(−x/m):

[Figure: P(solve) vs. run-time [search steps] for the empirical RLD and ed[61081.5].]

Heuristic Optimization 2018 47

Functional characterisation of SLS behaviour (2)

- Model fitting techniques, such as the Marquardt-Levenberg or Expectation Maximisation algorithms, can be used to find good approximations of empirical RTDs with parameterised cumulative distribution functions.

- The quality of approximations can be assessed using statistical goodness-of-fit tests, such as the χ²-test or the Kolmogorov-Smirnov test.

- Note: particularly for small or easy problem instances, the quality of optimal functional approximations can sometimes be limited by the inherently discrete nature of empirical RTD data.

- This approach can be easily generalised to ensembles of problem instances.
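The fit-then-test workflow can be sketched as follows, assuming scipy is available; the run-lengths are synthetic (drawn from an exponential distribution with median 60 000), and scipy's `curve_fit` uses a Levenberg-Marquardt-style least-squares fit by default:

```python
import numpy as np
from scipy.optimize import curve_fit
from scipy.stats import kstest

def ed(x, m):
    """CDF of the exponential distribution ed[m] with median m."""
    return 1.0 - 2.0 ** (-x / m)

rng = np.random.default_rng(0)
# Synthetic "run-lengths": exponential with median 60000
# (an exponential with median m has scale m / ln 2)
run_lengths = rng.exponential(scale=60000 / np.log(2), size=500)

# Empirical RTD (step function)
t = np.sort(run_lengths)
p = np.arange(1, len(t) + 1) / len(t)

# Least-squares fit of the median parameter m
(m_hat,), _ = curve_fit(ed, t, p, p0=[np.median(t)])

# Goodness of fit: Kolmogorov-Smirnov test against the fitted CDF
stat, p_value = kstest(run_lengths, lambda x: ed(x, m_hat))
```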

Heuristic Optimization 2018 48


Approximation of an empirical RTD with an exponential distribution ed[m](x) := 1 − 2^(−x/m):

[Figure: P(solve) vs. run-time [search steps] for the empirical RLD and ed[61081.5].]

The optimal-fit exponential distribution obtained from the Marquardt-Levenberg algorithm passes the χ² goodness-of-fit test at α = 0.05.

Heuristic Optimization 2018 49

Approximation of an empirical RTD with an exponential distribution ed[m](x) := 1 − 2^(−x/m):

[Figure: left, χ² value vs. median run-time [search steps], with the 0.01 and 0.05 acceptance thresholds; right, P(solve) vs. run-time [search steps] for the empirical RLD and ed[61081.5].]

Heuristic Optimization 2018 50


Example: asymptotic SQD of random-order first improvement for TSP instance pcb3038

[Figure: cumulative frequency vs. relative solution quality [%].]

Hypothesis: the solution quality data follow a normal distribution. This hypothesis can be tested using the Shapiro-Wilk test; the test does not reject the hypothesis that the data follow a normal distribution with mean 8.6 and standard deviation 0.51 (p-value 0.2836).
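Such a normality check can be sketched with scipy's Shapiro-Wilk test (assuming scipy; the sample below is synthetic, not the pcb3038 data):

```python
import numpy as np
from scipy.stats import shapiro

rng = np.random.default_rng(1)
# Hypothetical relative solution qualities [%], roughly normal around
# 8.6 with standard deviation 0.51 (values invented for illustration)
sqd = rng.normal(loc=8.6, scale=0.51, size=100)

stat, p_value = shapiro(sqd)
normality_rejected = p_value < 0.05  # at significance level alpha = 0.05
```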

Heuristic Optimization 2018 51

Q-Q plot

- graphical method for comparing two probability distributions by plotting their quantiles against each other

- if the two distributions compared are similar, the points lie roughly on a line
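The underlying quantile comparison can be sketched with numpy (the plotting itself is omitted; both samples below are synthetic and drawn from the same distribution, so their Q-Q points stay close to the line y = x):

```python
import numpy as np

def qq_points(sample_a, sample_b, n_quantiles=99):
    """Quantile pairs for a Q-Q plot of two samples."""
    qs = np.linspace(0.01, 0.99, n_quantiles)
    return np.quantile(sample_a, qs), np.quantile(sample_b, qs)

rng = np.random.default_rng(2)
a = rng.normal(0, 1, 1000)
b = rng.normal(0, 1, 1000)  # same distribution as a

qa, qb = qq_points(a, b)
max_dev = np.max(np.abs(qa - qb))  # small when distributions are similar
```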

Heuristic Optimization 2018 52


Performance improvements based on static restarts (1)

- Detailed RTD analyses can often suggest ways of improving the performance of a given SLS algorithm.

- Static restarting, i.e., periodic re-initialisation after all integer multiples of a given cutoff-time t0, is one of the simplest methods for overcoming stagnation behaviour.

- A static restart strategy is effective, i.e., leads to increased solution probability for some run-time t′, if the RTD of the given algorithm and problem instance is less steep than an exponential distribution crossing the RTD at some time t < t′.

Heuristic Optimization 2018 53

Example of an empirical RTD of an SLS algorithm on a problem instance for which static restarting is effective:

[Figure: P(solve) vs. run-time [CPU sec] for ILS and ed[18].]

‘ed[18]’ is the CDF of an exponential distribution with median 18; the arrows mark the optimal cutoff-time for static restarting.

Heuristic Optimization 2018 54


Improvements – check on several instances

Basic ILS for QAP (better, VNS perturbation, 2-exchange LS)

[Figure: four panels of P(solve) vs. run-time [CPU sec] for different solution quality bounds (0.5%, 0.25%, opt), each compared against two exponential distributions: ed[14]/ed[65], ed[31.5]/ed[113.5], ed[16]/ed[73] and ed[7.1]/ed[33].]

Heuristic Optimization 2018 55

Performance improvements based on static restarts (2)

- To determine the optimal cutoff-time topt for static restarts, consider the left-most exponential distribution that touches the given empirical RTD and choose topt to be the smallest t value at which the two respective distribution curves meet. (For a formal derivation of topt, see page 193 of SLS:FA.)

- Note: this method for determining optimal cutoff-times only works a posteriori, given an empirical RTD.

- Optimal cutoff-times for static restarting typically vary considerably between problem instances; for optimisation algorithms, they also depend on the desired solution quality.
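The graphical rule can be sketched numerically: through each point (t, F̂(t)) of the empirical RTD passes exactly one exponential ed[m](t) = 1 − 2^(−t/m); the leftmost (smallest-median) such exponential identifies the cutoff. A sketch assuming numpy, with invented heavy-tailed run-times for which restarting pays off:

```python
import numpy as np

def optimal_static_cutoff(run_times):
    """A posteriori cutoff for static restarts, following the graphical rule:
    among the exponentials ed[m](t) = 1 - 2**(-t/m) through the points of
    the empirical RTD, take the leftmost one (smallest median m); the
    cutoff is the point where it touches the RTD.
    """
    t = np.sort(np.asarray(run_times, dtype=float))
    n = len(t)
    p = np.arange(1, n + 1) / n
    p = np.where(p < 1.0, p, 1.0 - 0.5 / n)  # avoid log(0) at the last point
    m = t / np.log2(1.0 / (1.0 - p))         # median of ed through each point
    return t[np.argmin(m)]

# A few very long runs make restarting after the short runs worthwhile:
cutoff = optimal_static_cutoff([5, 6, 7, 8, 9, 10, 500, 600, 700, 800])
```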

Heuristic Optimization 2018 56


Overcoming stagnation using dynamic restarts

- Dynamic restart strategies are based on the idea of re-initialising the search process only when needed, i.e., when stagnation occurs.

- Simple dynamic restart strategy: re-initialise the search when the time interval since the last improvement of the incumbent candidate solution exceeds a given threshold θ. (Incumbent candidate solutions are not carried over restarts.) θ is typically measured in search steps and may be chosen depending on properties of the given problem instance, in particular instance size.

Heuristic Optimization 2018 57

Example: Effect of simple dynamic restart strategy

[Figure: P(solve) vs. run-time [CPU sec] for ILS and ILS + dynamic restart.]

Heuristic Optimization 2018 58


Other diversification strategies

- Restart strategies often suffer from the fact that search initialisation can be relatively time-consuming (setup time, time required for reaching promising regions of the given search space).

- This problem can be avoided by using other diversification mechanisms for overcoming search stagnation, such as:

  - random walk extensions that render a given SLS algorithm provably PAC;

  - adaptive modification of parameters controlling the amount of search diversification, such as temperature in SA or tabu tenure in TS.

- Effective techniques for overcoming search stagnation are crucial components of high-performance SLS methods.

Heuristic Optimization 2018 59

Multiple independent runs parallelisation

- Any SLS algorithm A can be easily parallelised by performing multiple runs on the same problem instance π in parallel on p processors.

- The effectiveness of this approach depends on the RTD of A on π: optimal parallelisation speedup of p is achieved for an exponential RTD.

- The RTDs of many high-performance SLS algorithms are well approximated by exponential distributions; however, deviations for short run-times (due to the effects of search initialisation) limit the maximal number of processors for which optimal speedup can be achieved in practice.
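The reasoning behind the optimal speedup can be made concrete: p independent runs succeed by time t with probability P_p(t) = 1 − (1 − F(t))^p, and for F = ed[m] this equals ed[m/p], i.e., the median run-time shrinks by exactly a factor p. A numpy sketch:

```python
import numpy as np

def parallel_rtd(F_values, p):
    """Success probability of p independent parallel runs:
    P_p(t) = 1 - (1 - F(t))**p, where F is the single-run RTD."""
    F = np.asarray(F_values, dtype=float)
    return 1.0 - (1.0 - F) ** p

# For an exponential RTD ed[m](t) = 1 - 2**(-t/m), p processors give
# 1 - 2**(-p*t/m) = ed[m/p](t): optimal speedup p.
t = np.linspace(0.0, 10.0, 1001)
m = 4.0
F1 = 1.0 - 2.0 ** (-t / m)
F8 = parallel_rtd(F1, 8)   # equals ed[m/8] pointwise
```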

Heuristic Optimization 2018 60


Speedup achieved by multiple independent runs parallelisation of a high-performance SLS algorithm for SAT:

[Figure: parallelisation speedup vs. number of processors for bw_large.c (hard) and bw_large.b (easier).]

Heuristic Optimization 2018 61

Summary descriptive statistics

Quantitative RTD analysis is typically based on basic descriptive statistics, such as:

- mean;

- median (q0.5) and other quantiles (e.g., q0.25, q0.75, q0.9);

- standard deviation or (better) variation coefficient vc := stddev/mean;

- quantile ratios, such as q0.75/q0.25 or q0.9/q0.1.

Note: due to the stochasticity of SLS algorithms, reporting measures of variability is important.
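These statistics are straightforward to compute with numpy; a small sketch on invented run-time data:

```python
import numpy as np

def rtd_statistics(run_times):
    """Basic descriptive statistics for an RTD/RLD sample."""
    x = np.asarray(run_times, dtype=float)
    q = lambda p: np.quantile(x, p)
    return {
        "mean": x.mean(),
        "median": q(0.5),
        "stddev": x.std(ddof=1),
        "vc": x.std(ddof=1) / x.mean(),   # variation coefficient
        "q75/q25": q(0.75) / q(0.25),
        "q90/q10": q(0.90) / q(0.10),
    }

stats = rtd_statistics([10, 20, 30, 40, 100])
```

Note that vc and the quantile ratios are dimensionless, which is what makes them invariant under rescaling of the time axis.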

Heuristic Optimization 2018 62


Example: for a given SLS algorithm for SAT applied to a specific SAT instance, we observe the following basic descriptive statistics for the RLD:

mean     57 606.23     median        38 911
min      107           q0.25; q0.1   16 762; 5 332
max      443 496       q0.75; q0.9   80 709; 137 863
stddev   58 953.60     q0.75/q0.25   4.81
vc       1.02          q0.9/q0.1     25.86

Heuristic Optimization 2018 63

Note:

- Quantiles (such as the median) are more stable w.r.t. extreme values than the mean.

- Unlike the standard deviation (or variance), the variation coefficient and quantile ratios are invariant under multiplication by constants.

- Descriptive statistics can be easily calculated from empirical RTDs.

- Obtaining sufficiently stable descriptive statistics requires a similar number of runs as measuring reasonably accurate RTDs.

- QRTDs and SQDs can be handled analogously to RTDs.

Heuristic Optimization 2018 64


Analyses on instance ensembles and comparison of algorithms

Basic quantitative analysis for ensembles of instances (1)

- In principle, the same approach as for individual instances is applicable: measure an empirical RTD for each instance, then analyse using RTD plots or descriptive statistics.

- In many cases, the RTDs for a set of instances have similar shapes or share important features (e.g., being uni- or bi-modal, or having a prominent right tail). Select a typical instance for presentation or further analysis, and briefly summarise the data for the remaining instances.

Heuristic Optimization 2018 65

RTDs for WalkSAT/SKC, a prominent SLS algorithm for SAT, on three hard 3-SAT instances:

[Figure: P(solve) vs. run-time [search steps].]

Heuristic Optimization 2018 66


Distribution of the median search cost for WalkSAT/SKC over a set of 1000 randomly generated, hard 3-SAT instances:

[Figure: cumulative probability vs. median run-time [search steps].]

Heuristic Optimization 2018 67

Basic quantitative analysis for ensembles of instances (2)

- For bigger sets of instances (e.g., samples from random instance distributions), it is important to characterise the performance of the given algorithm on individual instances as well as across the entire ensemble. Report and analyse run-time distributions on representative instance(s) as well as the search cost distribution (SCD), i.e., the distribution of basic RTD statistics (e.g., median or mean) across the given instance ensemble.

- For sets of instances that have been generated by systematically varying a parameter (e.g., problem size), study RTD characteristics in dependence of the parameter value.

Heuristic Optimization 2018 68


Comparing algorithms based on RTDs (1)

- Many empirical studies aim to establish the superiority of one SLS algorithm over another.

- For an instance of a decision problem, SLS algorithm A is superior to SLS algorithm B if for any run-time, A consistently gives a higher solution probability than B (probabilistic domination).

- For an instance of an optimisation problem, SLS algorithm A′ probabilistically dominates SLS algorithm B′ on a given problem instance iff for all solution quality bounds (time bounds), A′ probabilistically dominates B′ on the respective associated decision problem (SQD).

Heuristic Optimization 2018 69

Example of probabilistic domination

[Figure: P(sat) vs. run-time [CPU sec] for Novelty+/wcs+we and GLSSAT2.]

Heuristic Optimization 2018 70


Comparing algorithms based on RTDs (2)

- A probabilistic domination relation holds between two SLS algorithms on a given problem instance iff their respective (qualified) RTDs do not cross each other.

- Even for single problem instances, a probabilistic domination relation does not always hold (i.e., there is a cross-over between the respective RTDs). In this situation, which of two given algorithms is superior depends on the time both algorithms are allowed to run.

Heuristic Optimization 2018 71

Example of crossing RTDs for two SLS algorithms for the TSP applied to a standard benchmark instance (1000 runs/RTD):

[Figure: P(solve) vs. run-time [CPU sec] for ILS and MMAS.]

Heuristic Optimization 2018 72


Background: Statistical hypothesis tests (1)

- Statistical hypothesis tests are used to assess the validity of statements about properties of, or relations between, sets of statistical data.

- The statement to be tested (or its negation) is called the null hypothesis (H0) of the test. Example: for the Wilcoxon rank-sum test, the null hypothesis is ‘the two distributions underlying two given samples have the same median’.

- The significance level (α) determines the maximum allowable probability of incorrectly rejecting the null hypothesis. Typical values of α are 0.05 or 0.01.

Heuristic Optimization 2018 73

Background: Statistical hypothesis tests (2)

- The power of a test is the probability of rejecting a false null hypothesis. The desired power of a test determines the required sample size. Typical power values are at least 0.8; in many cases, sample size calculations for given power values are difficult.

- The application of a test to a given data set results in a p-value, which represents how likely the sampled data are under the assumption that the null hypothesis is correct. The null hypothesis is rejected iff this p-value is smaller than the previously chosen significance level α.

- Most common statistical hypothesis tests and other statistical analyses can be performed rather conveniently in the free R software environment (see http://www.r-project.org/).

Heuristic Optimization 2018 74


Comparing algorithms based on RTDs (3)

- The Wilcoxon rank-sum test (aka Mann-Whitney U-test) is used to test whether the medians of two samples (e.g., empirical RTDs) are significantly different.

- Unlike the t-test, the Wilcoxon test is distribution-free (or non-parametric), i.e., it does not depend on the assumption that the underlying probability distributions are Gaussian. (This assumption is typically violated for the RTDs of SLS algorithms and the SQDs of high-performing SLS algorithms.)

- The more specific hypothesis that the theoretical RTDs (or SQDs) of two algorithms are identical can be tested using the Kolmogorov-Smirnov test.

Heuristic Optimization 2018 75

Comparative analysis for instance ensembles (1)

Goal: compare the performance of SLS algorithms A and B on a given ensemble of instances.

- Use instance-based analysis to partition the given ensemble into three subsets:

  - instances on which A probabilistically dominates B;

  - instances on which B probabilistically dominates A;

  - instances on which there is no probabilistic domination between A and B (crossing RTDs).

The sizes of these subsets give a rather detailed picture of the algorithms’ relative performance on the given ensemble.

Heuristic Optimization 2018 76


Comparative analysis for instance ensembles (2)

- Use statistical tests to assess the significance of performance differences across the given instance ensemble.

- The binomial sign test measures whether there are statistically significant deviations from the theoretically expected distribution of observations into two categories (e.g., the percentage of instances on which the median solution quality of algorithm A is lower than that of algorithm B).

- Note: this test does not capture qualitative performance differences such as different shapes of the underlying RTDs and can easily miss interesting variation in relative performance across the ensemble.

Heuristic Optimization 2018 77

Comparative analysis for instance ensembles (3)

- Particularly for large instance ensembles, it is often useful to study the correlation between the performance of A and B across the ensemble. Typical performance measures used in this context are RTD or SQD statistics, such as the empirical median or mean.

- For qualitative correlation analyses, use scatter plots in which each instance π is represented by one point whose x and y co-ordinates correspond to the performance of A and B on π.

Heuristic Optimization 2018 78


Correlation between median run-time for two SLS algorithms for the TSP over a set of 100 randomly generated instances:

[Figure: scatter plot of median run-times of ILS vs. MMAS [CPU sec].]

10 runs per instance.

Heuristic Optimization 2018 79

Comparative analysis for instance ensembles (4)

- Quantitatively, the correlation can be summarised using the empirical correlation coefficient. Additionally, regression analysis can be used to model regular performance relationships.

- To test the statistical significance of an observed monotonic relationship, use non-parametric tests such as Spearman’s rank order test.
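A Spearman rank correlation analysis can be sketched with scipy; the per-instance median run-times below are synthetic, constructed to share a common instance-hardness component:

```python
import numpy as np
from scipy.stats import spearmanr

rng = np.random.default_rng(4)
# Hypothetical median run-times of two algorithms on 100 instances,
# both influenced by a shared instance-hardness factor
hardness = rng.lognormal(mean=0.0, sigma=1.0, size=100)
med_a = hardness * rng.lognormal(sigma=0.8, size=100)
med_b = hardness * rng.lognormal(sigma=0.8, size=100)

# Rank-based test for a monotonic association
rho, p_value = spearmanr(med_a, med_b)
```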

Heuristic Optimization 2018 80


Correlation between median run-time for two SLS algorithms for the TSP over a set of 100 randomly generated instances:

[Figure: scatter plot of median run-times of ILS vs. MMAS [CPU sec].]

10 runs per instance; correlation coefficient 0.39, significant according to Spearman’s rank order test at α = 0.05; p-value = 9 · 10^(−11).

Heuristic Optimization 2018 81

Peak Performance vs Robustness (1)

- Most high-performance SLS algorithms have parameters that significantly affect their performance. Examples: walk probability wp in RII, tabu tenure in TS, mutation rate in EAs.

- When evaluating parameterised SLS algorithms, peak performance, i.e., the performance of a parameterised SLS algorithm for optimised parameter values, is often used as a performance criterion. Note: peak performance is a measure of potential performance.

- Pitfall: unfair parameter tuning, i.e., the use of unevenly optimised parameter settings in comparative studies.

Heuristic Optimization 2018 82


Peak Performance vs Robustness (2)

- To avoid unfair parameter tuning, spend approximately the same effort on tuning the parameters of all algorithms participating in a direct performance comparison. Alternative: use automated parameter tuning techniques from experimental design.

- Note:

  - Optimal parameter settings often vary substantially between problem instances or instance classes.

  - Effects of multiple parameters are typically not independent.

- Performance robustness, i.e., the variation in performance due to deviations from optimal parameter settings, is an important performance criterion.

Heuristic Optimization 2018 83

Peak Performance vs Robustness (3)

- Performance robustness can be studied empirically by measuring the impact of parameter settings on the RTDs (or their descriptive statistics) of a given SLS algorithm on a set of problem instances.

- More general notions of robustness include performance variation over:

  - multiple runs for fixed input (captured in the RTD),

  - different problem instances or domains.

- Advanced empirical studies should attempt to relate the latter types of variation to features of the respective instances or domains (e.g., scaling studies relate SLS performance to instance size).

Heuristic Optimization 2018 84


Benchmark sets

Some criteria for constructing/selecting benchmark sets:

- instance hardness (focus on challenging instances)

- instance size (provide a range; scaling studies)

- instance type (provide variety):

  - individual application instances

  - hand-crafted instances (realistic, artificial)

  - ensembles of instances from random distributions (random instance generators)

  - encodings of various other types of problems (e.g., SAT-encodings of graph colouring problems)

Heuristic Optimization 2018 85

To ensure comparability and reproducibility of results:

- use established benchmark sets from public benchmark libraries (such as TSPLIB, SATLIB, QAPLIB, etc.) and/or the related literature;

- make newly created test-sets available to other researchers.

Note: careful selection and a good understanding of benchmark sets are often crucial for the relevance of an empirical study!

Heuristic Optimization 2018 86


Other issues

Environment specification:

- specify the execution environment

  - machine, operating system, compiler, etc.

- specify the implementation

  - report special details that impact performance (e.g., special data structures)

  - programming language, compiler

  - ideally, make source code available

Heuristic Optimization 2018 87

More issues concerning SLS algorithms

- performance: average vs. peak vs. robustness

- correctness

- reproducibility (environment, implementation)

- ease of implementation

- configurability

Heuristic Optimization 2018 88


Case study: SLS algorithms for QAP

- exemplify a possible experimental comparison of SLS algorithms for the QAP

- experiments on a single problem instance (not using RTDs, this time)

- aggregation of results across instances

- SLS algorithms include ACO (2), ILS (2), TS (2), SA, EA

- Question: which metaheuristic for which problem instances?

- algorithms tested on a total of:

  - 34 instances from QAPLIB

  - 97 randomly generated instances

Acknowledgements: Thanks to Michael Sampels for the data and plots.

Heuristic Optimization 2018 89

Example results: instance tai50a

- characteristics:

  - unstructured, randomly generated instance

  - best known solution: 4 941 410

- all algorithms are given the same computation time limit, corresponding to 10 000 n iterations of a reference tabu search algorithm

  - time limit 111.39 sec; the 95% confidence interval is [110.37, 112.41] secs

- each algorithm is run 25 times

- computational environment: AMD Athlon 1100 MHz CPU, 256 MB RAM, RedHat Linux 7.0, kernel 2.2.16-22, compiler gcc 2.95.3, -O3 flags

Heuristic Optimization 2018 90


Best results of each trial, ordered:

rank   name     value
1      aco.be   4 961 194
2      ils.be   4 962 298
3      ils.be   4 963 926
...    ...      ...
120    ils.d    5 004 444
121    ec.d     5 005 364
122    aco.ch   5 007 138
...    ...      ...
199    ts.nl    5 109 440
200    ts.nl    5 117 944

[Figure: rank vs. solution value for aco.be, aco.ch, ec.d, ils.be, ils.d, sa.nl, ts.ch, ts.nl on tai50a, time limit 10 000 n.]

Figure: Results of 25 trials of the developed implementations of metaheuristics on tai50a

Heuristic Optimization 2018 91

Compute mean values and sort the list:

name     mean value   sd value    mean rank   sd rank
ils.be   4 974 510     6 012.32    25.72      16.84
aco.be   4 977 123     5 836.67    32.92      18.36
ts.ch    4 989 029     8 548.36    70.28      27.41
ils.d    4 997 612     7 972.34    97.08      24.12
aco.ch   5 000 704    10 537.01   103.84      26.66
ec.d     5 012 760    14 523.08   124.72      30.65
sa.nl    5 050 280    17 866.44   163.84      12.23
ts.nl    5 085 123    19 689.38   185.60      11.48

Are these differences statistically significant?

Heuristic Optimization 2018 92


Using boxplots for visualization

[Figure: boxplots of the solution values found by aco.be, aco.ch, ec.d, ils.be, ils.d, sa.nl, ts.ch, ts.nl on tai50a, time limit 10 000 n.]

- median represented by a line

- box spans the data from the q0.25 to the q0.75 quartile

- whiskers go to the extremes of the data

- outliers shown individually

Heuristic Optimization 2018 93

Pairwise Student’s t-test

- H0: the two populations are identical

- adjustment for pairwise testing

          aco.be    aco.ch    ec.d      ils.be    ils.d     sa.nl     ts.ch
aco.ch    2.4e–09
ec.d      < 2e–16   0.00444
ils.be    0.76037   4.0e–11   < 2e–16
ils.d     2.1e–07   0.76037   0.00021   4.6e–09
sa.nl     < 2e–16   < 2e–16   < 2e–16   < 2e–16   < 2e–16
ts.ch     0.00444   0.00444   2.0e–09   0.00038   0.04658   < 2e–16
ts.nl     < 2e–16   < 2e–16   < 2e–16   < 2e–16   < 2e–16   < 2e–16   < 2e–16

Student’s t-test is valid if a normal distribution can be assumed for the populations.

Heuristic Optimization 2018 94


Check of distribution assumption

[Figure: histograms of the best solution values found by ec.d and ils.be.]

- the distribution of the data points is clearly not normal

- although the t-test is robust to deviations from normality, its use does not seem really justified here

- non-parametric tests are more appropriate

Heuristic Optimization 2018 95

Ranking of data

[Figure: boxplots of the ranks achieved by aco.be, aco.ch, ec.d, ils.be, ils.d, sa.nl, ts.ch, ts.nl on tai50a.]

- adequate, non-parametric tests rely on the ranks of the results

Heuristic Optimization 2018 96


Wilcoxon test

- H0: the two populations are identical

- adjustment for pairwise testing

          aco.be    aco.ch    ec.d      ils.be    ils.d     sa.nl     ts.ch
aco.ch    1.3e–10
ec.d      2.9e–12   0.00741
ils.be    0.39082   2.9e–11   3.8e–13
ils.d     2.1e–11   0.39082   0.00093   5.9e–13
sa.nl     3.8e–13   2.9e–12   6.2e–09   3.8e–13   5.9e–13
ts.ch     3.4e–05   0.00015   8.0e–06   1.1e–06   0.00600   3.8e–13
ts.nl     3.8e–13   3.8e–13   1.9e–12   3.8e–13   3.8e–13   6.0e–07   3.8e–13

Heuristic Optimization 2018 97

Plots summarizing the experimental data

[Figure: rank plot of solutions, boxplots of solutions and ranks for aco.be, aco.ch, ec.d, ils.be, ils.d, sa.nl, ts.ch, ts.nl, and distribution of solutions. Problem instance tai50a, time limit 10 000 n. N: 50; trials: 25; best known: 4 941 410; best found: 4 961 194; flow dominance 1: 57.86%; flow dominance 2: 59.38%; flow dominance max.: 59.38%; sparsity 1: 0.0131; sparsity 2: 0.0041.]

Heuristic Optimization 2018 98


Plots summarizing the experimental data (1000n)

[Figure: rank plot of solutions, boxplots of solutions and ranks for aco.be, aco.ch, ec.d, ils.be, ils.d, sa.nl, ts.ch, ts.nl, and distribution of solutions. Problem instance tai50a, time limit 1 000 n. N: 50; trials: 100; best known: 4 941 410; best found: 4 951 186; flow dominance 1: 57.86%; flow dominance 2: 59.38%; flow dominance max.: 59.38%; sparsity 1: 0.0131; sparsity 2: 0.0041.]

Heuristic Optimization 2018 99

Plots summarizing the experimental data (100n)

[Figure: rank plot of solutions, boxplots of solutions and ranks for aco.be, aco.ch, ec.d, ils.be, ils.d, sa.nl, ts.ch, ts.nl, and distribution of solutions. Problem instance tai50a, time limit 100 n. N: 50; trials: 100; best known: 4 941 410; best found: 4 978 722; flow dominance 1: 57.86%; flow dominance 2: 59.38%; flow dominance max.: 59.38%; sparsity 1: 0.0131; sparsity 2: 0.0041.]

Heuristic Optimization 2018 100


Plots summarizing the experimental data (10n)

[Figure: rank plot of solutions, boxplots of solutions and ranks for aco.be, aco.ch, ec.d, ils.be, ils.d, sa.nl, ts.ch, ts.nl, and distribution of solutions. Problem instance tai50a, time limit 10 n. N: 50; trials: 100; best known: 4 941 410; best found: 5 007 196; flow dominance 1: 57.86%; flow dominance 2: 59.38%; flow dominance max.: 59.38%; sparsity 1: 0.0131; sparsity 2: 0.0041.]

Heuristic Optimization 2018 101

Effects of various time limits

[Figure: boxplots of the ranks achieved by aco.be, aco.ch, ec.d, ils.be, ils.d, sa.nl, ts.ch, ts.nl under the four time limits 10 000 n, 1 000 n, 100 n and 10 n.]

Heuristic Optimization 2018 102


Final result for tai50a

- Ranking is from left to right

- 10 000 n: aco.be ∼ ils.be  ts.ch  aco.ch ∼ ils.d  ea.d  sa.nl  ts.nl

- 1 000 n: ils.be  aco.be  ts.ch  ils.d  aco.ch  ea.d  sa.nl  ts.nl

- 100 n: ils.be  aco.be  ts.ch  ils.d  ea.d  aco.ch  sa.nl  ts.nl

- 10 n: aco.be  ils.be  ils.d  aco.ch  ea.d  sa.nl  ts.nl  aco.ch ∼ ts.ch

Heuristic Optimization 2018 103

General result for QAP

- How to calculate the general result:

  - one or two implementations for each metaheuristic

  - how to combine the results?

- approach: compute a normalized mean rank for each algorithm ALG, each instance inst and each time limit t:

  m(ALG, inst, t) = avg_rank(ALG, inst, t) / #experiments(inst, t) ∈ (0, 1)

- summarize results across instances; compare algorithms for each time limit

Heuristic Optimization 2018 104


General result for QAP

[Figure: normalized mean ranks of aco.be, aco.ch, ec.d, ils.be, ils.d, sa.nl, ts.ch, ts.nl for the time limits 10 000 n, 1 000 n, 100 n and 10 n.]

Heuristic Optimization 2018 105

Correlation with sparsity, flow dominance

[Figure: RelRank vs. maximum sparsity and RelRank vs. maximum flow dominance, time limit 10 000 n.]

- for increasing sparsity and flow dominance:

  - relative performance of aco.ch decreases

  - relative performance of sa.nl increases

Heuristic Optimization 2018 106


Correlation with instance size

[Figure: RelRank vs. instance size n, time limits 1 000 n and 10 000 n.]

- for increasing instance size:

  - relative performance of aco.ch decreases

  - relative performance of sa.nl increases

  - ts.ch good for n ∈ [50, 100]

Heuristic Optimization 2018 107