Automatic Configuration of Benchmark Sets for Classical Planning



SLIDE 1

The ICAPS Way Benchmark Design Principles Benchmark Configuration Evaluation Conclusion

Automatic Configuration of Benchmark Sets for Classical Planning

Álvaro Torralba,¹ Jendrik Seipp,² Silvan Sievers²

¹Aalborg University, Denmark  ²University of Basel, Switzerland

October 21, 2020

Automatic Configuration of Benchmark Sets for Classical Planning 1/25

SLIDE 2

Outline

1. The ICAPS Way
2. Benchmark Design Principles
3. Benchmark Configuration
4. Evaluation
5. Conclusion

SLIDE 3

The Cycle of Life (in Planning Research)

"Everything You Always Wanted to Know About Planning (But Were Afraid to Ask)" (Jörg Hoffmann, 2011)


SLIDE 6

Empirical Evaluation – Examples from HSDIP'20



SLIDE 15

Empirical Evaluation – The ICAPS/IPC Way

The ICAPS/IPC Way:
- Measure coverage
- Time limit: 30 minutes
- Memory limit: 2-8 GB
- Use the benchmarks from the International Planning Competition

Having a standard evaluation setting is generally beneficial:
- Reproducibility
- Interpretability
- Avoids hand-picking results



SLIDE 18

The diversity in the IPC Benchmark Set


SLIDE 21

So, What's Wrong with the IPC Benchmark Set?

Table: Coverage of LAMA (L), Decstar (D) and OLCFF (O)

                         IPC              New'14
                      L    D    O      L    D    O
  Nomystery (20)     11   20   12     25   30   24
  Rovers (40)        40   40   40     22   18   21
  Woodworking (50)   50   50   50     18   27   30
  Total             101  110  102     65   75   75

- Different numbers of instances per domain
- Instance scaling: too easy, too hard, and not smooth

→ Experiments on some domains of the IPC benchmark set may not observe a difference between planners even if one exists!

SLIDE 22

Non-Smooth Scaling

[Plot: per-instance solution times of Complementary 2 and Delfi-blind on the IPC instances; time axis from 10^1 to 10^3 seconds, plus "unsolved"]

SLIDE 23

Smooth Scaling

[Plot: per-instance solution times of Complementary 2 and Delfi-blind on the New'14 instances; time axis from 10^-2 to 10^3 seconds, plus "unsolved"]


SLIDE 27

Contribution

An automatic tool to select instances from a given domain (more informative than the IPC set for comparing current and future planners)

1. Smooth scaling from easy to hard instances:
   - Easy: solvable by any planner that anyone would compare against (a baseline)
   - Hard: out of reach of existing planners within a reasonable time limit
2. Minimize bias towards/against the planners used


SLIDE 29

Example Domain: Barman

Instance Generator:

  ./barman-generator.py <num_cocktails> <num_ingredients> <num_shots> [<random_seed>]

    num_cocktails    (min 1)
    num_ingredients  (min 2)
    num_shots        (min num_cocktails+1)
    random_seed      (min 1, optional)
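The generator usage above can be scripted to produce a whole sequence of instance invocations. The sketch below is illustrative only: the function name, the flooring of scaled values, and the seed handling are assumptions, and it merely builds the command lines rather than running the generator.

```python
import math

def barman_commands(base, slope, ingredients, n=7, seed=1):
    """Build command lines for a linearly scaled Barman sequence.

    num_cocktails grows as floor(base + i * slope); the generator
    requires num_shots >= num_cocktails + 1, so we use exactly that.
    """
    cmds = []
    for i in range(n):
        cocktails = math.floor(base + i * slope)
        shots = cocktails + 1
        # Argument order follows the usage string above:
        # <num_cocktails> <num_ingredients> <num_shots> [<random_seed>]
        cmds.append(f"./barman-generator.py {cocktails} {ingredients} {shots} {seed + i}")
    return cmds

for cmd in barman_commands(base=5, slope=1.34, ingredients=3):
    print(cmd)
```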


SLIDE 33

Instance Generation Problem

Input:
- domain
- instance generator
- a baseline planner
- a set of state-of-the-art planners

Output: a set of instances with good scaling

Generate instances → Compute/estimate runtimes → Select instances

How do we avoid bias w.r.t. the set of considered planners?
→ Revised output: a set of linear parameter scalings for the generator that produce a good scaling in runtime


SLIDE 37

Sequences of instances

The user specifies characteristics of the generator parameters:
- Linear attributes: numeric values that increase the size of the task; the user specifies ranges for the base value (b) and the slope (m)
- Enumerated attributes: a finite set of values; fixed within a sequence

For Barman:

  cocktails:    b ∈ [1, 6],  m ∈ [1, 5]
  shots:        b ∈ [1, 5],  m ∈ [0, 5], + cocktails
  ingredients:  v ∈ {3, 4, 5}

Our system may select sequences like (b = 5, m = 1.34), (b = 1, m = 0, + cocktails), (v = 3):

  cocktails  shots  ingredients
      5        6        3
      6        7        3
      7        8        3
      9       10        3
     10       11        3
     11       12        3
     13       14        3
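The sequence above can be reproduced with a small helper. Flooring the scaled value is an assumption on our part, but it matches the slide's numbers: the cocktails attribute (b = 5, m = 1.34) yields 5, 6, 7, 9, 10, 11, 13, and shots (b = 1, m = 0, + cocktails) adds its own scaled value to cocktails.

```python
import math

def expand(base, slope, n):
    """Instantiate a linear attribute as floor(base + i * slope), i = 0..n-1."""
    return [math.floor(base + i * slope) for i in range(n)]

n = 7
cocktails = expand(base=5, slope=1.34, n=n)                   # [5, 6, 7, 9, 10, 11, 13]
# "+ cocktails" attribute: its own linear value plus the cocktails value
shots = [s + c for s, c in zip(expand(1, 0, n), cocktails)]   # [6, 7, 8, 10, 11, 12, 14]
ingredients = [3] * n  # enumerated attribute: fixed within the sequence
```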

SLIDE 38

Optimization Process

1. Generate candidate sequences that scale smoothly
2. Select (sub-)sequences so that the set covers easy to hard instances

SLIDE 39

Sequence Optimization

- We use SMAC to optimize the values of b, m and v for each parameter
- Instance difficulty is measured as the best runtime of any planner (time limit: 180 seconds)
- A penalty term measures how smoothly difficulty scales (runtime should ideally grow by a factor of 1.5 to 2 between consecutive instances)

Example:
  Runtimes:   10.36, 15.41, 18.9, 28.02, 29.27, 68.01
  Ratios:      1.48,  1.22,  1.48,  1.04,  2.32
  Penalties:   0.02,  0.54,  0.02,  0.91,  0.13
  Total penalty: 1.62
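A sketch of the penalty computation: take the ratios of consecutive runtimes and penalize ratios that fall outside the target interval [1.5, 2]. The distance-to-interval penalty used here is a simplifying assumption and does not reproduce the slide's exact penalty values, but the ratios match the example.

```python
def smoothness_penalty(runtimes, lo=1.5, hi=2.0):
    """Ratios of consecutive runtimes, plus a penalty for each ratio
    outside the target interval [lo, hi] (assumed penalty shape)."""
    ratios = [b / a for a, b in zip(runtimes, runtimes[1:])]
    penalties = [max(lo - r, 0.0, r - hi) for r in ratios]
    return ratios, penalties

runtimes = [10.36, 15.41, 18.9, 28.02, 29.27, 68.01]
ratios, penalties = smoothness_penalty(runtimes)
print([int(r * 100) / 100 for r in ratios])  # [1.48, 1.22, 1.48, 1.04, 2.32]
```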

SLIDE 40

Sequence Selection

MIP encoding to select sequences satisfying

hard constraints:
- There are 30 instances
- (Easy) The baseline solves at least one instance in less than 30 seconds
- (Hard) Sub-sequences go from easy (≤ 180 s) to hard (> 2000 s)
- (Diverse) Don't repeat the same parameters more than twice

and soft constraints:
- (Easy) The baseline solves 2 to 6 instances under 30 seconds
- (Easy) State-of-the-art planners solve 8 to 15 instances under 180 seconds
- (Hard) All sequences end in a very hard instance
- (Diverse) Don't repeat the same parameters more than once
- (Smooth) Minimize the penalty of the selected sequences
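As a toy stand-in for the MIP, a brute-force search over hypothetical candidate sequences illustrates the selection step. Everything here is invented for illustration: the candidate data, the benchmark size of 10 instead of 30, and the fact that only a subset of the hard constraints is checked.

```python
from itertools import combinations

# Hypothetical candidates: parameter setting, per-instance best runtimes
# (seconds; values > 2000 stand for "too hard"), and smoothness penalty.
CANDIDATES = [
    {"params": "A", "times": [12, 25, 150, 900, 2500], "penalty": 0.4},
    {"params": "A", "times": [20, 80, 400, 1500, 3000], "penalty": 0.9},
    {"params": "B", "times": [5, 40, 200, 1200, 2600], "penalty": 0.2},
    {"params": "B", "times": [15, 60, 500, 2100, 2700], "penalty": 1.1},
    {"params": "C", "times": [8, 30, 300, 1800, 2400], "penalty": 0.6},
    {"params": "C", "times": [25, 100, 700, 2200, 2900], "penalty": 0.5},
]

def feasible(selection, n_instances=10):
    times = [t for s in selection for t in s["times"]]
    if len(times) != n_instances:               # fixed benchmark size
        return False
    if not any(t < 30 for t in times):          # (Easy) something is solved fast
        return False
    for s in selection:                         # (Hard) sequence spans easy to hard
        if not (min(s["times"]) <= 180 and max(s["times"]) > 2000):
            return False
    params = [s["params"] for s in selection]   # (Diverse) cap parameter repeats
    return all(params.count(p) <= 2 for p in params)

# (Smooth) among feasible selections, minimize the total penalty
best = min(
    (sel for sel in combinations(CANDIDATES, 2) if feasible(sel)),
    key=lambda sel: sum(s["penalty"] for s in sel),
)
print([s["params"] for s in best])  # ['A', 'B']
```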


SLIDE 42

Experiments

- Compare our new benchmark sets against the IPC set
- 26 domains
- Satisficing and optimal tracks
- 2 new benchmark sets that differ in the "training set":
  - New'14: using planners up to 2014
  - New'20: using all available planners
- Evaluation based on planners from IPC'18


SLIDE 45

Evaluation Criteria

How do we evaluate the quality of a benchmark set?

- Coverage range: generally better if every planner solves some instance and no planner solves all instances
- Comparisons: the number of planner pairs (X, Y) such that coverage(X) ≠ coverage(Y)

Goodhart's law: "When a measure becomes a target, it ceases to be a good measure." – Marilyn Strathern

→ Comparisons is a useful metric for comparing benchmark sets, but not a metric to optimize for (that would introduce bias towards the chosen set of planners)
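Under the reading that a planner pair counts as "compared" when their coverage differs, the comparisons metric is a one-liner. The coverage numbers below reuse the IPC totals from the earlier coverage table.

```python
from itertools import combinations

def comparisons(coverage):
    """Number of planner pairs (X, Y) whose coverage differs, i.e.
    pairs that the benchmark set is able to distinguish."""
    return sum(1 for x, y in combinations(coverage.values(), 2) if x != y)

# Total IPC coverage of LAMA, Decstar and OLCFF from the table above:
cov = {"lama": 101, "decstar": 110, "olcff": 102}
print(comparisons(cov))  # 3: all three pairs are distinguished
```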

SLIDE 46

Results

SLIDE 47

Highlight: SAT track

Comparisons per domain, satisficing track (columns: IPC, New'14, New'20):

  gripper        7    7
  miconic
  elevators           7
  blocksworld        27   28
  driverlog          12   24
  grid               26   24
  zenotravel         23   25
  barman         7   24   27
  depot          7   27   22
  parking        7   24   21
  rovers         7   26   27
  transport      7   24   26
  visitall       7   24   26



SLIDE 50

Conclusion

- A new tool to automatically select instances
- Our tool consistently generates well-scaled instance sets that are useful for evaluating current planners
- The new benchmark set is significantly better than the IPC benchmark set, especially in the SAT/AGL tracks

We need your feedback!

- Do you find the results of our tool useful?
- Is there any reason to prefer the IPC set over our new one?
- Are there any constraints that we should take into account (in general or for specific domains)?