goal recognition in latent space Leonardo Amado, Ramon Fraga Pereira, - PowerPoint PPT Presentation

Mauricio Magnaguagno, Roger Granada and Felipe Meneguzzi July 2018 goal recognition in latent space Leonardo Amado, Ramon Fraga Pereira, João Paulo Aires , PUCRS

introduction

∙ Goal recognition is the task of inferring the intended goal of an agent by observing the actions of such agent. ∙ Current approaches of goal recognition assume that there is a domain expert capable of building complete and correct domain knowledge. 2 Introduction

∙ This is too strong for most real-world applications. ∙ To overcome these limitations, we combine goal recognition techniques from automated planning and deep autoencoders to automatic generate PDDL domains and use them to perform goal recognition 3 Introduction

background

5 Goal recognition A goal recognition problem is a tuple P GR = ⟨D , F , I , G , O ⟩ , where: ∙ D is a planning domain; ∙ F is the set of facts; ∙ I ⊆ F is an initial state; ∙ G is the set of possible goals, which include a correct hidden goal G ∗ ( G ∗ ∈ G ); ∙ and O = ⟨ o 1 , o 2 , ..., o n ⟩ is an observation sequence of executed actions, with each observation o i ∈ A , and the corresponding action being part of a valid plan π that sequentially transforms I into G ∗ .

∙ Using autoencoders it is possible to encode an image to a binary representation (equiv. to logic fluents) ∙ To perform the encoding of complex images , a complex autoencoder can ∙ The encoded representation is called latent space . Source: https://towardsdatascience.com/applied-deep-learning-part-3-autoencoders-1c083af4d798 6 Autoencoders be used, using the Gumbel Softmax .

planning in latent space

∙ Taking advantage of such autoencoders, LatPlan [Asai and Fukunaga, 2017] generates plans using only images of the initial and goal states. ∙ The initial state image and goal images are encoded in a binary representation. ∙ LatPlan uses traditional planning algorithms to plan using only the latent-space ∙ LatPlan shows that many classical heuristics remain valid and effective even in latent space 8 LatPlan

9 LatPlan Figure: Latplanner.

goal recognition in latent space

∙ We propose an approach capable of recognizing goals in image based domains. ∙ We use the same tuple as planning goal recognition, but our states are now images. 11 Goal recognition in raw data

12 Goal Recognition in latent space Figure: Goal Recognizer.

To recognize goals in image based domains, there are 4 milestones we must achieve. 1. First, we must train an autoencoder capable of creating a latent representation to a state of such image domain. 2. Second, we derive a PDDL domain, by extracting the transitions of such 3. Third, we must convert to a latent representation a set of images 4. Finally, we can apply goal recognition techniques using the computed 13 Goal Recognition in latent space domain when encoded in latent space, obtaining a domain D . representing, the initial state I , the set of facts F and a set of possible goals G , where the hidden goal G ∗ is included. tuple ⟨D , F , I , G , O ⟩

∙ Use a dataset with 20000 states to train the autoencoder. ∙ Use a dataset with all the state transitions to extract a PDDL. ∙ Convert the GR problem to latent space using the autoencoder. ∙ With the domain PDDL and the encoded PR problem, recognize a plan in latent space. 14 Goal recognition in latent space Figure: IGR complete schematics.

15 We use the autoencoder with the following structure, using 36 bits for the latent representation: Goal Recognition in latent space Latent Representation Fully Gaussian Connected Noise(0.4) (72) Fully Fully Connected Connected Convolution Convolution (1000) (1000) 2D 3x3 2D 3x3 Figure: Autoencoder structure.

To derive a domain PDDL from raw data, we use the following method. 1. We encode every single transition using the autoencoder. 2. We then group up transitions that have the same effect. 3. We then derive a precondition by comparing which bits do not change between each transition of each group of effects. 4. Having both a precondition and an effect, we derive a PDDL action. 16 Goal Recognition in latent space

experiments

18 To test our approach, we use 6 domains from 3 distinct games. Domains (f) Hanoi (d) LO Digital (e) LO Twist (a) MNIST (b) Mandrill (c) Spider Figure: Sample state for each domain.

19 1048576 192 3.974 LO Digital 1048576 1048576 100.0% 5940 1392 4.267 LO Twisted 1048576 100.0% 100.0% 12669 1392 9.101 Hanoi 237 237 100.0% 211 38 5.552 763 967680 First, we analyze the quality of the PDDL domain and the accuracy of the 25.76 autoencoder. MNIST 967680 963795 967680 4946 192 99.6% Mandrill 967680 967680 100.0% 495 192 2.578 Spider Autoencoder results Table: PDDL generation performance for each domain. Domain Total Transitions Encoded Transitions SAE Accuracy % Computed Actions Ground Actions PDDL Redundancy

Second, we show the results obtained by goal recognition techniques using hand-made PDDL domains. ∙ We consider different levels of observability: 10, 30, 50, 70, and 100% ∙ We evaluate Time, Accuracy, and Spread over the three games ∙ We use three different standard Goal Recognizers 20 Standard Goal Recognition results

21 0.092 / 0.100 8-Puzzle 6.0 50 4.0 0.088 / 0.091 100.0% / 100.0% 1.1 / 1.6 0.191 100.0% 1.3 70 5.3 100.0% / 100.0% 100.0% 1.0 / 1.0 0.210 100.0% 1.0 100 7.3 0.108 / 0.110 100.0% / 100.0% 1.0 / 1.0 0.246 83.3% 1.1 1.3 0.188 RG 83.3% / 83.3% 0.079 / 0.085 3.0 30 4.8 100.0% 0.179 2.6 / 2.6 33.3% / 33.3% 0.074 / 0.080 1.0 10 POM ( h uniq ) 1.0 / 2.5 Standard Goal Recognition Results Sample of the obtained results Time (s) Accuracy % Spread in G Domain (%) Obs Time (s) Accuracy % Spread in G |G| | O | θ (0 / 10) θ (0 / 10) θ (0 / 10)

22 3.0 MNIST 4.8 22.26 1.4 / 3.0 20.0% / 80.0% 0.587 / 0.599 30 50 6.0 21.25 0.555 / 0.562 1.2 10 83.3% 7.3 6.0 4.0 70 23.53 3.4 26.34 2.4 / 3.0 0.676 / 0.681 7.8 100 3.2 2.4 / 3.6 0.609 / 0.628 0.631 / 0.654 5.8 70 4.8 22.48 2.2 / 2.8 60.0% / 80.0% 5.3 100 3.0 4.0 POM ( h uniq ) RG 10 1.0 33.3% / 33.3% 30 8-Puzzle 6.0 50 Goal recognition in latent space Comparing hand-made and automatic generated PDDL domains. Accuracy % Time (s) Spread in G Domain (%) Obs Time (s) Accuracy % Spread in G |G| | O | θ (0 / 10) θ (0 / 10) θ (0 / 10) 0.074 / 0.080 2.6 / 2.6 0.179 100.0� 4.8 0.079 / 0.085 83.3� / 83.3� 1.0 / 2.5 0.188 100.0� 1.3 0.088 / 0.091 100.0� / 100.0� 1.1 / 1.6 0.191 100.0� 1.3 0.092 / 0.100 100.0� / 100.0� 1.0 / 1.0 0.210 100.0� 1.0 0.108 / 0.110 100.0� / 100.0� 1.0 / 1.0 0.246 1.1 40.0� / 60.0� 1.6 / 3.2 100.0� 100.0� 100.0� 60.0% / 100.0� 100.0� 80.0% / 100.0� 100.0�

conclusion and future work

∙ We developed an approach for goal recognition capable of obviating the need for human engineering to create a task for goal recognition. ∙ Empirical results shows that our approach comes close to standard goal recognition techniques. ∙ Regardless, our approach allows breakthroughs in goal recognition techniques. ∙ Our current approach has two main limitations: ∙ we need all possible transitions of the domain; ∙ we currently use relatively small images as input. 24 Conclusion

∙ For future work we aim to improve pruning of redundant actions in the domain inference process. ∙ Furthermore, we would like to develop plan recognition algorithms for incomplete domain models. ∙ Finally, we aim to develop an approach that applies goal recognition over video streams. 25 Future work

leonardo.amado@acad.pucrs.br joao.aires.001@acad.pucrs.br 26 Goal Recognition in Latent Space Thank you!

goal recognition in latent space Leonardo Amado, Ramon Fraga Pereira, - PowerPoint PPT Presentation

Mauricio Magnaguagno, Roger Granada and Felipe Meneguzzi July 2018 goal recognition in latent space Leonardo Amado, Ramon Fraga Pereira, Joo Paulo Aires , PUCRS introduction Goal recognition is the task of inferring the intended goal of

Empirical Analysis of Latent Space Embedding David Mount and Eunhui Park Department of Computer

1 Latent variable models In the next section we will discuss latent variable models for

Part III: Latent Tree Models Le Song ICML 2012 Tutorial on Spectral Algorithms for Latent

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches

Action recognition in videos Action recognition in videos Cordelia Schmid Cordelia Schmid

Action recognition in videos II Action recognition in videos II Cordelia Schmid INRIA Grenoble

Action recognition in videos Cordelia Schmid Action recognition - goal Short actions, i.e.

Action recognition in videos Cordelia Schmid Action recognition - goal Short actions, i.e.

DeepMDP Learning Latent Space Continuous Models for Representation Learning Carles Gelada,

EMPLOYEE RECOGNITION OBJECTIVES Types of recognition Creating a culture of recognition

License Plate Recognition License Plate Recognition License Plate Recognition License Plate

Instance-level Recognition Pingmei Xu Object Recognition Friends SE01EP02 Recognition: Find the

Face detection and recognition Detection Recognition Sally Face detection &

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model CS330

C unobserved construct (e.g. Disordered v. Non- Disordered) Latent classes are mutually

Multi-Site Vs. Domain A Commerce Case Study May 7, 2019 Page 1 | Multi-Site Vs Domain: A

Bayesian inference and mathematical imaging. Part III: probability & convex optimisation. Dr.

M. Kardouchi & E. Hervet Universit de Moncton Moncton, NB, Canada 30-mai-03 1

Cluster Architectures Overview Cluster Computing The Problem The Solution The Anatomy

RealCertify: a Maple package for certifying non-negativity Victor Magron , CNRS Joint work with

But the relation sc hemas m ust b e the same. 2. : Pic king certain ro ws

POLY : A new polynomial data structure for Maple. Michael Monagan Center for Experimental and

Exam 2 Review Objects, Arrays, Strings Objects Defined by template given in as class

goal recognition in latent space Leonardo Amado, Ramon Fraga Pereira, - PowerPoint PPT Presentation

Mauricio Magnaguagno, Roger Granada and Felipe Meneguzzi July 2018 goal recognition in latent space Leonardo Amado, Ramon Fraga Pereira, Joo Paulo Aires , PUCRS introduction Goal recognition is the task of inferring the intended goal of

Empirical Analysis of Latent Space Embedding David Mount and Eunhui Park Department of Computer

1 Latent variable models In the next section we will discuss latent variable models for

Part III: Latent Tree Models Le Song ICML 2012 Tutorial on Spectral Algorithms for Latent

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches

Action recognition in videos Action recognition in videos Cordelia Schmid Cordelia Schmid

Action recognition in videos II Action recognition in videos II Cordelia Schmid INRIA Grenoble

Action recognition in videos Cordelia Schmid Action recognition - goal Short actions, i.e.

Action recognition in videos Cordelia Schmid Action recognition - goal Short actions, i.e.

DeepMDP Learning Latent Space Continuous Models for Representation Learning Carles Gelada,

EMPLOYEE RECOGNITION OBJECTIVES Types of recognition Creating a culture of recognition

License Plate Recognition License Plate Recognition License Plate Recognition License Plate

Instance-level Recognition Pingmei Xu Object Recognition Friends SE01EP02 Recognition: Find the

Face detection and recognition Detection Recognition Sally Face detection &amp;

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model CS330

C unobserved construct (e.g. Disordered v. Non- Disordered) Latent classes are mutually

Multi-Site Vs. Domain A Commerce Case Study May 7, 2019 Page 1 | Multi-Site Vs Domain: A

Bayesian inference and mathematical imaging. Part III: probability &amp; convex optimisation. Dr.

M. Kardouchi &amp; E. Hervet Universit de Moncton Moncton, NB, Canada 30-mai-03 1

Cluster Architectures Overview Cluster Computing The Problem The Solution The Anatomy

RealCertify: a Maple package for certifying non-negativity Victor Magron , CNRS Joint work with

But the relation sc hemas m ust b e the same. 2. : Pic king certain ro ws

POLY : A new polynomial data structure for Maple. Michael Monagan Center for Experimental and

Exam 2 Review Objects, Arrays, Strings Objects Defined by template given in as class

Face detection and recognition Detection Recognition Sally Face detection &

Bayesian inference and mathematical imaging. Part III: probability & convex optimisation. Dr.

M. Kardouchi & E. Hervet Universit de Moncton Moncton, NB, Canada 30-mai-03 1