@dkasenberg
*Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
Generatjng Explanatjons for Temporal Logic Planner Decisions Daniel - - PowerPoint PPT Presentation
Generatjng Explanatjons for Temporal Logic Planner Decisions Daniel Kasenberg*, Ravenna Thielstrom, and Matuhias Scheutz *Daniel Kasenberg dmk@cs.tufus.edu @dkasenberg dkasenberg.github.io Our (long-term) goal Agents which can Learn
@dkasenberg
*Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
– Learn interpretable objectjves (through language and
– Behave competently with respect to these objectjves,
– Explain their behaviors to human teammates in terms of
[1] Kasenberg, D., & Scheutz, M. (2017, December). Interpretable apprentjceship learning with temporal logic specifjcatjons. In 2017 IEEE 56th Annual Conference on Decision and Control (CDC) (pp. 4914-4921). IEEE. [2] Kasenberg, D., & Scheutz, M. (2018, April). Norm confmict resolutjon in stochastjc domains. In Thirty-Second AAAI Conference on Artjfjcial Intelligence.
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
– is a set of atomic propositjons – is the set of propositjons true at
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
– Lexicographic ordering
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
– a Markov Decision Process – a set of (safe/co-safe) LTL objectjves – are the weight and priority vectors respectjvely
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
– e.g. → good prefjx any fjnite
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
–
–
–
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
– is an LTL statement – is a set of (tjmestep, literal) pairs suffjcient to show
that is unsatjsfactory for
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
[3] Kasenberg, D., Roque, A., Thielstrom, R. and Scheutz, M., 2019. Engaging in Dialogue about an Agent’s Norms and Behaviors. In Proceedings of the 1st Workshop on Interactjve Natural Language Technology for Explainable Artjfjcial Intelligence (NL4XAI 2019) (pp. 26-28). [4] Kasenberg, D., Roque, A., Thielstrom, R., Chita-Tegmark, M. and Scheutz, M., 2019. Generatjng justjfjcatjons for norm-related agent decisions. In Proceedings of the 12th Internatjonal Conference on Natural Language Generatjon (pp. 484-493).
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
– Buys the glasses, leaves the watch
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
– Need to represent multjple trajectories or probability
– Impractjcal for nontrivial domains
@dkasenberg
Daniel Kasenberg
dmk@cs.tufus.edu dkasenberg.github.io
– NSF IIS grant 43520050 – NASA grant C17-2D00-TU
– Matuhias Scheutz (advisor; co-author) – Ravenna Thielstrom (co-author) – Antonio Roque – Meia Chita-Tegmark – others