Steve Kroon PLEASED: Planning, Learning, and Search for - - PowerPoint PPT Presentation

steve kroon
SMART_READER_LITE
LIVE PREVIEW

Steve Kroon PLEASED: Planning, Learning, and Search for - - PowerPoint PPT Presentation

Steve Kroon PLEASED: Planning, Learning, and Search for Decision-making. http://www.cs.sun.ac.za/~kroon/decision.html Maties Machine Learning: 25 October 2019 This group considers almost any aspect of the general decision-making problem,


slide-1
SLIDE 1

Steve Kroon

PLEASED: Planning, Learning, and Search for Decision-making. http://www.cs.sun.ac.za/~kroon/decision.html

Maties Machine Learning: 25 October 2019

slide-2
SLIDE 2

“This group considers almost any aspect of the general decision-making problem, including sequential decision-making under uncertainty. Major sub-problems we consider are planning, machine learning, and search algorithms. Our approach is grounded in probability theory and game theory for managing uncertainty and multi-agent systems.”

Images: https://mimiandeunice.com/wp-content/uploads/2011/08/ME_447_Decisions-640x199.png

slide-3
SLIDE 3

Single agent, single decision

Principled - grounded in:

  • Probability theory
  • Decision theory
  • Game theory

Typically requires:

  • A model (perhaps >1)
  • Data
  • Payoff function

Good foundation: Bayesian decision theory

Images:https://www.azimuthproject.org/azimuth/files/BayesianSDT-bigpic.png

slide-4
SLIDE 4

Extensions

  • Sequential decision making

○ Search ○ Planning ○ Bayes filter ○ Reinforcement learning

  • Multi-agent settings

○ Adversarial ○ Collaborative

  • Tractable inference/decision making

○ Inference approaches ○ Search techniques ○ Choice of approximations

Images:https://www.azimuthproject.org/azimuth/files/BayesianSDT-bigpic.png

slide-5
SLIDE 5

Group members

slide-6
SLIDE 6

Learning theory*

*understanding relationships between and properties of machine learning/statistical models and approaches to fitting them

slide-7
SLIDE 7

Search/Planning

slide-8
SLIDE 8

Bayesian analysis

Images: https://miro.medium.com/max/1002/1*hblsrFOWViHS43l5YpUXeQ.png

slide-9
SLIDE 9

Latent variable models/ variational inference

Images:https://miro.medium.com/max/800/1*pZo_IcxW1GVuH2vQKdoIMQ.jpeg

slide-10
SLIDE 10

Process monitoring and fault detection/diagnosis

Images:https://www.ericsson.com/49d220/assets/global/qbank/2019/06/13/architecture-50-109173resize436234crop00436234autoorientquality90stripbackground23ffffffextensionjpgid8.jpg

slide-11
SLIDE 11

Specific interests

Learning Theory* (mostly NNs) Search/Planning (mostly MCTS) Bayesian analysis Latent variable models and variational inference Process monitoring, fault detection and diagnosis

*understanding relationships between and properties of machine learning/statistical models and approaches to fitting them

Common elements:

  • Regularizing effects of model,

inference, and optimization

  • Tractable inference/search
  • Dynamical systems
slide-12
SLIDE 12

THANK YOU - QUESTIONS?