Raising AI: Tutoring Matters Jordi Bieger 1 (jbieger@gmail.com), - - PowerPoint PPT Presentation

raising ai tutoring matters
SMART_READER_LITE
LIVE PREVIEW

Raising AI: Tutoring Matters Jordi Bieger 1 (jbieger@gmail.com), - - PowerPoint PPT Presentation

Raising AI: Tutoring Matters Jordi Bieger 1 (jbieger@gmail.com), Kristinn R. Thrisson 1,2 & Deon Garrett 2 1 Reykjavik University | Center for Analysis & Design of Intelligent Agents 2 Icelandic Institute for Intelligent Machines Raising


slide-1
SLIDE 1

Raising AI: Tutoring Matters

Jordi Bieger1 (jbieger@gmail.com), Kristinn R. Thórisson1,2 & Deon Garrett2

1Reykjavik University | Center for Analysis & Design of Intelligent Agents 2Icelandic Institute for Intelligent Machines

slide-2
SLIDE 2

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Path to Adult-Level AI

Design & Implementation Learning Profit?

slide-3
SLIDE 3

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

  • Typical AI project:

– The system only learns on the final task – The system is alone

  • Raising AI:

– Helping an AI system learn, grow from baby-AI into adult-AI, and realize its potential

slide-4
SLIDE 4

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Why raising?

  • Guidance necessary to deal with complex new

situations

  • Less sophisticated system needed to reach the

same level of intelligence

  • Biologically plausible
slide-5
SLIDE 5

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Goals for the paper

  • Argue for the importance of research into raising AI
  • Discuss issues related to raising and tutoring
  • Unite research from different fields under the

perspective of raising AI

  • Provide a starting point for various techniques for

tutoring AI

slide-6
SLIDE 6

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Tutoring matters

  • Focus on tasks rather than environments or

cognitive stages

  • Tutoring methods and learning algorithms impose

requirements on each other

  • Tutoring doesn’t always help
  • Tutoring can be difficult
  • Human tutors may be expensive and/or inefficient
slide-7
SLIDE 7

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Tutoring Techniques

  • Heuristic Rewarding
  • Decomposition
  • Simplification
  • Situation Selection
  • Teleoperation
  • Demonstration
  • Coaching
  • Explanation
  • Cooperation
slide-8
SLIDE 8

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Tutoring by Demonstration

  • Show the learner what

to do

  • Add tutor observation

dimensions to state

  • Requirements:

– Generalization – Desire to imitate – Ability to map tutor actions to learner actions

  • Tabular Q-learning agent
  • Simple grid navigation task
slide-9
SLIDE 9

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Questions?

  • Heuristic Rewarding
  • Decomposition
  • Simplification
  • Situation Selection
  • Teleoperation
  • Demonstration
  • Coaching
  • Explanation
  • Cooperation
slide-10
SLIDE 10

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

http://en.ru.is

end of presentation

slide-11
SLIDE 11

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Heuristic Rewards

  • Giving the learner intermediate feedback about

performance

  • Related:

– Reward shaping – Gamification – Heuristics in e.g. minimax game playing

slide-12
SLIDE 12

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Decomposition

  • Decomposition of whole, complex tasks into

smaller components

  • Related:

– Whole-task vs. part-task training – Curriculum learning – (Catastrophic interference) – (Transfer learning) – (Multitask learning)

slide-13
SLIDE 13

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Simplification

  • Starting with a simplified version of the final task

and gradually increasing the complexity

  • Related:

– Shaping (B.F. Skinner) – Curriculum learning – Decomposition

slide-14
SLIDE 14

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Situation Selection

  • Selecting situations (or data) for the learner to

focus on

– e.g. simpler or more difficult situations

  • Related

– Boosting – ML application development – Big Data – Active learning / teaching

slide-15
SLIDE 15

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Teleoperation

  • Temporarily taking control of the learner’s actions

so they can experience them

– Right level of abstraction

  • Applications:

– Tennis / golf / chess – Robot ping pong – Artificial tutor

slide-16
SLIDE 16

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Demonstration

  • Showing the learner how to accomplish a task
  • Requirements:

– Desire to imitate – Ability to map tutor’s actions onto own actions – Generalization ability

  • Related:

– Apprenticeship learning – Inverse reinforcement learning – Imitation learning

slide-17
SLIDE 17

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Coaching

  • Giving the learner direct instructions of what action

to take during the task

  • Requirements:

– Ability to map language-based instruction onto actions – Generalization ability

  • Related:

– Supervised learning

slide-18
SLIDE 18

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Explanation

  • Explaining to the learner how to approach certain

situations before the starts (a new instance of) the task

  • Requirements:

– Language – Generalization ability

  • Related:

– Imperative programming – Analogies

slide-19
SLIDE 19

http://en.ru.is

Raising AI | Jordi Bieger (jbieger@gmail.com)

Cooperation

  • Doing a task together with the learner to facilitate
  • ther tutoring techniques