Constructive Interaction for Talking about Interesting Topics - - PowerPoint PPT Presentation

constructive interaction for talking about interesting
SMART_READER_LITE
LIVE PREVIEW

Constructive Interaction for Talking about Interesting Topics - - PowerPoint PPT Presentation

Constructive Interaction for Talking about Interesting Topics Kristiina Jokinen & Graham Wilcock University of Tartu & University of Helsinki Outline Previous Work Emergent verbal behaviour Current Work Open-domain


slide-1
SLIDE 1

Constructive Interaction for Talking about Interesting Topics

Kristiina Jokinen & Graham Wilcock

University of Tartu & University of Helsinki

slide-2
SLIDE 2

Outline

  • Previous Work
  • Emergent verbal behaviour
  • Current Work
  • Open-domain conversations
  • Future Work
  • Speech, gaze and gesture

Jokinen & Wilcock LREC 2012, Istanbul 2

slide-3
SLIDE 3

Previous Work

  • Emergent Verbal Behaviour in

Human-Robot Interaction

  • CogInfoCom 2011, Budapest
  • Adding Speech to a Robotics

Simulator (demo)

  • IWSDS 2011, Granada

Jokinen & Wilcock LREC 2012, Istanbul 3

slide-4
SLIDE 4

Pyro: Python Robotics

Open source Python robotics toolkit

http://pyrobotics.org

For teaching and research

Simulators and real robots

Artificial intelligence and robotics

Reinforcement learning, fuzzy decisions,

neural networks, genetic algorithms, ...

4 LREC 2012, Istanbul Jokinen & Wilcock

slide-5
SLIDE 5

5 LREC 2012, Istanbul Jokinen & Wilcock

slide-6
SLIDE 6

Pyspeech: Python Speech

Open source Python speech interface

http://code.google.com/p/pyspeech

Speech input and output

Speech recognition functions Text-to-speech functions

For Windows computers

Uses Microsoft Speech Engine

6 LREC 2012, Istanbul Jokinen & Wilcock

slide-7
SLIDE 7

Non-verbal behaviour

  • Autonomous behaviour
  • Wander randomly
  • Avoid obstacles
  • Follow a wall
  • etc.
  • Robot acts silently

Jokinen & Wilcock LREC 2012, Istanbul 7

slide-8
SLIDE 8

Verbal behaviour

  • Robot explains its
  • wn actions
  • ”object on right”
  • (therefore) ”turn left”
  • Monologue
  • One-way info
  • Can be irritating
  • ”clear, clear, clear...”

Jokinen & Wilcock LREC 2012, Istanbul 8

slide-9
SLIDE 9

Cooperative Verbal Behaviour

Autonomous verbal behaviour

Robot explains its actions

Interactive verbal behaviour

Human requests ”go back”, ”left”

Cooperative verbal behaviour

Human requests ”talk less!” Robot changes its verbosity level No repeating, only says new things

9 LREC 2012, Istanbul Jokinen & Wilcock

slide-10
SLIDE 10

Outline

  • Previous Work
  • Emergent verbal behaviour
  • Current Work
  • Open-domain conversations
  • Future Work
  • Speech, gaze and gesture

Jokinen & Wilcock LREC 2012, Istanbul 10

slide-11
SLIDE 11

Fixed-domain Dialogues

Example: Flight reservations

Using finite state transitions Easy to implement in Pyro simulator

Fixed-domain database

Flights, cities, days Easy to add new flights, new cities

Difficult to switch domains

Jokinen & Wilcock LREC 2012, Istanbul 11

slide-12
SLIDE 12

Open-domain Dialogues

Example: Talking about Shakespeare

Using information from the web Wikipedia articles on any topic

If robot has identified the topic:

Get Wikipedia article about Shakespeare Start reading out the first paragraph

How to continue the conversation?

Jokinen & Wilcock LREC 2012, Istanbul 12

slide-13
SLIDE 13

13 LREC 2012, Istanbul Jokinen & Wilcock

slide-14
SLIDE 14

14 LREC 2012, Istanbul Jokinen & Wilcock

slide-15
SLIDE 15

How to Continue?

Split text into chunks

Speak a chunk, pause for feedback Is the human interested or not?

Feedback may be non-verbal

Gaze, facial expression, body language

If human seems interested:

Continue same topic, next chunk

15 LREC 2012, Istanbul Jokinen & Wilcock

slide-16
SLIDE 16

16 LREC 2012, Istanbul Jokinen & Wilcock

slide-17
SLIDE 17

How to Switch Topics?

Dialogue, not monologue

Don’t read out entire article Would be irritating (need ”talk less!”)

Hypertext links

Links identify possible topic shifts Topic navigation by clicking on links Clicking shows human’s positive interest

17 LREC 2012, Istanbul Jokinen & Wilcock

slide-18
SLIDE 18

Topic and NewInfo

Jokinen PhD thesis (1994)

Response Generation in Information-seeking Dialogues

Jokinen & Wilcock (2003)

Adaptivity and Response Generation in a Spoken Dialogue System

18 LREC 2012, Istanbul Jokinen & Wilcock

slide-19
SLIDE 19

Topic and NewInfo

Topic

Need to know the current topic Need to keep track of topic shifts

NewInfo

Gives some new information about Topic Dialogue response is based on NewInfo

19 LREC 2012, Istanbul Jokinen & Wilcock

slide-20
SLIDE 20

Topic Chains

Hyperlinks -> NewInfos -> topic shifts

Robot: ”Shakespeare was born and raised in

Stratford-upon-Avon” (NewInfo)

Human: ”Stratford-upon-Avon?” (topic shift) Robot: ”Stratford-upon-Avon is a market town

and civil parish in Warwickshire, England”

Human: ”Warwickshire?” (topic shift)

20 LREC 2012, Istanbul Jokinen & Wilcock

slide-21
SLIDE 21

21 LREC 2012, Istanbul Jokinen & Wilcock

slide-22
SLIDE 22

When to Switch Topic

Echoing a NewInfo

Echoing shows human’s interest ”Shakespeare’s sexuality?”

Immediate topic shift

Download new article ”The sexuality of William Shakespeare” Start speaking first chunk

22 LREC 2012, Istanbul Jokinen & Wilcock

slide-23
SLIDE 23

23 LREC 2012, Istanbul Jokinen & Wilcock

slide-24
SLIDE 24

When to Continue

Feedback may be non-verbal

Eye gaze (looking at robot) Facial expression Body language (coming closer)

If human seems interested:

Continue same topic, next chunk

24 LREC 2012, Istanbul Jokinen & Wilcock

slide-25
SLIDE 25

25 LREC 2012, Istanbul Jokinen & Wilcock

slide-26
SLIDE 26

Switching Topics

Echoing

May show human didn’t hear clearly ”Hamnet?” (... or was it ”Hamlet”?)

Topic shift anyway

Download new article ”Hamnet Shakespeare” Start speaking first chunk

26 LREC 2012, Istanbul Jokinen & Wilcock

slide-27
SLIDE 27

27 LREC 2012, Istanbul Jokinen & Wilcock

slide-28
SLIDE 28

28 LREC 2012, Istanbul Jokinen & Wilcock

slide-29
SLIDE 29

How to Find Initial Topic?

Quite Difficult!

Open vocabulary speech recognition? Basically, this doesn’t work

Spelling the Topic?

”S”, ”H”, ”A”, ”K”, ... ”Sierra”, ”Hotel”, ”Alpha”, ”Kilo”, ... Wikipedia helps if first few letters known

29 LREC 2012, Istanbul Jokinen & Wilcock

slide-30
SLIDE 30

30 LREC 2012, Istanbul Jokinen & Wilcock

slide-31
SLIDE 31

Outline

  • Previous Work
  • Emergent verbal behaviour
  • Current Work
  • Open-domain spoken dialogues
  • Future Work
  • Speech, gaze and gesture

Jokinen & Wilcock LREC 2012, Istanbul 31

slide-32
SLIDE 32

Future Work

  • eNTERFACE 2012
  • Speech, Gaze and Gesturing –

Multimodal Conversational Interaction with Nao Robot

  • International Summer Workshop on

Multimodal Interaction, Metz, July 2012

Jokinen & Wilcock LREC 2012, Istanbul 32

slide-33
SLIDE 33

Thank you!

Questions? Nao says: Ask the

dumb human today, but in future you’ll get better answers from clever me.

Jokinen & Wilcock LREC 2012, Istanbul 33