Dialogue Structure Annotation for Multi-Floor Interaction David - - PowerPoint PPT Presentation

dialogue structure annotation for
SMART_READER_LITE
LIVE PREVIEW

Dialogue Structure Annotation for Multi-Floor Interaction David - - PowerPoint PPT Presentation

Dialogue Structure Annotation for Multi-Floor Interaction David Traum, Cassidy Henry, Stephanie Lukin, Ron Artstein, Felix Gervitz, Kimberly A. Pollard, Claire Bonial, Su Lei, Clare R. Voss, Matthew Marge, Cory J. Hayes, Susan G. Hill The work


slide-1
SLIDE 1

The work depicted here was sponsored by the U.S. Army. Statements and opinions expressed do not necessarily reflect the position or the policy of the United States Government, and no official endorsement should be inferred.

David Traum, Cassidy Henry, Stephanie Lukin, Ron Artstein, Felix Gervitz, Kimberly A. Pollard, Claire Bonial, Su Lei, Clare R. Voss, Matthew Marge, Cory J. Hayes, Susan G. Hill

Dialogue Structure Annotation for Multi-Floor Interaction

slide-2
SLIDE 2

2

Outline

1. Conceptual Framework ▪ Meso-level dialogue structure ▪ Multi-floor Dialogue & multi- communicators ▪ Multi-floor dialogue structure 2. Multi-floor Dialogue Structure Annotation scheme 3. Data ▪ Doman: Human-robot collaboration ▪ 2 Wizards ▪ Example Annotations ▪ Corpus Statistics

4. Structure Patterns

5. Uses of data and Future work

slide-3
SLIDE 3

3

Types of Dialogue Structure (Traum & Nakatani 1999)

Structure Content ▪ Intentional ▪ Linguistic ▪ Relational/Rhetorical ▪ Attentional State ▪ Turn-taking/floor management ▪ Grounding ▪ Participant structure Structure Granularity ▪ Micro – within a single turn ▪ Meso – short subdialogue ▪ Macro – full conversation

slide-4
SLIDE 4

4

Meso-level Dialogue Structure Annotations

Structure Types

▪ Intentional:

Transaction Units – smallest unit of specified and performed action, including all dialogue needed to accomplish this

▪ Relational/Rhetorical :

Relations between utterances within a transaction Annotations ▪ TUs: cluster of utterances

▪ Not necessarily sequential

▪ Relations: Label 2nd part utterance with

▪ Antecedent ▪ Relation type

slide-5
SLIDE 5

5

Example: At a lunch counter ▪ Customer: I’d like a cheeseburger ▪ Waiter: one cheeseburger. ▪ Waiter: (placing burger in bag) here you go. ▪ Customer: thanks! ▪ Waiter: would you like fries with that? ▪ Customer: Sure, a large one please! ▪ Waiter: (placing fries box in bag): one large fries.

slide-6
SLIDE 6

6

Example: Transaction Units (TUs) ▪ Customer: I’d like a cheeseburger ▪ Waiter: one cheeseburger. ▪ Waiter: (placing burger in bag) here you go. ▪ Customer: thanks! ▪ Waiter: would you like fries with that? ▪ Customer: Sure, a large one please! ▪ Waiter: (placing fries box in bag): one large fries.

slide-7
SLIDE 7

7

Example: Relations

  • 1. Customer: I’d like a cheeseburger
  • 2. Waiter: one cheeseburger.
  • 3. Waiter: (placing burger in bag) here you go.
  • 4. Customer: thanks!
  • 5. Waiter: would you like fries with that?
  • 6. Customer: Sure, large please!
  • 7. Waiter: (placing fries in bag): one large fries.

Acknowledgement Acknowledgement 3rd turn feedback Answer Acknowledgement

slide-8
SLIDE 8

8

Example: TU Structures

  • 1. Customer: I’d like a cheeseburger
  • 2. Waiter: one cheeseburger.
  • 3. Waiter: (placing burger in bag) here you go.
  • 4. Customer: thanks!
  • 5. Waiter: would you like fries with that?
  • 6. Customer: Sure, large please!
  • 7. Waiter: (placing fries in bag): one large fries.

Acknowledgement Acknowledgement 3rd turn feedback Answer Acknowledgement 1 2 3 4 6 5 7

Ack Ack 3rd TF Ack Answer

slide-9
SLIDE 9

9

Floor and Participant Structure

Participants and Floors ▪ Single floor Dyadic (A,B) ▪ Single floor Multiparty: (A,B,C,…) ▪ Multiple floors (with different sets

  • f participants): {(A B), (C D E)}

Kinds of Interactions between Floors

▪ Same purpose, distinct participants ▪ Co-located, observable

▪ Participants play different roles for different floors (e.g. active participant vs overhearer)

▪ Some Shared participant(s)

▪ multi-communicating (Rentch et al)

▪ Multi-floor dialogue:

▪ Same purpose ▪ Some Multi-communicating participant(s) ▪ Content flows across floors

slide-10
SLIDE 10

10

Examples of (observable) Multi-floor dialogue Live Interpretation Indirect Action

slide-11
SLIDE 11

11

Multi-floor Relation types

▪ Expansions - relate utterances that are produced by the same participant within the same floor. ▪ Responses - relate utterances by different participants in the same floor. ▪ Translations - relate utterances in different floors Examples: 1. (A,B) A->B: I’ll have a cheeseburger 2. (A,B) A->B: and a small coke 1. (A,B) A->B: a small coke 2. (A,B) B->A: no coke, pepsi 1. (A,B) A->B: I’ll have a cheeseburger 2. (B,C) B->C: Cheeseburger!!

slide-12
SLIDE 12

12

Relations by type (1)

Expansions

a) Continue b) (self-) Correction c) Link-next d) Summarization

Translation

a) Translation <from,to> b) Partial c) Quotation d) Comment

slide-13
SLIDE 13

13

Relations by type (2) Responses

  • a. Processing: positive feedback at perception level
  • b. acknowledgement: positive feedback of

understanding

  • c. clarification: negative feedback of understanding
  • d. question-response
  • e. reciprocal response: e.g. “hello” -> “hello”

f. 3rd turn feedback: response to feedback

  • g. other
slide-14
SLIDE 14

14

Response sub-relations

acknowledgment ▪ ack-done ▪ ack-doing ▪ ack-wilco ▪ ack-understand ▪ ack-try ▪ ack-unsure ▪ ack-cant clarification ▪ req-clar ▪ clar-repair ▪ missing info ▪ nack ▪ req-repeat ▪ clar-repeat question-response ▪ answer ▪ Non-Answer-Response (NAR)

slide-15
SLIDE 15

15

Domain: Human-Robot Collaboration

15

(Marge et al., 2016, IEEE RO-MAN)

Human Commander VIEWS ROBOT (remote from Commander) VERBAL COMMANDS

Remote reconnaissance task

  • Unfamiliar environment
  • Bandwidth limitations
  • User and robot not co-present
  • What would the human users want to

say?

  • Need to collect a corpus in order to train

and evaluate the system.

  • How would users naturally

collaborate with this robot teammate?

slide-16
SLIDE 16

17

Multi-floor data collection setting

  • Robot assisted by two human

“wizards”

  • Dialogue Manager (DM) is the

language “brain” of the robot

  • Robot Navigator (RN) moves

robot based on instructions 17

Human Commander VIEWS “Behind the scenes” RN MOVES ROBOT

DM-WIZARD Robot Navigator

VERBAL COMMANDS

slide-17
SLIDE 17

18

Example Interaction

18

slide-18
SLIDE 18

19

Commander

19 Commander – Human Participant

  • Verbally Instructs a Robot
  • Sees text message responses,

LIDAR map, and images sent from

  • nboard robot
slide-19
SLIDE 19

20

Wizard #1 – Dialogue Manager

20 Dialogue Manager Wizard (DM-Wizard, DM)

  • Handles all language functions of “robot”
  • Responds to CMD and robot navigator (RN) via text message
  • Serves as mediator between RN and CMD
slide-20
SLIDE 20

21

Wizard #2 – Robot Navigator

21

move

Robot Navigator Wizard (RN-Wizard, RN)

  • Handles all navigation function of “robot”
  • Constrained language received -> joysticks robot
  • Separation of wizards:
  • reduces cognitive load/wizard labor
  • removes intuition of interpreting commands
slide-21
SLIDE 21

22

Example Interaction

22

Proceed forward

slide-22
SLIDE 22

23

Example Interaction

23

How far? You can tell me to move to an object that you see, or a distance

slide-23
SLIDE 23

24

Example Interaction

24

Proceed forward three feet

slide-24
SLIDE 24

25

Example Interaction

25

Executing…

slide-25
SLIDE 25

26

Example Interaction

26

move forward three feet

slide-26
SLIDE 26

27

Example Interaction

27

move

*moves robot forward 3 feet*

slide-27
SLIDE 27

28

Example Interaction

28

done

slide-28
SLIDE 28

29

Example Interaction

29

done

slide-29
SLIDE 29

30

Data - Transcripts ▪ Time aligned transcripts of 4 data streams

▪ 2 audio streams

▪ CMD and RN

▪ 2 text streams

▪ DM->CMD, DM->RN

▪ Two conversational floors present

Commander

(Audio Stream 1)

DM->Commander

(Chat Room 1)

DM->RN

(Chat Room 2)

RN

(Audio Stream 2)

face the doorway on your right and take a picture there’s a door ahead of me on the right and one just behind me on the

  • right. which would

you like me to face? the door ahead of you on the right move to face the door ahead of you on the right, image executing... image sent sent

30

slide-30
SLIDE 30

32

Left floor: CMD, DM

Commander

(Audio Stream 1)

DM->Commander

(Chat Room 1)

DM->RN

(Chat Room 2)

RN

(Audio Stream 2)

face the doorway on your right and take a picture there’s a door ahead of me on the right and one just behind me on the

  • right. which would

you like me to face? the door ahead of you on the right move to face the door ahead of you on the right, image executing... image sent sent

Commander Participant VIEWS “Behind the scenes” RN MOVES ROBOT

DM-WIZARD Robot Navigator

VERBAL COMMANDS

32

slide-31
SLIDE 31

33

Right Floor: DM, RN

Commander

(Audio Stream 1)

DM->Commander

(Chat Room 1)

DM->RN

(Chat Room 2)

RN

(Audio Stream 2)

face the doorway on your right and take a picture there’s a door ahead of me on the right and one just behind me on the

  • right. which would

you like me to face? the door ahead of you on the right move to face the door ahead of you on the right, image executing... image sent sent

Commander Participant VIEWS “Behind the scenes” RN MOVES ROBOT

DM-WIZARD Robot Navigator

VERBAL COMMANDS

33

slide-32
SLIDE 32

34

DM translates (to) left and right

Commander

(Audio Stream 1)

DM->Commander

(Chat Room 1)

DM->RN

(Chat Room 2)

RN

(Audio Stream 2)

face the doorway on your right and take a picture there’s a door ahead of me on the right and one just behind me on the

  • right. which would

you like me to face? the door ahead of you on the right move to face the door ahead of you on the right, image executing... image sent sent

Commander Participant VIEWS “Behind the scenes” RN MOVES ROBOT

DM-WIZARD Robot Navigator

VERBAL COMMANDS

34

slide-33
SLIDE 33

35

Corpus Statistics

Basics

▪ 60 dialogues

▪ 20 participants ▪ 3 dialogues each ▪ ~20 hours

▪ 11454 Total Utterances

▪ 3,573 from commanders ▪ 5,154 from DM ▪ 2,727 from RN

Dialogue Structure Annotations ▪ 2,230 Transaction Units ▪ 11,058 Relations ▪ 644 Unique TU Tree structures

▪ Classified into 5 types 35

slide-34
SLIDE 34

36

Frequent Relations

Type Subtype # % Translation 4282 39 Translate-r 2355 21 Translate-l 1911 17 comment 21 <1 Expansion 1583 14 Continue 1175 11 Link-next 337 3 correction 50 <1 summarize 20 <1 Type Subtype # % Response 5193 47 acknowledge 3998 36 clarification 569 5 processing 315 3 Question- response 212 2

  • ther

48 <1 3rd turn feedback 37 <1 reciprocal 14 <1

slide-35
SLIDE 35

37

Structural Types of Transactions (TUs)

▪ Minimal TU: single instruction, acks, no repair ▪ Extended-Link TU: multiple instructions, with expansions ▪ Repair TU: contains at least one repair

▪ successfully resolved or ▪ abandoned

▪ QA TU: starts with question & response rather than instruction

▪ simple question, ▪ later instruction

▪ Other TU: none of the above (e.g. no response or translation)

slide-36
SLIDE 36

38

Example minimal TU

Left Floor Right Floor Annotations Utt # Commander DM→CMD DM→RN RN TU # Antecedent Relation 1 move forward three feet 1 2

  • k

1 1 ack-wilco 3

move forward 3 feet

1 1 translation-r 4 done 1 3 ack-done 5

I moved forward 3 feet

1 4 translation-l

slide-37
SLIDE 37

39

Structural Types of Transactions (TUs)

▪ Minimal TU: single instruction, acks, no repair ▪ Extended-Link TU: multiple instructions, with expansions ▪ Repair TU: contains at least one repair

▪ successfully resolved or ▪ abandoned

▪ QA TU: starts with question & response rather than instruction

▪ simple question, ▪ later instruction

▪ Other TU: none of the above (e.g. no response or translation)

slide-38
SLIDE 38

40

Example Extended-Link TU

Left Floor Right Floor Annotations

Utt #

Commander DM→CMD DM→RN RN TU Ant Rel

1

face west 1

2

and take a photo 1 1 continue

3

face west, photo 1 2* translation-r

4

executing... 1 2* ack-doing

5

image sent 1 3 ack-done

6

sent

1 5 translation-l

slide-39
SLIDE 39

41

Structural Types of Transactions (TUs)

▪ Minimal TU: single instruction, acks, no repair ▪ Extended-Link TU: multiple instructions, with expansions ▪ Repair TU: contains at least one repair

▪ successfully resolved or ▪ abandoned

▪ QA TU: starts with question & response rather than instruction

▪ simple question, ▪ later instruction

▪ Other TU: none of the above (e.g. no response or translation)

slide-40
SLIDE 40

42

Example Repair TU

Left Floor Right Floor Annotations

Utt #

Commander DM→CMD DM→RN RN

TU

Ant Relation

1 move to where you see the first cone 1 2 I’m not sure which object you are referring to. Can you describe it in another way, using color or its location? 1 1 request- clarification 3 move to the cone on the right a red cone on the right 1 2 clarification- repair 4 move to face the cone on the right 1 3 translation-r 5 executing... 1 3 ack-doing 6 take another picture 2 7 done 1 4 ack-done 8 done 1 7 translation-l 9 image 2 6 translation-r 10 image sent 2 9 ack-done

slide-41
SLIDE 41

43

Structural Types of Transactions (TUs)

▪ Minimal TU: single instruction, acks, no repair ▪ Extended-Link TU: multiple instructions, with expansions ▪ Repair TU: contains at least one repair

▪ successfully resolved or ▪ abandoned

▪ QA TU: starts with question & response rather than instruction

▪ simple question, ▪ later instruction

▪ Other TU: none of the above (e.g. no response or translation)

slide-42
SLIDE 42

44

Example Q&A TUS

Left Floor Right Floor Annotations

Utt #

Commander DM→Commander DM→R N RN TU Ant Rel

1

how many window openings do you see in front of you 1

2

three 1 1 answer

3

do you see a yellow flashlight 2

4

processing... 2 3 processing

5

I’m not sure 2 3 answer

6

If you describe an object, you can help me to learn what it is. 2 3 non-answer response

slide-43
SLIDE 43

45

Structural Types of Transactions (TUs)

▪ Minimal TU: single instruction, acks, no repair ▪ Extended-Link TU: multiple instructions, with expansions ▪ Repair TU: contains at least one repair

▪ successfully resolved or ▪ abandoned

▪ QA TU: starts with question & response rather than instruction

▪ simple question, ▪ later instruction

▪ Other TU: none of the above (e.g. no response or translation)

slide-44
SLIDE 44

46

Examples of Other TU

Left Floor Right Floor Annotations

Utt #

Commander DM→Commander DM→RN RN TU Ant Rel

1

i'm ready

1

2

I'm also ready

1 1 Reciprocal- response

3

Would you like me to send a picture so you can see the room?"

2

4

Turn 90 degrees left 3

slide-45
SLIDE 45

47

Frequency of TU Structures (% of corpus) ▪ Minimal TU (48%) ▪ Extended-Link TU (26%) ▪ Repair TU (11%)

▪ 9% successfully resolved ▪ 2% abandoned

▪ QA TU (~5)%

▪ 4% simple question ▪ 1% lead to instruction

▪ Other TU (11%)

slide-46
SLIDE 46

48

Applications of Annotated Data ▪ Examination of Dialogue Structure Overlap (Henry et al WiNLP 2018) ▪ Stylistic differences across individuals and conditions (Lukin et al Sigdial 2018) ▪ Automating NLU and dialogue management (Gervits et al ACL 2018 Demo)

slide-47
SLIDE 47

49

Future Work ▪ More data collection – in simulation, further annotation ▪ Analysis of other levels – dialogue act type, parameter type, etc. ▪ Analysis of other multi-floor dialogue corpora

▪ Simultaneous interpretation ▪ Observability of other floors

▪ Observable (e.g. restaurant ordering) ▪ Semi-observable (e.g. interpretation to another language) ▪ Non-observable (Botlanguage)

slide-48
SLIDE 48

50

Thank You! ▪ Questions?