Modeling Vocal Interaction for Text-Independent Participant - - PowerPoint PPT Presentation

modeling vocal interaction for text independent
SMART_READER_LITE
LIVE PREVIEW

Modeling Vocal Interaction for Text-Independent Participant - - PowerPoint PPT Presentation

Introduction Framework Experiments Conclusions Modeling Vocal Interaction for Text-Independent Participant Characterization in Multi-Party Conversation Kornel Laskowski 1 , 2 , Mari Ostendorf 3 & Tanja Schultz 1 , 2 1 Cognitive Systems


slide-1
SLIDE 1

Introduction Framework Experiments Conclusions

Modeling Vocal Interaction for Text-Independent Participant Characterization in Multi-Party Conversation

Kornel Laskowski1,2, Mari Ostendorf3 & Tanja Schultz1,2

1Cognitive Systems Labs, Universit¨

at Karlsruhe

2Language Technologies Institute, Carnegie Mellon University

  • 3Dept. Electrical Engineering, University of Washington

June 20, 2008

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-2
SLIDE 2

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-3
SLIDE 3

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-4
SLIDE 4

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-5
SLIDE 5

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-6
SLIDE 6

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-7
SLIDE 7

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-8
SLIDE 8

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-9
SLIDE 9

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-10
SLIDE 10

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-11
SLIDE 11

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-12
SLIDE 12

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-13
SLIDE 13

Introduction Framework Experiments Conclusions

Vocal Interaction (Dabbs & Ruback, 1987)

vocal activity patterns for all K participants, seen together talkspurt start/end times = text-independence at time t,

vocal activity of participant k: qt [k] ∈

V ≡ {, } ≡ {0, 1}

entire K-participant conversation: qt ∈

VK

we’ll use a discretized version (frame step = 200 ms)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-14
SLIDE 14

Introduction Framework Experiments Conclusions

Participant Characterization

Mary Jane Joe Fred Sam

a useful partition of the conversation participants

role influence seniority dominance ranking (of the above)

for all time,

the class of participant k: g [k] ∈ C ≡ {C1, · · · , CN} K-participant group: g ∈

C ≡ h (C)
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-15
SLIDE 15

Introduction Framework Experiments Conclusions

Participant Characterization

Mary Jane Joe Fred Sam

a useful partition of the conversation participants

role influence seniority dominance ranking (of the above)

for all time,

the class of participant k: g [k] ∈ C ≡ {C1, · · · , CN} K-participant group: g ∈

C ≡ h (C)
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-16
SLIDE 16

Introduction Framework Experiments Conclusions

Participant Characterization

Mary Jane Joe Fred Sam

a useful partition of the conversation participants

role influence seniority dominance ranking (of the above)

for all time,

the class of participant k: g [k] ∈ C ≡ {C1, · · · , CN} K-participant group: g ∈

C ≡ h (C)
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-17
SLIDE 17

Introduction Framework Experiments Conclusions

Participant Characterization

Mary Jane Joe Fred Sam

a useful partition of the conversation participants

role influence seniority dominance ranking (of the above)

for all time,

the class of participant k: g [k] ∈ C ≡ {C1, · · · , CN} K-participant group: g ∈

C ≡ h (C)
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-18
SLIDE 18

Introduction Framework Experiments Conclusions

Participant Characterization

Mary Jane Joe Fred Sam

a useful partition of the conversation participants

role influence seniority dominance ranking (of the above)

for all time,

the class of participant k: g [k] ∈ C ≡ {C1, · · · , CN} K-participant group: g ∈

C ≡ h (C)
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-19
SLIDE 19

Introduction Framework Experiments Conclusions

What we’re trying to do

1 2 3 4 1 2 3 4

{qt} ∈

VK×T

F g ∈ h (C)

1 given a sequence of T K-participant states qt 2 compute & model features F 3 infer required equivalence classes g [k] of each participant

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-20
SLIDE 20

Introduction Framework Experiments Conclusions

What we’re trying to do

1 2 3 4 1 2 3 4

{qt} ∈

VK×T

F g ∈ h (C)

1 given a sequence of T K-participant states qt 2 compute & model features F 3 infer required equivalence classes g [k] of each participant

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-21
SLIDE 21

Introduction Framework Experiments Conclusions

What we’re trying to do

1 2 3 4 1 2 3 4

{qt} ∈

VK×T

F g ∈ h (C)

1 given a sequence of T K-participant states qt 2 compute & model features F 3 infer required equivalence classes g [k] of each participant

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-22
SLIDE 22

Introduction Framework Experiments Conclusions

What we’re trying to do

1 2 3 4 1 2 3 4

{qt} ∈

VK×T

F g ∈ h (C)

1 given a sequence of T K-participant states qt 2 compute & model features F 3 infer required equivalence classes g [k] of each participant

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-23
SLIDE 23

Introduction Framework Experiments Conclusions

Outline of Talk

  • 0. ... Intro (Motivation & Related Work)
  • 1. Computational Framework
  • 2. Experiments
  • 3. Conclusions
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-24
SLIDE 24

Introduction Framework Experiments Conclusions

Motivation

having observed a conversation/meeting, being able to say something about the participants is a basic competence in conversation understanding lots of research in psycho- and socio-linguistics, 1950-

converation analysis small group research non-verbal interaction

findings suggest that a participant’s talkspurt deployment timing is conditioned on

instrumental status characteristics, e.g. task-specific skills “diffuse status characteristics”, e.g. gender, age, race, etc.

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-25
SLIDE 25

Introduction Framework Experiments Conclusions

Motivation

having observed a conversation/meeting, being able to say something about the participants is a basic competence in conversation understanding lots of research in psycho- and socio-linguistics, 1950-

converation analysis small group research non-verbal interaction

findings suggest that a participant’s talkspurt deployment timing is conditioned on

instrumental status characteristics, e.g. task-specific skills “diffuse status characteristics”, e.g. gender, age, race, etc.

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-26
SLIDE 26

Introduction Framework Experiments Conclusions

Motivation

having observed a conversation/meeting, being able to say something about the participants is a basic competence in conversation understanding lots of research in psycho- and socio-linguistics, 1950-

converation analysis small group research non-verbal interaction

findings suggest that a participant’s talkspurt deployment timing is conditioned on

instrumental status characteristics, e.g. task-specific skills “diffuse status characteristics”, e.g. gender, age, race, etc.

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-27
SLIDE 27

Introduction Framework Experiments Conclusions

Motivation

having observed a conversation/meeting, being able to say something about the participants is a basic competence in conversation understanding lots of research in psycho- and socio-linguistics, 1950-

converation analysis small group research non-verbal interaction

findings suggest that a participant’s talkspurt deployment timing is conditioned on

instrumental status characteristics, e.g. task-specific skills “diffuse status characteristics”, e.g. gender, age, race, etc.

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-28
SLIDE 28

Introduction Framework Experiments Conclusions

Motivation

having observed a conversation/meeting, being able to say something about the participants is a basic competence in conversation understanding lots of research in psycho- and socio-linguistics, 1950-

converation analysis small group research non-verbal interaction

findings suggest that a participant’s talkspurt deployment timing is conditioned on

instrumental status characteristics, e.g. task-specific skills “diffuse status characteristics”, e.g. gender, age, race, etc.

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-29
SLIDE 29

Introduction Framework Experiments Conclusions

Motivation

having observed a conversation/meeting, being able to say something about the participants is a basic competence in conversation understanding lots of research in psycho- and socio-linguistics, 1950-

converation analysis small group research non-verbal interaction

findings suggest that a participant’s talkspurt deployment timing is conditioned on

instrumental status characteristics, e.g. task-specific skills “diffuse status characteristics”, e.g. gender, age, race, etc.

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-30
SLIDE 30

Introduction Framework Experiments Conclusions

Motivation

having observed a conversation/meeting, being able to say something about the participants is a basic competence in conversation understanding lots of research in psycho- and socio-linguistics, 1950-

converation analysis small group research non-verbal interaction

findings suggest that a participant’s talkspurt deployment timing is conditioned on

instrumental status characteristics, e.g. task-specific skills “diffuse status characteristics”, e.g. gender, age, race, etc.

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-31
SLIDE 31

Introduction Framework Experiments Conclusions

Motivation

having observed a conversation/meeting, being able to say something about the participants is a basic competence in conversation understanding lots of research in psycho- and socio-linguistics, 1950-

converation analysis small group research non-verbal interaction

findings suggest that a participant’s talkspurt deployment timing is conditioned on

instrumental status characteristics, e.g. task-specific skills “diffuse status characteristics”, e.g. gender, age, race, etc.

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-32
SLIDE 32

Introduction Framework Experiments Conclusions

Related Computational Work

static characterization of meeting participants

dominance rankings: Rienks & Heylen, 2005 influence rankings: Rienks et al., 2006

static characterization of radio talk show participants

roles: Vinciarelli, 2007

dynamic characterization of meeting participants

roles: Banerjee & Rudnicky, 2004 roles: Zancanaro et al., 2006 roles: Rienks et al., 2006

static characterization of conversations

meeting types: Laskowski et al., 2007

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-33
SLIDE 33

Introduction Framework Experiments Conclusions

Related Computational Work

static characterization of meeting participants

dominance rankings: Rienks & Heylen, 2005 influence rankings: Rienks et al., 2006

static characterization of radio talk show participants

roles: Vinciarelli, 2007

dynamic characterization of meeting participants

roles: Banerjee & Rudnicky, 2004 roles: Zancanaro et al., 2006 roles: Rienks et al., 2006

static characterization of conversations

meeting types: Laskowski et al., 2007

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-34
SLIDE 34

Introduction Framework Experiments Conclusions

Related Computational Work

static characterization of meeting participants

dominance rankings: Rienks & Heylen, 2005 influence rankings: Rienks et al., 2006

static characterization of radio talk show participants

roles: Vinciarelli, 2007

dynamic characterization of meeting participants

roles: Banerjee & Rudnicky, 2004 roles: Zancanaro et al., 2006 roles: Rienks et al., 2006

static characterization of conversations

meeting types: Laskowski et al., 2007

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-35
SLIDE 35

Introduction Framework Experiments Conclusions

Related Computational Work

static characterization of meeting participants

dominance rankings: Rienks & Heylen, 2005 influence rankings: Rienks et al., 2006

static characterization of radio talk show participants

roles: Vinciarelli, 2007

dynamic characterization of meeting participants

roles: Banerjee & Rudnicky, 2004 roles: Zancanaro et al., 2006 roles: Rienks et al., 2006

static characterization of conversations

meeting types: Laskowski et al., 2007

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-36
SLIDE 36

Introduction Framework Experiments Conclusions

Detecting Participant Types Independently

1 2 3 4 1 2 3 4

F

1 cannot model interaction with specific other types

feature space with non-specific others may be non-convex

2 may require recombination heuristics

≥2 participants may be assigned a unique type

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-37
SLIDE 37

Introduction Framework Experiments Conclusions

Detecting Participant Types Independently

2 3 4 2 3 4 1 1

F1

1 cannot model interaction with specific other types

feature space with non-specific others may be non-convex

2 may require recombination heuristics

≥2 participants may be assigned a unique type

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-38
SLIDE 38

Introduction Framework Experiments Conclusions

Detecting Participant Types Independently

3 4 3 4 1 2 1 2

F2

1 cannot model interaction with specific other types

feature space with non-specific others may be non-convex

2 may require recombination heuristics

≥2 participants may be assigned a unique type

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-39
SLIDE 39

Introduction Framework Experiments Conclusions

Detecting Participant Types Independently

4 4 1 1 2 3 2 3

F3

1 cannot model interaction with specific other types

feature space with non-specific others may be non-convex

2 may require recombination heuristics

≥2 participants may be assigned a unique type

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-40
SLIDE 40

Introduction Framework Experiments Conclusions

Detecting Participant Types Independently

1 1 2 2 3 4 3 4

F4

1 cannot model interaction with specific other types

feature space with non-specific others may be non-convex

2 may require recombination heuristics

≥2 participants may be assigned a unique type

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-41
SLIDE 41

Introduction Framework Experiments Conclusions

Detecting Participant Types Independently

1 1 2 2 3 4 3 4

F4

Problems:

1 cannot model interaction with specific other types

feature space with non-specific others may be non-convex

2 may require recombination heuristics

≥2 participants may be assigned a unique type

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-42
SLIDE 42

Introduction Framework Experiments Conclusions

Detecting Participant Types Independently

1 1 2 2 3 4 3 4

F4

Problems:

1 cannot model interaction with specific other types

feature space with non-specific others may be non-convex

2 may require recombination heuristics

≥2 participants may be assigned a unique type

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-43
SLIDE 43

Introduction Framework Experiments Conclusions

Detecting Participant Types Independently

1 1 2 2 3 4 3 4

F4

Problems:

1 cannot model interaction with specific other types

feature space with non-specific others may be non-convex

2 may require recombination heuristics

≥2 participants may be assigned a unique type

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-44
SLIDE 44

Introduction Framework Experiments Conclusions

Detecting Participant Types Independently

1 2 3 4 1 2 3 4

F

Problems:

1 cannot model interaction with specific other types

feature space with non-specific others may be non-convex

2 may require recombination heuristics

≥2 participants may be assigned a unique type

Solution: model participants jointly

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-45
SLIDE 45

Introduction Framework Experiments Conclusions

Detecting Participant Types Jointly

1 2 3 4 1 2 3 4

F

1 F describes interaction between all K participants 2 the most likely group assignment g∗ is identified by

enumerating over all possible group assignments g ∈ h (C) g∗ = arg max

g ∈ h(C)

P ( g | F ) = arg max

g ∈ h(C)

P ( g ) MM P ( F | g )

  • BM
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-46
SLIDE 46

Introduction Framework Experiments Conclusions

Detecting Participant Types Jointly

1 2 3 4 1 2 3 4

F

1 F describes interaction between all K participants 2 the most likely group assignment g∗ is identified by

enumerating over all possible group assignments g ∈ h (C) g∗ = arg max

g ∈ h(C)

P ( g | F ) = arg max

g ∈ h(C)

P ( g ) MM P ( F | g )

  • BM
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-47
SLIDE 47

Introduction Framework Experiments Conclusions

Detecting Participant Types Jointly

1 2 3 4 1 2 3 4

F

1 F describes interaction between all K participants 2 the most likely group assignment g∗ is identified by

enumerating over all possible group assignments g ∈ h (C) g∗ = arg max

g ∈ h(C)

P ( g | F ) = arg max

g ∈ h(C)

P ( g ) MM P ( F | g )

  • BM
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-48
SLIDE 48

Introduction Framework Experiments Conclusions

Detecting Participant Types Jointly

1 2 3 4 1 2 3 4

F

1 F describes interaction between all K participants 2 the most likely group assignment g∗ is identified by

enumerating over all possible group assignments g ∈ h (C) g∗ = arg max

g ∈ h(C)

P ( g | F ) = arg max

g ∈ h(C)

P ( g ) MM P ( F | g )

  • BM
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-49
SLIDE 49

Introduction Framework Experiments Conclusions

Detecting Participant Types Jointly

1 2 3 4 1 2 3 4

F

1 F describes interaction between all K participants 2 the most likely group assignment g∗ is identified by

enumerating over all possible group assignments g ∈ h (C) g∗ = arg max

g ∈ h(C)

P ( g | F ) = arg max

g ∈ h(C)

P ( g ) MM P ( F | g )

  • BM
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-50
SLIDE 50

Introduction Framework Experiments Conclusions

Detecting Participant Types Jointly

1 2 3 4 1 2 3 4

F

1 F describes interaction between all K participants 2 the most likely group assignment g∗ is identified by

enumerating over all possible group assignments g ∈ h (C) g∗ = arg max

g ∈ h(C)

P ( g | F ) = arg max

g ∈ h(C)

P ( g ) MM P ( F | g )

  • BM
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-51
SLIDE 51

Introduction Framework Experiments Conclusions

Detecting Participant Types Jointly

1 2 3 4 1 2 3 4

F

1 F describes interaction between all K participants 2 the most likely group assignment g∗ is identified by

enumerating over all possible group assignments g ∈ h (C) g∗ = arg max

g ∈ h(C)

P ( g | F ) = arg max

g ∈ h(C)

P ( g ) MM P ( F | g )

  • BM
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-52
SLIDE 52

Introduction Framework Experiments Conclusions

What is h (C)?

That depends on what C is... each of K types assigned to exactly one participant h (C) is a permutation space |h (C) | = K! each participant can be one

  • f any N types

h (C) is a Cartesian product |h (C) | = NK

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-53
SLIDE 53

Introduction Framework Experiments Conclusions

What is h (C)?

Unique Types each of K types assigned to exactly one participant h (C) is a permutation space |h (C) | = K! Non-Unique Types each participant can be one

  • f any N types

h (C) is a Cartesian product |h (C) | = NK

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-54
SLIDE 54

Introduction Framework Experiments Conclusions

What is h (C)?

Unique Types C = {C1, C2, · · · , CK} each of K types assigned to exactly one participant h (C) is a permutation space |h (C) | = K! Non-Unique Types each participant can be one

  • f any N types

h (C) is a Cartesian product |h (C) | = NK

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-55
SLIDE 55

Introduction Framework Experiments Conclusions

What is h (C)?

Unique Types C = {C1, C2, · · · , CK} each of K types assigned to exactly one participant h (C) is a permutation space |h (C) | = K! Non-Unique Types each participant can be one

  • f any N types

h (C) is a Cartesian product |h (C) | = NK

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-56
SLIDE 56

Introduction Framework Experiments Conclusions

What is h (C)?

Unique Types C = {C1, C2, · · · , CK} each of K types assigned to exactly one participant h (C) is a permutation space |h (C) | = K! Non-Unique Types each participant can be one

  • f any N types

h (C) is a Cartesian product |h (C) | = NK

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-57
SLIDE 57

Introduction Framework Experiments Conclusions

What is h (C)?

Unique Types C = {C1, C2, · · · , CK} each of K types assigned to exactly one participant h (C) is a permutation space |h (C) | = K! Non-Unique Types each participant can be one

  • f any N types

h (C) is a Cartesian product |h (C) | = NK

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-58
SLIDE 58

Introduction Framework Experiments Conclusions

What is h (C)?

Unique Types C = {C1, C2, · · · , CK} each of K types assigned to exactly one participant h (C) is a permutation space |h (C) | = K! Non-Unique Types C = {C1, C2, · · · , CN} each participant can be one

  • f any N types

h (C) is a Cartesian product |h (C) | = NK

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-59
SLIDE 59

Introduction Framework Experiments Conclusions

What is h (C)?

Unique Types C = {C1, C2, · · · , CK} each of K types assigned to exactly one participant h (C) is a permutation space |h (C) | = K! Non-Unique Types C = {C1, C2, · · · , CN} each participant can be one

  • f any N types

h (C) is a Cartesian product |h (C) | = NK

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-60
SLIDE 60

Introduction Framework Experiments Conclusions

What is h (C)?

Unique Types C = {C1, C2, · · · , CK} each of K types assigned to exactly one participant h (C) is a permutation space |h (C) | = K! Non-Unique Types C = {C1, C2, · · · , CN} each participant can be one

  • f any N types

h (C) is a Cartesian product |h (C) | = NK

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-61
SLIDE 61

Introduction Framework Experiments Conclusions

What is h (C)?

Unique Types C = {C1, C2, · · · , CK} each of K types assigned to exactly one participant h (C) is a permutation space |h (C) | = K! Non-Unique Types C = {C1, C2, · · · , CN} each participant can be one

  • f any N types

h (C) is a Cartesian product |h (C) | = NK

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-62
SLIDE 62

Introduction Framework Experiments Conclusions

In our experiments...

Unique Roles, C = R AMI Meeting Corpus

design scenario train: 98 meetings dev: 20 meetings eval: 20 meetings K = 4, always

R = {PM, ME, UI, ID} Seniority Levels, C = S ICSI Meeting Corpus

naturally occurring 3 meeting types (Bed,Bmr,Bro) train: 33 meetings dev: 18 meetings eval: 16 meetings 3≥K≥9

S = {grad, phd, prof}

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-63
SLIDE 63

Introduction Framework Experiments Conclusions

In our experiments...

Unique Types Unique Roles, C = R AMI Meeting Corpus

design scenario train: 98 meetings dev: 20 meetings eval: 20 meetings K = 4, always

R = {PM, ME, UI, ID} Non-Unique Types Seniority Levels, C = S ICSI Meeting Corpus

naturally occurring 3 meeting types (Bed,Bmr,Bro) train: 33 meetings dev: 18 meetings eval: 16 meetings 3≥K≥9

S = {grad, phd, prof}

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-64
SLIDE 64

Introduction Framework Experiments Conclusions

In our experiments...

Unique Types Unique Roles, C = R AMI Meeting Corpus

design scenario train: 98 meetings dev: 20 meetings eval: 20 meetings K = 4, always

R = {PM, ME, UI, ID} Non-Unique Types Seniority Levels, C = S ICSI Meeting Corpus

naturally occurring 3 meeting types (Bed,Bmr,Bro) train: 33 meetings dev: 18 meetings eval: 16 meetings 3≥K≥9

S = {grad, phd, prof}

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-65
SLIDE 65

Introduction Framework Experiments Conclusions

In our experiments...

Unique Types Unique Roles, C = R AMI Meeting Corpus

design scenario train: 98 meetings dev: 20 meetings eval: 20 meetings K = 4, always

R = {PM, ME, UI, ID} Non-Unique Types Seniority Levels, C = S ICSI Meeting Corpus

naturally occurring 3 meeting types (Bed,Bmr,Bro) train: 33 meetings dev: 18 meetings eval: 16 meetings 3≥K≥9

S = {grad, phd, prof}

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-66
SLIDE 66

Introduction Framework Experiments Conclusions

In our experiments...

Unique Types Unique Roles, C = R AMI Meeting Corpus

design scenario train: 98 meetings dev: 20 meetings eval: 20 meetings K = 4, always

R = {PM, ME, UI, ID} Non-Unique Types Seniority Levels, C = S ICSI Meeting Corpus

naturally occurring 3 meeting types (Bed,Bmr,Bro) train: 33 meetings dev: 18 meetings eval: 16 meetings 3≥K≥9

S = {grad, phd, prof}

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-67
SLIDE 67

Introduction Framework Experiments Conclusions

In our experiments...

Unique Types Unique Roles, C = R AMI Meeting Corpus

design scenario train: 98 meetings dev: 20 meetings eval: 20 meetings K = 4, always

R = {PM, ME, UI, ID} Non-Unique Types Seniority Levels, C = S ICSI Meeting Corpus

naturally occurring 3 meeting types (Bed,Bmr,Bro) train: 33 meetings dev: 18 meetings eval: 16 meetings 3≥K≥9

S = {grad, phd, prof}

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-68
SLIDE 68

Introduction Framework Experiments Conclusions

In our experiments...

Unique Types Unique Roles, C = R AMI Meeting Corpus

design scenario train: 98 meetings dev: 20 meetings eval: 20 meetings K = 4, always

R = {PM, ME, UI, ID} Non-Unique Types Seniority Levels, C = S ICSI Meeting Corpus

naturally occurring 3 meeting types (Bed,Bmr,Bro) train: 33 meetings dev: 18 meetings eval: 16 meetings 3≥K≥9

S = {grad, phd, prof}

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-69
SLIDE 69

Introduction Framework Experiments Conclusions

In our experiments...

Unique Types Unique Roles, C = R AMI Meeting Corpus

design scenario train: 98 meetings dev: 20 meetings eval: 20 meetings K = 4, always

R = {PM, ME, UI, ID} Non-Unique Types Seniority Levels, C = S ICSI Meeting Corpus

naturally occurring 3 meeting types (Bed,Bmr,Bro) train: 33 meetings dev: 18 meetings eval: 16 meetings 3≥K≥9

S = {grad, phd, prof}

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-70
SLIDE 70

Introduction Framework Experiments Conclusions

Feature Types in F

1 probability of vocalizing (V) 2 probability of initiating vocalization (VI) in prior silence 3 probability of continuing vocalization (VC) in prior non-overlap 4 probability of initiating overlap (OI) in prior non-overlap 5 probability of continuing overlap (OC) in prior overlap

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-71
SLIDE 71

Introduction Framework Experiments Conclusions

Feature Types in F

1 probability of vocalizing (V) 2 probability of initiating vocalization (VI) in prior silence 3 probability of continuing vocalization (VC) in prior non-overlap 4 probability of initiating overlap (OI) in prior non-overlap 5 probability of continuing overlap (OC) in prior overlap

k f V

k

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-72
SLIDE 72

Introduction Framework Experiments Conclusions

Feature Types in F

1 probability of vocalizing (V) 2 probability of initiating vocalization (VI) in prior silence 3 probability of continuing vocalization (VC) in prior non-overlap 4 probability of initiating overlap (OI) in prior non-overlap 5 probability of continuing overlap (OC) in prior overlap

k f VI

k

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-73
SLIDE 73

Introduction Framework Experiments Conclusions

Feature Types in F

1 probability of vocalizing (V) 2 probability of initiating vocalization (VI) in prior silence 3 probability of continuing vocalization (VC) in prior non-overlap 4 probability of initiating overlap (OI) in prior non-overlap 5 probability of continuing overlap (OC) in prior overlap

k f VC

k

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-74
SLIDE 74

Introduction Framework Experiments Conclusions

Feature Types in F

1 probability of vocalizing (V) 2 probability of initiating vocalization (VI) in prior silence 3 probability of continuing vocalization (VC) in prior non-overlap 4 probability of initiating overlap (OI) in prior non-overlap 5 probability of continuing overlap (OC) in prior overlap

k f OI

k,j

j

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-75
SLIDE 75

Introduction Framework Experiments Conclusions

Feature Types in F

1 probability of vocalizing (V) 2 probability of initiating vocalization (VI) in prior silence 3 probability of continuing vocalization (VC) in prior non-overlap 4 probability of initiating overlap (OI) in prior non-overlap 5 probability of continuing overlap (OC) in prior overlap

k j f OC

k,j

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-76
SLIDE 76

Introduction Framework Experiments Conclusions

Models

behavior model (BM), where θ is a 1-dimensional Gaussian P ( F | g ) =

K

  • k=1

P

  • f V

k | θV g[k]

  • P
  • f VI

k

| θVI

g[k]

  • P
  • f VC

k

| θVC

g[k]

  • ×

K

  • j=k

P

  • f OI

k,j | θOI g[k],g[j]

  • P
  • f OC

k,j | θOC g[k],g[j]

  • membership model (MM)

P ( g ) = 1 Zg

K

  • k=1

P ( g [k] )

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-77
SLIDE 77

Introduction Framework Experiments Conclusions

Models

behavior model (BM), where θ is a 1-dimensional Gaussian P ( F | g ) =

K

  • k=1

P

  • f V

k | θV g[k]

  • P
  • f VI

k

| θVI

g[k]

  • P
  • f VC

k

| θVC

g[k]

  • ×

K

  • j=k

P

  • f OI

k,j | θOI g[k],g[j]

  • P
  • f OC

k,j | θOC g[k],g[j]

  • membership model (MM)

P ( g ) = 1 Zg

K

  • k=1

P ( g [k] )

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-78
SLIDE 78

Introduction Framework Experiments Conclusions

Models

behavior model (BM), where θ is a 1-dimensional Gaussian P ( F | g ) =

K

  • k=1

P

  • f V

k | θV g[k]

  • P
  • f VI

k

| θVI

g[k]

  • P
  • f VC

k

| θVC

g[k]

  • ×

K

  • j=k

P

  • f OI

k,j | θOI g[k],g[j]

  • P
  • f OC

k,j | θOC g[k],g[j]

  • membership model (MM)

P ( g ) = 1 Zg

K

  • k=1

P ( g [k] )

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-79
SLIDE 79

Introduction Framework Experiments Conclusions

Unique Role R Classification

Feature AMI Type R f V

k

44 f VI

k

*41 f VC

k

34 f OI

k,j

*53 f OC

k,j

— best* 53 all 46 priors 25

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-80
SLIDE 80

Introduction Framework Experiments Conclusions

Aside: Looking for the Leader

find one unique role only, g [k] ∈ L = {L ≡ PM, ¬L}

0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.005 0.01 0.015 0.02 0.025 (¬L,¬L) (¬L,¬L) (¬L,L) (¬L,L) (L,¬L) (L,¬L) feature fVI feature fOI (¬L,¬L) (¬L,L) (L,¬L)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-81
SLIDE 81

Introduction Framework Experiments Conclusions

Leader L Detection

Feature AMI Type R L f V

k

44 — f VI

k

*41 *60 f VC

k

34 — f OI

k,j

*53 *60 f OC

k,j

— — best* 53 60 all 46 75 priors 25 25

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-82
SLIDE 82

Introduction Framework Experiments Conclusions

Seniority Level Feature Distributions

0.2 0.4 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 (GRAD,*) (PHD,GRAD) (GRAD,*) (PHD,GRAD) (PHD,PHD) (PHD,PROF) (PHD,PHD) (PHD,PROF) (PROF,GRAD) (PROF,GRAD) (PROF,PHD) (PROF,PHD) feature fV feature fOC (GRAD,GRAD) (GRAD,PHD) (GRAD,PROF) (PHD,GRAD) (PHD,PHD) (PHD,PROF) (PROF,GRAD) (PROF,PHD)

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-83
SLIDE 83

Introduction Framework Experiments Conclusions

Seniority Level S Classification

Feature AMI ICSI Type R L S f V

k

44 — *52 f VI

k

*41 *60 52 f VC

k

34 — — f OI

k,j

*53 *60 *59 f OC

k,j

— — *59 best* 53 60 61 all 46 75 58 priors 25 25 45

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-84
SLIDE 84

Introduction Framework Experiments Conclusions

Conversation-Type-Dependent S Classification

condition models on automatically inferred meeting type Feature AMI ICSI Type R L S S|t∗ f V

k

44 — *52 *57 f VI

k

*41 *60 52 56 f VC

k

34 — — 62 f OI

k,j

*53 *60 *59 *59 f OC

k,j

— — *59 *63 best* 53 60 61 67 all 46 75 58 57 priors 25 25 45 45

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-85
SLIDE 85

Introduction Framework Experiments Conclusions

Conclusions

assigned unique roles R in the AMI corpus

53% accuracy, 37% rel error reduction over baseline improves to 75%, when only manager (PM) is sought best features: initiation of talkspurts in silence and in overlap

seniority level S in the ICSI corpus

61% accuracy, 29% rel error reduction over baseline improves to 67%, with conditioning on inferred meeting type improves to 73%, with conditioning on true meeting type best features: overal talkspurt production, initiation and continuation of talkspurts in overlap

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-86
SLIDE 86

Introduction Framework Experiments Conclusions

Conclusions

assigned unique roles R in the AMI corpus

53% accuracy, 37% rel error reduction over baseline improves to 75%, when only manager (PM) is sought best features: initiation of talkspurts in silence and in overlap

seniority level S in the ICSI corpus

61% accuracy, 29% rel error reduction over baseline improves to 67%, with conditioning on inferred meeting type improves to 73%, with conditioning on true meeting type best features: overal talkspurt production, initiation and continuation of talkspurts in overlap

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-87
SLIDE 87

Introduction Framework Experiments Conclusions

Conclusions

assigned unique roles R in the AMI corpus

53% accuracy, 37% rel error reduction over baseline improves to 75%, when only manager (PM) is sought best features: initiation of talkspurts in silence and in overlap

seniority level S in the ICSI corpus

61% accuracy, 29% rel error reduction over baseline improves to 67%, with conditioning on inferred meeting type improves to 73%, with conditioning on true meeting type best features: overal talkspurt production, initiation and continuation of talkspurts in overlap

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-88
SLIDE 88

Introduction Framework Experiments Conclusions

Conclusions

assigned unique roles R in the AMI corpus

53% accuracy, 37% rel error reduction over baseline improves to 75%, when only manager (PM) is sought best features: initiation of talkspurts in silence and in overlap

seniority level S in the ICSI corpus

61% accuracy, 29% rel error reduction over baseline improves to 67%, with conditioning on inferred meeting type improves to 73%, with conditioning on true meeting type best features: overal talkspurt production, initiation and continuation of talkspurts in overlap

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-89
SLIDE 89

Introduction Framework Experiments Conclusions

Conclusions

assigned unique roles R in the AMI corpus

53% accuracy, 37% rel error reduction over baseline improves to 75%, when only manager (PM) is sought best features: initiation of talkspurts in silence and in overlap

seniority level S in the ICSI corpus

61% accuracy, 29% rel error reduction over baseline improves to 67%, with conditioning on inferred meeting type improves to 73%, with conditioning on true meeting type best features: overal talkspurt production, initiation and continuation of talkspurts in overlap

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-90
SLIDE 90

Introduction Framework Experiments Conclusions

Conclusions

assigned unique roles R in the AMI corpus

53% accuracy, 37% rel error reduction over baseline improves to 75%, when only manager (PM) is sought best features: initiation of talkspurts in silence and in overlap

seniority level S in the ICSI corpus

61% accuracy, 29% rel error reduction over baseline improves to 67%, with conditioning on inferred meeting type improves to 73%, with conditioning on true meeting type best features: overal talkspurt production, initiation and continuation of talkspurts in overlap

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-91
SLIDE 91

Introduction Framework Experiments Conclusions

Conclusions

assigned unique roles R in the AMI corpus

53% accuracy, 37% rel error reduction over baseline improves to 75%, when only manager (PM) is sought best features: initiation of talkspurts in silence and in overlap

seniority level S in the ICSI corpus

61% accuracy, 29% rel error reduction over baseline improves to 67%, with conditioning on inferred meeting type improves to 73%, with conditioning on true meeting type best features: overal talkspurt production, initiation and continuation of talkspurts in overlap

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-92
SLIDE 92

Introduction Framework Experiments Conclusions

(Potential) Implications

1 Participant Characterization

talkspurt deployment timing is predictive first baseline for several of the explored tasks proposed framework allows for inclusion of potentially complementary information, to prosodic/lexical/semantic features

2 Dialogue Systems

agent talkspurt deployment may contribute to agent personality

3 Speech Activity Detection

performance likely to improve with conditioning on participant characteristics

  • r joint inference of SAD and participant characteristics
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-93
SLIDE 93

Introduction Framework Experiments Conclusions

(Potential) Implications

1 Participant Characterization

talkspurt deployment timing is predictive first baseline for several of the explored tasks proposed framework allows for inclusion of potentially complementary information, to prosodic/lexical/semantic features

2 Dialogue Systems

agent talkspurt deployment may contribute to agent personality

3 Speech Activity Detection

performance likely to improve with conditioning on participant characteristics

  • r joint inference of SAD and participant characteristics
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-94
SLIDE 94

Introduction Framework Experiments Conclusions

(Potential) Implications

1 Participant Characterization

talkspurt deployment timing is predictive first baseline for several of the explored tasks proposed framework allows for inclusion of potentially complementary information, to prosodic/lexical/semantic features

2 Dialogue Systems

agent talkspurt deployment may contribute to agent personality

3 Speech Activity Detection

performance likely to improve with conditioning on participant characteristics

  • r joint inference of SAD and participant characteristics
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-95
SLIDE 95

Introduction Framework Experiments Conclusions

(Potential) Implications

1 Participant Characterization

talkspurt deployment timing is predictive first baseline for several of the explored tasks proposed framework allows for inclusion of potentially complementary information, to prosodic/lexical/semantic features

2 Dialogue Systems

agent talkspurt deployment may contribute to agent personality

3 Speech Activity Detection

performance likely to improve with conditioning on participant characteristics

  • r joint inference of SAD and participant characteristics
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-96
SLIDE 96

Introduction Framework Experiments Conclusions

(Potential) Implications

1 Participant Characterization

talkspurt deployment timing is predictive first baseline for several of the explored tasks proposed framework allows for inclusion of potentially complementary information, to prosodic/lexical/semantic features

2 Dialogue Systems

agent talkspurt deployment may contribute to agent personality

3 Speech Activity Detection

performance likely to improve with conditioning on participant characteristics

  • r joint inference of SAD and participant characteristics
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-97
SLIDE 97

Introduction Framework Experiments Conclusions

(Potential) Implications

1 Participant Characterization

talkspurt deployment timing is predictive first baseline for several of the explored tasks proposed framework allows for inclusion of potentially complementary information, to prosodic/lexical/semantic features

2 Dialogue Systems

agent talkspurt deployment may contribute to agent personality

3 Speech Activity Detection

performance likely to improve with conditioning on participant characteristics

  • r joint inference of SAD and participant characteristics
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-98
SLIDE 98

Introduction Framework Experiments Conclusions

(Potential) Implications

1 Participant Characterization

talkspurt deployment timing is predictive first baseline for several of the explored tasks proposed framework allows for inclusion of potentially complementary information, to prosodic/lexical/semantic features

2 Dialogue Systems

agent talkspurt deployment may contribute to agent personality

3 Speech Activity Detection

performance likely to improve with conditioning on participant characteristics

  • r joint inference of SAD and participant characteristics
  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA

slide-99
SLIDE 99

Introduction Framework Experiments Conclusions

Thank you for attending.

Many thanks also to: Jean Carletta, for many helpful comments Liz Shriberg, for access to the ICSI MRDA Corpus

  • K. Laskowski, M. Ostendorf, T. Schultz

SIGdial 2008, Columbus OH, USA