Re Recognizing Afffect in Dialog Systems ms Nate - PowerPoint PPT Presentation

Re Recognizing ¡ ¡Afffect ¡ ¡in ¡ ¡Dialog ¡ ¡ Systems ms Nate ¡Perkins ¡

Problem • Iden.fy ¡emo.onal ¡state ¡in ¡human ¡speech ¡dialog ¡

Why? • Tutoring ¡systems ¡ • Call ¡center ¡systems ¡ • Second ¡language ¡learning ¡systems ¡ • Virtual ¡agents ¡

What ¡are ¡we ¡iden=fying? • Emo.onal ¡state ¡is ¡difficult ¡to ¡define ¡for ¡humans ¡let ¡alone ¡computers ¡ • Target ¡broad ¡categories ¡ • Posi.ve/nega.ve/neutral ¡ • Nega.ve/non-‑nega.ve ¡ • Certain/uncertain ¡ • Posi.ve/nega.ve, ¡ac.ve/passive ¡ • posi.ve-‑ac.ve ¡: ¡joy ¡ • nega.ve-‑passive ¡: ¡frustra.on ¡

How ¡do ¡we ¡iden=fy ¡it ¡and ¡then ¡annotate? • Cross-‑valida.on ¡of ¡annota.ons ¡ • Coached ¡uGerances ¡targe.ng ¡specific ¡emo.onal ¡states ¡

What ¡features ¡are ¡relevant? • Overview ¡ • Discourse ¡ • ¡focus ¡on ¡‘what’, ¡‘how’, ¡and ¡‘when’ ¡something ¡is ¡said ¡ • Acous.c ¡barge-‑in ¡ • Ques.on ¡ • Seman.c ¡barge-‑in ¡ • Acous.c ¡prosodic ¡ • Rejec.on ¡ • Fundamental ¡freq ¡stats ¡ • Repeat ¡ • Energy/intensity ¡ • ‘local’ ¡vs ¡‘global’ ¡features ¡ • pitch ¡ • ‘local’ ¡– ¡prior ¡two ¡uGerances’ ¡features ¡ • ‘global’ ¡– ¡avg ¡of ¡all ¡prior ¡uGerances ¡ • Acous.c ¡temporal ¡ • Speaker ¡ • Total ¡.me ¡ • Gender ¡ • Total ¡silence ¡ • Subject ¡ • Speaking ¡rate ¡ • Facial ¡ • Lexical ¡ • Word ¡n-‑grams ¡ • Character ¡n-‑grams ¡ • Emo.onal ¡salience ¡ • Mutual ¡informa.on ¡between ¡words ¡and ¡emo.onal ¡state ¡ • derived ¡

Models • Independent ¡classifiers ¡for ¡different ¡categories ¡ • Aggregate ¡classifiers ¡via ¡interpola.on ¡ • Try ¡different ¡combina.ons ¡to ¡find ¡best ¡result ¡

Results • Some ¡instances ¡where ¡non-‑acous.c ¡out-‑performed ¡acous.c ¡in ¡ certain ¡experiments ¡ • Acous.c ¡+ ¡lexical ¡ • Generally ¡: ¡mix ¡of ¡all ¡feature ¡categories ¡performs ¡best ¡

Ques=ons • What ¡do ¡you ¡see ¡as ¡the ¡next ¡steps ¡in ¡terms ¡of ¡using ¡these ¡predic.ons ¡in ¡a ¡ dialogue ¡system? ¡The ¡authors ¡men.on ¡that ¡this ¡informa.on ¡can ¡"enhance" ¡their ¡ tutoring ¡system ¡but ¡they ¡don't ¡explicitly ¡go ¡into ¡how. ¡For ¡example, ¡if ¡the ¡system ¡ knows ¡the ¡user ¡is ¡experiencing ¡a ¡"nega.ve" ¡emo.on, ¡how ¡might ¡it ¡adapt ¡to ¡ address ¡that? ¡ • I ¡found ¡their ¡classifica.on ¡into ¡nega.ve, ¡posi.ve ¡and ¡neutral ¡groupings ¡a ¡liGle ¡ unnatural ¡and ¡unsa.sfying. ¡For ¡example, ¡"bored" ¡is ¡part ¡of ¡the ¡nega.ve ¡group ¡ but ¡it ¡seems ¡like ¡one ¡might ¡express ¡boredom ¡with ¡a ¡lack ¡of ¡emo.on, ¡but ¡"no ¡ strong ¡expression ¡of ¡emo.on" ¡is ¡how ¡the ¡neutral ¡category ¡is ¡defined. ¡And ¡ "frustra.on" ¡and ¡"uncertainty" ¡are ¡also ¡both ¡part ¡of ¡the ¡nega.ve ¡category ¡but ¡it ¡ seems ¡like ¡these ¡would ¡be ¡expressed ¡with ¡vastly ¡different ¡features. ¡Thoughts? ¡ • The ¡authors ¡of ¡“Predic.ng ¡Emo.on ¡in ¡Spoken ¡Dialogue ¡from ¡Mul.ple ¡Knowledge ¡ Sources” ¡call ¡contextual ¡features, ¡local ¡and ¡global, ¡the ¡features ¡of ¡the ¡two ¡ preceding ¡students ¡and ¡the ¡average ¡of ¡all ¡students ¡features. ¡How ¡is ¡this ¡related ¡ to ¡a ¡‘context’ ¡for ¡the ¡emo.ons ¡of ¡a ¡student? ¡

Ques=ons ¡(cont) • The ¡authors ¡of ¡“Predic.ng ¡Emo.on ¡in ¡Spoken ¡Dialogue ¡from ¡Mul.ple ¡Knowledge ¡Sources” ¡assume ¡that ¡ implemen.ng ¡emo.ons ¡in ¡a ¡automated ¡dialog ¡system ¡should ¡improve ¡the ¡performance ¡of ¡such ¡a ¡system. ¡ Isn’t ¡this ¡though ¡contrary ¡to ¡the ¡experience ¡of ¡people, ¡that ¡tend ¡to ¡behave ¡differently ¡with ¡a ¡machine ¡than ¡ with ¡a ¡human? ¡As ¡the ¡corpus ¡for ¡this ¡study ¡is ¡on ¡a ¡human-‑human ¡dialog ¡corpus, ¡the ¡results ¡should ¡not ¡be ¡ easily ¡transferable ¡to ¡an ¡automated ¡system, ¡or? ¡ • I’m ¡interested ¡in ¡Thor’s ¡second ¡ques.on—the ¡asser.on ¡that ¡this ¡system ¡may ¡not ¡be ¡easily ¡transferable ¡to ¡a ¡ human-‑machine ¡interac.on ¡given ¡its ¡training ¡on ¡a ¡human-‑human ¡corpus. ¡I ¡agree ¡with ¡this ¡assessment, ¡but ¡I ¡ also ¡wonder: ¡isn’t ¡the ¡goal ¡of ¡spoken ¡dialogue ¡systems ¡to ¡facilitate ¡a ¡conversa.on ¡such ¡as ¡those ¡experienced in ¡human-‑human ¡interac.on? ¡If ¡that ¡is ¡the ¡case, ¡then ¡training ¡on ¡a ¡human-‑human ¡corpus ¡makes ¡sense ¡for ¡a ¡ long-‑term ¡goal. ¡Is ¡it ¡feasible ¡to ¡expect ¡humans’ ¡behavior ¡with ¡spoken ¡dialogue ¡systems ¡to ¡change ¡as ¡systems ¡ improve, ¡and ¡should ¡research ¡be ¡preparing ¡for ¡this ¡purpose? ¡ • How ¡would ¡it ¡extend ¡to ¡non-‑English ¡language, ¡and ¡non-‑college ¡level ¡student, ¡secngs? ¡ Is ¡the ¡system ¡of ¡annota.on ¡language ¡independent, ¡since ¡it ¡is ¡a ¡human ¡(na.ve ¡speaker) ¡process? ¡ The ¡authors ¡men.on ¡they ¡are ¡exploring ¡other ¡emo.on ¡annota.on ¡schemes ¡-‑ ¡are ¡any ¡of ¡those ¡language/ culture ¡group ¡agnos.c ¡(is ¡that ¡even ¡a ¡possibility)? ¡ • Could ¡the ¡manual ¡features, ¡such ¡as ¡barge-‑in ¡or ¡''is ¡ques.on'', ¡be ¡automa.cally ¡derived ¡ from ¡the ¡raw ¡data ¡they ¡currently ¡have? ¡ • Using ¡just ¡lexical ¡items ¡produced ¡a ¡rela.vely ¡high ¡accuracy, ¡which ¡differs ¡from ¡other ¡studies. ¡ ¡Is ¡that ¡due ¡this ¡ specific ¡context ¡/ ¡domain? ¡

Modeling Affect in Dialog Katherine Topping LING575 Spring 2016

Adapting to Multiple Affective States in Spoken Dialogue

Overview • Wizard-of-Oz tutoring system • Previous work on multiple affect systems showed no significant improvements in task success, but showed other benefits such as increased user satisfaction • Comparing effectiveness of system recognizing only one affect (uncertainty) versus new system responding to two different user affects (uncertainty and disengagement) • Two most frequent user affective states that occur in system

Baseline System: UNC_ADAPT • (Un)certainty automatically classified by logistic regression model • Features of speech signal (i.e. prosody) • Automatic transcript • Dialogue context • System responds based upon answer’s (in)correctness and (un)certainty • Wizard used in present experiment • Inter-annotator agreement of 0.85 (correctness) and 0.62 (uncertainty) Kappa

New System: UNC -DISE_ADAPT • Adds disengagement, characterized by signs of boredom or irritation • Leaden monotone, sarcasm, off-task sounds • Inter-annotator agreement of 0.55 Kappa • Responses divided into correct+disengaged (COR-DISE) and incorrect+disengaged (INC-DISE) • Hypothesized that UNC_ADAPT response to incorrectness insufficient for INC-DISE turn (user already disengaged) • User must reengage to benefit from supplementary info • System gives "productive interaction feedback" to INC-DISE turns, followed by fill-in-the-blank version of original question

Experimental Procedure • College students with no college-level physics • Assigned to either UNC_ADAPT or UNC-DISE_ADAPT • Users: • Read short physics text • Took pretest and pre-motivation survey • Worked 5 "training" problem dialogs with system • Took post-motivation survey and user satisfaction survey • Took posttest isomorphic to pretest • Worked a "test" problem with UNC_ADAPT

Performance • Small decrease in learning gain/user satisfaction means for UNC- DISE • Previous study showed UNC had significantly higher learning gain than no-adapt system • UNC-DISE also outperforms no-adapt consistently • While adding new affect adaptations may not yield additive improvements, it also doesn’t hurt performance

Performance • Low-DISE users had higher motivation gain in UNC_ADAPT • High-DISE users had higher motivation gain in UNC-DISE_ADAPT

Re Recognizing Afffect in Dialog Systems ms Nate - PowerPoint PPT Presentation

Re Recognizing Afffect in Dialog Systems ms Nate Perkins Problem Iden.fy emo.onal state in human speech dialog Why? Tutoring systems Call center

Speech Processing 15-492/18-492 Spoken Dialog Systems Advanced Concepts in Dialog Spoken Dialog

Advanced NLU & Dialog Models Ling575 Spoken Dialog Systems April 21, 2016 Roadmap

Wrapping Up Ling575 Spoken Dialog Systems June 5, 2013 Roadmap Overview Distinctive

Dialog Management EE596/LING580 -- Conversational Artificial Intelligence Hao Cheng University

SDS: ASR, NLU, & VXML Ling575 Spoken Dialog April 14, 2016 Roadmap Dialog System

AI DIALOG SEARCH news services Josef Krupi ka Michal Svoboda Goals dialog system

Dialog Models 11-716 September 18, 2003 Thomas Harris What is a (dialog) model? A model is

Speech Processing 15-492/18-492 Spoken Dialog Systems Tree based dialogs VoiceXML State-based

Speech Processing 11-492/18-492 Speech Processing 11-492/18-492 Spoken Dialog Systems SDS

Speech Processing 15-492/18-492 Spoken Dialog Systems SDS components Spoken Dialog Systems More

Speech Processing 11-492/18-495 Speech Processing 11-492/18-495 Spoken Dialog Systems Conversing

Recognizing objects and actions in Finding boundaries images and video Recognizing

Dialogue and Conversational Agents Ling575 Spoken Dialog Systems April 2, 2015 Roadmap

Dialogue and Conversational Agents Ling575 Spoken Dialog Systems March 29, 2017 Roadmap

Speech Processing 15-492/18-492 Spoken Dialog Systems Conversing with machines Spoken Dialog

Dialogue and Conversational Agents Ling575 Spoken Dialog Systems March 31, 2016 Roadmap

Highlights from BESIII Hai-Bo Li for BESIII Collaboration Institute of High Energy Physics

[ with the schedulers collaboration] Alberto Miranda, PhD Researcher on HPC I/O

Learning Word Representations by Jointly Modeling Syntagmatic and Paradigmatic Relations Fei Sun ,

Agenda Agenda Linda Rammler, UConn UCEDD (copy from Agenda handout) Fr. John Gallagher,

Linear Programming in Low Dimensions (most slides by Nati Srebro) September 11, 2003 Lecture 3:

Learning Word Representations by Jointly Modeling Syntagmatic and Paradigmatic Relations Fei Sun ,

UAM@SOCO 2014: Detection of Source Code Re-use by mean of Combining Different Types of

Comparing Learning Models for Korean Sound-symbolic Vowel Harmony Darrell Larsen and Jeffrey

Re Recognizing Afffect in Dialog Systems ms Nate - PowerPoint PPT Presentation

Re Recognizing Afffect in Dialog Systems ms Nate Perkins Problem Iden.fy emo.onal state in human speech dialog Why? Tutoring systems Call center

Speech Processing 15-492/18-492 Spoken Dialog Systems Advanced Concepts in Dialog Spoken Dialog

Advanced NLU &amp; Dialog Models Ling575 Spoken Dialog Systems April 21, 2016 Roadmap

Wrapping Up Ling575 Spoken Dialog Systems June 5, 2013 Roadmap Overview Distinctive

Dialog Management EE596/LING580 -- Conversational Artificial Intelligence Hao Cheng University

SDS: ASR, NLU, &amp; VXML Ling575 Spoken Dialog April 14, 2016 Roadmap Dialog System

AI DIALOG SEARCH news services Josef Krupi ka Michal Svoboda Goals dialog system

Dialog Models 11-716 September 18, 2003 Thomas Harris What is a (dialog) model? A model is

Speech Processing 15-492/18-492 Spoken Dialog Systems Tree based dialogs VoiceXML State-based

Speech Processing 11-492/18-492 Speech Processing 11-492/18-492 Spoken Dialog Systems SDS

Speech Processing 15-492/18-492 Spoken Dialog Systems SDS components Spoken Dialog Systems More

Speech Processing 11-492/18-495 Speech Processing 11-492/18-495 Spoken Dialog Systems Conversing

Recognizing objects and actions in Finding boundaries images and video Recognizing

Dialogue and Conversational Agents Ling575 Spoken Dialog Systems April 2, 2015 Roadmap

Dialogue and Conversational Agents Ling575 Spoken Dialog Systems March 29, 2017 Roadmap

Speech Processing 15-492/18-492 Spoken Dialog Systems Conversing with machines Spoken Dialog

Dialogue and Conversational Agents Ling575 Spoken Dialog Systems March 31, 2016 Roadmap

Highlights from BESIII Hai-Bo Li for BESIII Collaboration Institute of High Energy Physics

[ with the schedulers collaboration] Alberto Miranda, PhD Researcher on HPC I/O

Learning Word Representations by Jointly Modeling Syntagmatic and Paradigmatic Relations Fei Sun ,

Agenda Agenda Linda Rammler, UConn UCEDD (copy from Agenda handout) Fr. John Gallagher,

Linear Programming in Low Dimensions (most slides by Nati Srebro) September 11, 2003 Lecture 3:

Learning Word Representations by Jointly Modeling Syntagmatic and Paradigmatic Relations Fei Sun ,

UAM@SOCO 2014: Detection of Source Code Re-use by mean of Combining Different Types of

Comparing Learning Models for Korean Sound-symbolic Vowel Harmony Darrell Larsen and Jeffrey

Advanced NLU & Dialog Models Ling575 Spoken Dialog Systems April 21, 2016 Roadmap

SDS: ASR, NLU, & VXML Ling575 Spoken Dialog April 14, 2016 Roadmap Dialog System