Towards Customizable Individualized Dialogue Systems Marilyn - PowerPoint PPT Presentation

Towards Customizable Individualized Dialogue Systems Marilyn Walker, S. Whittaker, R. Moore, J. Moore and S. Young Universities of Sheffield, Edinburgh and Cambridge

Spoken Dialogue Systems (HCI and Human Modeling) • An intelligent artifact that can interact with humans to complete certain tasks • An important experimental vehicle for Cognitive Science • Cognitive hypotheses about dialogue can be embodied in system and tested • Hypotheses related to individual differences in interaction Marilyn Walker University of Sheffield

Spoken Dialogue Systems: THE PAST to the PRESENT • Travel information systems, e.g. ATIS, SUNDIAL, Communicator • System initiative limited vocabulary dialogue => mixed initiative large vocabulary ASR • Commercial systems in many domains (but still limitations) Marilyn Walker University of Sheffield

Key of “Mr. Right” is instructability and feedback Systems that are personalized to respect individual differences That learn over time how to improve their performance

What kind of learning would be important? o Modeling individual differences o One of the most consistent results from cognitive science � Cognitive load (young vs. the elderly) � Learning differences (visual vs. verbal learners) � Interactive style (casual vs. formal, directive vs. nondirective)

Spoken Dialogue Systems Speech Speech TTS ASR Automatic Speech Text-to-Speech Recognition Synthesis Data, Rules Words Words SLG SLU Spoken Language Spoken Language Generation Understanding Meaning Goal DM Dialog Management Marilyn Walker University of Sheffield

Hypothesis: Individualization and customization depend on methods for training the DM and SLG

Training the Dialogue Manager and Response Generator • Dialogue management: o Reinforcement learning: Levin etal 97, Walker etal 98, Litman etal, 2000, Scheffler and Young 2002 • Spoken language generation o Decision-theoretic user models (Carenini and Moore IJCAI 2001a, b; Walker etal, Cognitive Science 2004) o Rankboost (form of boosting) (Rambow etal ACL01, Walker etal NAACL01, Stent etal ACL04) Marilyn Walker University of Sheffield

Reinforcement Learning • System characterized in terms of a set of states, and actions that can be taken in each state • Actions can be something said to the user, a whole subdialogue, or accessing the database • The rewards received on reaching a state or at end of dialogue are used to learn which actions lead to highest rewards

Reinforcement Learning (cont) + ∑ = a ' U a S ( , ) R S ( ) M max U a S ( , ) i i j ij a ' j • Actions a, a’ • States Si, Sj • Utility U • M a ij , probability of going from State i to State j on doing action a (estimated from experimental data) • R(Si) - the immediate reward for getting to State i • U(final state): the delayed reward for completing the dialogue

Experiments with human users • ELVIS: User • NJFun: Task Completion Satisfaction increased increased from .52 (training) from 27.5 (training) to to .64 (test) (p < .06) 31.7 (test) (p < .05) Marilyn Walker University of Sheffield

Individualizing the reward function? o Cognitive load (young vs. the elderly) o Learning differences (visual vs. verbal learners) o Interactive style (casual vs. formal, directive vs. nondirective)

Summary (RL) • Reinforcement learning allows you to represent any system actions as choices the system is making in a particular dialogue state • The reward function can be based on any evaluation metric you wish to optimize • Experiments so far suggest that the method provides measurable and significant system improvements on chosen metric Marilyn Walker University of Sheffield

Decision-theoretic models for content selection • User interacts with system to indicate importance of different domain attributes in decision making • User-tailored responses select content depending on individual preferences • Experiments in real-estate, restaurant and travel domains show increased effectiveness in decision making (Carenini and Moore, Walker etal, Stent etal, Moore etal) • Open questions about degree of conciseness and form of information presentation (cognitive load for processing information) Marilyn Walker University of Sheffield

Boosting to customize form of response for SLG • Example responses represented by a set of features describing any potential aspect of the response • Each response has an associated rating derived from human feedback (e.g. Informational Coherence) • These ratings induce a partial order over the set of possible responses • The training method learns how to reproduce this partial order (ranking) of responses. Marilyn Walker University of Sheffield

Rankboost Algorithm ( See Schapire 99, Iyer etal 98) • Each response x represented as sum of m indicator functions where each function threshholded on a feature count: h s (x) = 1 if feature-count > 1, else 0 • Each function h s (x) has single α s parameter • Ranking Score: F(x) = Σ s s α s h s (x), ranks competing responses • Training data is a set of pairs (x,y) for each example x rated higher than y • Training: set the parameters α s to minimize the loss function Loss = Σ (x,y) e – (F (x) –F(y)) • As Loss is minimized, (F(x) – F(y)) where x is preferred to y is pushed to positive and ranking errors will tend to be reduced Marilyn Walker University of Sheffield

Examples: Learned Rules applied to test fold Realization Human RankBoost 0.45 Babbo has the best overall quality among the selected 1.5 restaurants because it has superb food quality, with excellent service, and it has excellent decor. Babbo has excellent service. It has superb food quality. 2.0 0.21 It has excellent decor. It has the best overall quality among the selected restaurants. Since Babbo has excellent service and superb food 3.5 0.77 quality, with excellent decor, it has the best overall quality among the selected restaurants. Babbo has excellent service and superb food quality, 4 0.88 with excellent decor. It has the best overall quality among the selected restaurants With excellent decor, excellent service and superb food 0.91 5 quality, Babbo has the best overall quality among the selected restaurants.. Marilyn Walker University of Sheffield

Response Generation Summary • Can train response generation to rank possible responses • Example: Ranking based on user feedback on the response’s informational coherence • However, user feedback could be oriented to any evaluation metric associated with the response, e.g. measures collected automatically via neurophysiological probes Marilyn Walker University of Sheffield

Proposal for Individualization • Reward function for reinforcement learning could be based in individual feedback • Multi-attribute models individualize content selection but degree of conciseness and form of presentation left uncustomized • Boosting method would support individualized conciseness and presentation form, given individualized feedback for ranking • Need research on metrics, probes to collect them, which differences most important, methods for training with smaller amounts of human interactive/feedback data Marilyn Walker University of Sheffield

Towards Customizable Individualized Dialogue Systems Marilyn - PowerPoint PPT Presentation

Towards Customizable Individualized Dialogue Systems Marilyn Walker, S. Whittaker, R. Moore, J. Moore and S. Young Universities of Sheffield, Edinburgh and Cambridge Spoken Dialogue Systems (HCI and Human Modeling) An intelligent

Customizable Domain- Customizable Domain -Specific Computing Specific Computing Jason Cong

Language and Computers Speech acts Rules Early dialogue Dialog Systems systems ELIZA Other

Real-life needs Individualized Individualized Feedback in ITS Feedback in ITS Detmar Meurers

dialogue systems, dialogue modeling 15 June 2007 ptt dialogue systems: intro 1/71 Dialog

dialogue notations and design Dialogue Notations and Design Dialogue Notations

Dialogue corpora NPFL070 December 11, 2019 (NPFL070) Dialogue corpora December 11, 2019 1 /

dialogue notations and Dialogue linked to the semantics of the system what it does

Describing Customizable Products on the Web of Data Linked Data On the Web Workshop - Rio de

CSpace CSpace CSpace CSpace A More Practical and A More Practical and A

The Computer and Natural Language Speech acts Discourse structure (Ling 445/515) Early dialogue

Care Planning: The Road Map for Individualized Resident Care Kathy Sanders RN, RAC-CT, DNS-CT

Julie Casaert Stephanie Lamanna Quality Improvement Project: Individualized Care Planning

Optimizing the Management of Acute Myeloid Leukemia: Individualized Therapy Optimizing the

Seven Essential Tips for Developing and Implementing Individualized Quality Control Plans Bill

Language Technology II: Natural Language Dialogue Verbal Output Generation in Dialogue

2 nd Dialogue on REDD Finance Mechanisms The Forests Dialogue 19-20 June 2009 Montreux,

o To start your script, click on the Add Dialogue box and select which character you want to

Camp Prosperity: Resist! Strategies for Pushing Back Against Harmful Policies August 1, 2018

Dr. Karim Bouzoubaa UI Hall of Fame or Shame? The buttons are limited to text labels: pi

Instrumental Interaction in Multisurface Environments Michel Beaudouin-Lafon Universit

Type-based Human-Computer Dialogue Peter Ljunglf Dept. of Computer Science and Engineering

Dialogue Systems System: [returns a list of flights] Show: (Arrival-time) Origin (City

Speech Processing 11-492/18-492 Speech Processing 11-492/18-492 Spoken Dialog Systems SDS

implementation support Implementation support programming tools levels of services for

Sambuz

Useful Links

Newsletter

Mail Us