Combining Modalities in Multimodal Interfaces
Focus on speech and gestures
Gabriel Skantze, gabriel@speech.kth.se
Common misconceptions
Oviatt, “Ten myths of multimodal interaction”
- 1. If you build a multimodal system, users will interact multimodally.
- 2. Speech and pointing is the dominant multimodal integration pattern.
- 3. Multimodal input involves simultaneous signals.
- 4. Speech is the primary input mode in any multimodal system that includes it.
- 5. Multimodal language does not differ linguistically from unimodal language.
- 6. Multimodal integration involves redundancy of content between modes.
- 7. Individual error-prone recognition technologies combine multimodally to produce even greater unreliability.
- 8. All users’ multimodal commands are integrated in a uniform way.
- 9. Different input modes are capable of transmitting comparable content.
- 10. Enhanced efficiency is the main advantage of multimodal systems.
Multimodal interface = Multimodal interaction?
- Video: BTSLogic provides Directory Assistance and Information Services solutions to telecommunications carriers and operator services companies worldwide.
- Almost all users (95–100%) prefer to interact multimodally if given the choice. This does not mean that all interaction is multimodal; rather, the best option is used for each task. Roughly 20% of the interaction with multimodal interfaces has been observed to be multimodal.
- The proportion depends on the type and complexity of the task.
Myth 2: Speech and pointing is the dominant multimodal integration pattern
Put That There
[Bolt, 1980]
More than “put that there”
- Combinations of written input, manual gesturing, and facial expressions can generate symbolic information that is more richly expressive than simple object selection.
- The speak-and-point pattern comprises only 14% of all spontaneous multimodal utterances.
  – Pen input is used to create graphics, symbols and signs, gestural marks, digits and lexical content.
- In interpersonal multimodal communication, pointing gestures account for less than 20% of all gestures.
- Conclusion: Multimodal systems should handle other input than speak-and-point.
Myth 3: Multimodal input involves simultaneous signals
Simultaneous or Sequential
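The myth fails because a multimodal construction does not require temporal overlap: the component signals may be simultaneous or sequential. As a minimal, hypothetical sketch of how a fusion component can accommodate both patterns, the snippet below pairs speech and pen events whose onsets fall within a configurable time window; the event format and window length are illustrative assumptions, not values from Oviatt’s studies.

```python
from dataclasses import dataclass

@dataclass
class InputEvent:
    modality: str   # "speech" or "pen"
    content: str
    start: float    # onset time (seconds)
    end: float      # offset time (seconds)

def pair_events(events, max_lag=4.0):
    """Pair each speech event with the nearest pen event whose onset
    lies within max_lag seconds: this covers both simultaneous
    (overlapping) and sequential (non-overlapping) integration."""
    pen = [e for e in events if e.modality == "pen"]
    pairs = []
    for s in (e for e in events if e.modality == "speech"):
        near = [p for p in pen if abs(p.start - s.start) <= max_lag]
        if near:
            pairs.append((s, min(near, key=lambda p: abs(p.start - s.start))))
    return pairs

# Sequential pattern: the pen gesture precedes speech by 1.5 s, no overlap
events = [
    InputEvent("pen", "point(apartment_3)", 0.0, 0.4),
    InputEvent("speech", "what does this cost", 1.5, 2.6),
]
print(len(pair_events(events)))   # 1: integrated despite no overlap
```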
Myth 4: Speech is the primary input mode in any multimodal system that includes it
Speech is not everything
- Traditionally, speech has been viewed as the primary modality, with writing, gestures and haptics as merely supporting modalities.
- However, the other modalities can give information that is not present in the speech signal, e.g. spatial information.
- Pen input precedes speech in 99% of sequentially integrated multimodal commands, and in most simultaneously integrated ones.
Myth 5: Multimodal language does not differ linguistically from unimodal language
Speech in multimodality
- Multimodal speech is briefer, syntactically simpler, and less disfluent than users’ unimodal speech.
  – Unimodal: “Place a boat dock on the east, no, west end of Reward Lake.”
  – Multimodal: [drawing rectangle] “Add dock.”
Myth 6: Multimodal integration involves redundancy of content between modes
Complementary, not redundant
- Multimodal input is actually mostly complementary, not redundant.
- Speech and pen give different semantic information (see the fusion sketch after this list):
  – subject, verb, and object are spoken,
  – location is given with the pen.
- Even during multimodal correction of errors, redundant information is given less than 1% of the time.
- During human communication, spontaneous speech and gesturing do not involve duplicate information.
- Designers of multimodal systems therefore should not expect to rely on duplicated information when processing multimodal language.
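Because the modes fill different semantic slots, fusion can be frame merging rather than voting over duplicated content. The following is a minimal sketch under that assumption; the slot names and the conflict-as-error policy are illustrative, not the design of any particular system.

```python
def fuse_frames(speech_frame, pen_frame):
    """Merge complementary partial frames from speech and pen.

    Each mode fills the slots it is good at; overlap is rare, so a
    conflict is treated as a fusion error rather than resolved by voting.
    """
    fused = dict(speech_frame)
    for slot, value in pen_frame.items():
        if slot in fused and fused[slot] != value:
            raise ValueError(f"conflicting values for slot '{slot}'")
        fused[slot] = value
    return fused

# "Add dock" [drawing rectangle on the map]
speech_frame = {"action": "add", "object": "dock"}
pen_frame = {"location": (47.2, 98.6), "shape": "rectangle"}
print(fuse_frames(speech_frame, pen_frame))
# {'action': 'add', 'object': 'dock', 'location': (47.2, 98.6), 'shape': 'rectangle'}
```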
Myth 7: Individual error-prone recognition technologies combine multimodally to produce even greater unreliability
Unimodal errors are corrected
- 1. Users may select the least error-prone modality.
- 2. Users may switch modality when errors occur.
- 3. Mutual disambiguation: one modality can resolve ambiguity in the other.
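Mutual disambiguation is what lets two error-prone recognizers yield a more reliable whole: hypotheses are scored jointly, so a correct speech hypothesis ranked second can be pulled up by the gesture it fits. A minimal sketch; the confidences and the compatibility table are invented for illustration.

```python
# N-best lists: (hypothesis, recognizer confidence)
speech_nbest = [("ditch", 0.55), ("dock", 0.45)]      # correct answer ranked second
gesture_nbest = [("point_at_lake_edge", 0.7), ("point_at_road", 0.3)]

# Which spoken commands make sense with which gesture targets
compatible = {
    ("dock", "point_at_lake_edge"),
    ("ditch", "point_at_road"),
}

def best_joint(speech_nbest, gesture_nbest):
    """Rescore all (speech, gesture) pairs jointly; incompatible pairs
    are pruned, so 'dock' is pulled up from second place."""
    scored = [
        (s_conf * g_conf, s, g)
        for s, s_conf in speech_nbest
        for g, g_conf in gesture_nbest
        if (s, g) in compatible
    ]
    return max(scored) if scored else None

print(best_joint(speech_nbest, gesture_nbest))
# (0.315, 'dock', 'point_at_lake_edge'): the gesture disambiguates the speech
```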
Myth 8: All users’ multimodal commands are integrated in a uniform way
Individual patterns
- There are large individual differences in integration patterns.
- A user keeps using the same pattern from the beginning to the end.
- Hence: multimodal systems that can detect and adapt to a user’s dominant integration pattern can considerably improve recognition rates.
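A minimal sketch of such adaptation, assuming the fusion component observes whether a user’s first few speech/pen commands overlap in time and widens its integration window for sequential integrators; the thresholds and window lengths are illustrative assumptions.

```python
def classify_integrator(overlap_flags):
    """Classify a user's dominant integration pattern from their first
    few multimodal commands: 'simultaneous' if speech and pen usually
    overlap in time, otherwise 'sequential'."""
    simultaneous = sum(overlap_flags) > len(overlap_flags) / 2
    return "simultaneous" if simultaneous else "sequential"

def fusion_window(pattern):
    # A sequential integrator needs a longer window to catch speech
    # that trails the pen input (durations in seconds, illustrative).
    return 1.0 if pattern == "simultaneous" else 4.0

# First three commands: pen and speech never overlapped
pattern = classify_integrator([False, False, False])
print(pattern, fusion_window(pattern))   # sequential 4.0
```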
Myth 9: Different input modes are capable of transmitting comparable content
Strict Multimodality
- Strict modality redundancy:
  – All user actions should be possible to express using each modality.
  – All system information should be possible to present in each modality.
- Motivation:
  – Flexibility, predictability
  – “Design for all”
Coupling content & modality
- All modalities are not equal for all messages.
- Speech/writing can convey much information, but complex spatial shapes, relations among graphic objects, or precise location information are difficult…
  – …but trivial to sketch using a pen.
- Speech delivers information directly and intentionally,
  – but gaze reflects the speaker’s focus of interest more passively and unintentionally.
- Hence: adapt the input modality to the task.
Myth 10: Enhanced efficiency is the main advantage of multimodal systems
Two combination hypotheses
- 1. The combination of human output channels effectively increases the bandwidth of the human–machine channel.
  – This has been observed in many empirical studies of multimodal human–computer interaction.
- 2. Adding an extra output modality requires more neurocomputational resources, which leads to deteriorated output quality. The effective bandwidth is reduced.
  – Two types of effects are usually observed:
    - a slow-down of all output processes, and
    - interference errors, because selective attention cannot be divided between the increased number of output channels.
  – Two examples of this: writing while speaking, and speaking while driving a car.
Other advantages than speed
- Only a 10% speed-up with multimodal pen/voice interaction.
- But there are other advantages:
  – Task-critical errors and disfluencies can drop by 36–50%.
  – Users prefer to interact multimodally.
  – Flexibility to alternate modes to avoid overexertion.
  – Error avoidance and easier error recovery.
  – Wearability.
Three experiments on combining modalities
The AdApt multimodal dialogue system
- 1. Coordination of modalities
- 2. Visualising constraints
- 3. Turn-taking signals
Two ways of referring to objects
- Referring by descriptions
  – Using language to point out an object
- Referring by “deictic” expressions
  – Using gestures to point out an object
  – “this”, “that”
Referring by descriptions
U: “Are there any apartments around Karlaplan?”
S: “There are five apartments around Karlaplan.” (highlights Karlaplan, shows five apartments)
U: “What does this apartment cost?” (points at an apartment)
S: “The red apartment at Karlaplan costs 3,750,000 crowns.”
Referring by “deictic” expressions
U: “Are there any apartments around Karlaplan?”
S: “There are five apartments in this area.” (highlights Karlaplan, shows five apartments)
U: “What does this apartment cost?” (points at an apartment)
S: “This apartment costs 3,750,000 crowns.” (the apartment blinks)
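The two referring strategies call for different resolution evidence: deictic expressions resolve against the most recent pointing gesture, descriptions against object attributes. Below is a minimal, hypothetical resolver along those lines; the apartment records and attribute names are invented for illustration and are not the AdApt implementation.

```python
apartments = [
    {"id": 1, "color": "red", "area": "Karlaplan", "price": 3_750_000},
    {"id": 2, "color": "blue", "area": "Karlaplan", "price": 2_900_000},
]

DEICTIC = {"this", "that", "it"}

def resolve_reference(words, last_pointed_id=None, **attributes):
    """Resolve a referring expression to an apartment.

    Deictic terms ('this apartment' + pointing) use the gesture;
    descriptions ('the red apartment') match attribute constraints.
    """
    if DEICTIC & set(words) and last_pointed_id is not None:
        return next(a for a in apartments if a["id"] == last_pointed_id)
    matches = [a for a in apartments
               if all(a.get(k) == v for k, v in attributes.items())]
    return matches[0] if len(matches) == 1 else None  # ambiguous or no match

# "What does this apartment cost?" (points at apartment 2)
print(resolve_reference({"this", "apartment"}, last_pointed_id=2)["price"])
# "How much does the red apartment cost?"
print(resolve_reference({"red", "apartment"}, color="red")["price"])
```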
Coordination of modalities
- How does the system’s choice of referring expressions affect the user’s choice of referring expressions in a multimodal dialogue system?
- Motivation
  – Reduce variation in system input
  – Usability
  – Error handling
Motivation: reduce variation
- Referring expressions
  – Deictic:
    - “How much does it cost?” (clicking on an apartment)
  – Descriptions:
    - “How much does the red apartment cost?”
    - “How much does the apartment at Karlavägen 108 cost?”
  – Anaphora:
    - “How much does it cost?” (local anaphora)
    - “How much did the apartment we spoke about before cost?” (global anaphora)
Motivation: usability
- Referring expressions can be seen as a device for selecting objects in a multimodal user interface.
  – Apparent in the graphical user interface (local context)
  – Not apparent in the graphical user interface (global context)
- The way the user and the system talk about objects could affect which means of selecting objects will be available to the user.
Motivation: error handling
- The user should be encouraged to use the least error-prone means of expression.
- Different modalities and means of expression could be more or less error-prone for different users.
- The user should be encouraged to alternate means of expression when errors occur (Oviatt 1996).
Coordination of expressions
- Garrod & Anderson (1987)
  – Conceptual pacts
  – Human–human dialogue
- Zoltan-Ford (1991)
  – Verbal shaping
  – Human–computer dialogue
- Brennan (1996)
  – Lexical entrainment
  – Embedded and exposed modeling
Two versions of the system
- 1. Consistently referring by descriptions
  – Using language to point out an object
- 2. Consistently referring by “deictic” expressions
  – Using gestures to point out an object
  – “this”, “that”
Experimental design
- Wizard-of-Oz setup
- Between-group design
- 16 subjects, 8 in each group
- Task: to find two apartments that matched some constraints that were gradually presented as pictures.
- The task was designed to evoke global anaphora.
References to apartments
[Chart: per-group frequency of color descriptions*, non-color descriptions, deictic terms*, and pointing gestures*, comparing the Description group and the Deictic group (scale 0–100).]
Experiment 2: Visualising constraints
Example Dialogue
USER: I would like a two-room apartment in this area [showing with mouse]
SYS: How much are you willing to pay?
USER: One million
SYS: Is there something in particular you want the apartment to have?
USER: Yes... a tiled stove would be nice and... perhaps a balcony
SYS: There are no apartments that satisfy your constraints
Constraints in Information Browsing
- It is not a fixed set of constraints that needs to be specified before found apartments are presented.
- Users give some constraints and then ask about others when apartments are presented.
- To navigate among a large number of apartments, constraints must be added, deleted or changed (see the sketch below).
- The set of constraints that is relevant to specify differs to some degree between users.
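A minimal sketch of a constraint store supporting exactly these operations, with hypothetical slot names; adding an already-present slot changes it, so corrections need no special machinery.

```python
class ConstraintSet:
    """The current set of grounded search constraints, revisable turn by turn."""

    def __init__(self):
        self.constraints = {}

    def add(self, slot, value):
        self.constraints[slot] = value     # re-adding a slot changes it

    def delete(self, slot):
        self.constraints.pop(slot, None)

cs = ConstraintSet()
cs.add("rooms", 2)                 # "a two room apartment"
cs.add("max_price", 1_000_000)     # "one million"
cs.add("balcony", True)            # "perhaps a balcony"
cs.delete("balcony")               # relaxed by the user when nothing matched
cs.add("rooms", 3)                 # "three rooms, actually": changed, not re-asked
print(cs.constraints)              # {'rooms': 3, 'max_price': 1000000}
```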
Problems related to constraint management
- How should the system communicate:
  – which constraints it ‘heard’?
  – how these constraints were (re)interpreted?
  – the current set of search constraints?
- How can the system make it easier for the user to change the current set of constraints?
- Should the system automatically relax user constraints?
Why constraint grounding is needed
- Users want a feeling of control in interfaces, which is hard to obtain if the system relaxes constraints automatically.
- Error handling is made easier, since users can detect and correct errors at once.
- Finding apartments can be seen as a process of grounding constraints.
Verbal grounding
- In every turn:
  – explicit: “Did you say two rooms?”
  – implicit: “How much are you willing to pay for the two-room apartment with a balcony?”
- Resumé before presenting search results: “You are looking for two-room apartments in the old town that cost less than two million and that have a balcony and wooden floor...”
  – explicit: “...Is that correct?”
  – implicit: “...These are shown on the map”
- However, repeated confirmation turns give users the impression that the system is slow, and make the human–computer dialogue appear less natural (Boyce 1999).
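One way to balance grounding against this slowness problem is to choose the grounding move per constraint from recognition confidence: explicit confirmation only when confidence is low, implicit embedding otherwise. The sketch below illustrates the idea; the thresholds and phrasings are assumptions, not the AdApt strategy.

```python
def choose_grounding(heard, confidence, next_question,
                     explicit_below=0.5, implicit_below=0.85):
    """Select a verbal grounding move for a newly heard constraint.

    Low confidence  -> explicit confirmation (costs an extra turn)
    Mid confidence  -> implicit confirmation, embedded in the next question
    High confidence -> no verbal confirmation (or graphical grounding only)
    """
    if confidence < explicit_below:
        return f"Did you say {heard}?"
    if confidence < implicit_below:
        return f"{next_question.rstrip('?')} for the {heard} apartment?"
    return next_question

q = "How much are you willing to pay?"
print(choose_grounding("two rooms", 0.42, q))  # "Did you say two rooms?"
print(choose_grounding("two rooms", 0.70, q))  # implicit, embedded in the question
print(choose_grounding("two rooms", 0.95, q))  # plain next question
```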
Graphical grounding
- Could reduce the cognitive load on the verbal channel.
- Faster dialogue than verbal confirmations.
- Icons are less distracting than written feedback.
- Could make error handling easier:
  – makes it natural to change one constraint at a time
  – fewer verbal meta-utterances
  – makes the recognition grammar simpler during corrections (selecting the price icon while saying “two million” vs. saying “no, I did not say two thousand, but two million”)
- Facilitates importance ranking (lock icon or importance scale).
Using Icons for Graphical Grounding
- According to the ‘idiomatic paradigm’, users learn to connect certain icons to functions, in the same way that people use idioms in language (Cooper 1995).
- In our system, these things are visualized:
  – the constraints that the user provides during the initial part of the dialogue
  – the found constraints in an utterance, not its function
  – the current set of constraints originating from the whole dialogue
- Three kinds of visualized constraint types:
  – Numbers
  – Intervals
  – Set members
- Examples of constraint icons
Direct Manipulation of the Constraint Icons
Experiment 3: Turn-taking signals
Importance of turn-taking signals
U: what does it cost?
(the system needs some time to answer)
S: the ap...
U: how mu...
(the user is uncertain whether the system has noticed the user’s utterance)
S: sorry, I didn’t understand
- Confusion about turn-taking may lead to more false “barge-ins” (the user interrupting the system).
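The underlying problem is that the user cannot distinguish “the system heard me and is preparing an answer” from “the system missed me”. A minimal sketch of the remedy the experiment tests: have the agent emit a visible cue as soon as it takes the turn, before the answer is ready. The states and cues here are illustrative, not the actual AdApt gestures.

```python
import time

class TurnTakingAgent:
    """Emit visible turn-taking cues so the user knows the system
    has taken the turn, even while the answer is being prepared."""

    def show(self, cue):
        print(f"[agent: {cue}]")

    def on_user_turn_end(self, utterance):
        self.show("raises eyebrows")         # "I heard you"
        self.show("looks away, thinking")    # "I have the turn, please wait"
        answer = self.compute_answer(utterance)
        self.show("looks back at user")      # "here comes the answer"
        print(f"SYS: {answer}")

    def compute_answer(self, utterance):
        time.sleep(0.5)  # stand-in for database search and generation
        return "The red apartment costs 3,750,000 crowns."

TurnTakingAgent().on_user_turn_end("what does it cost?")
```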
Turn-taking signals
Barge-ins
- Measured: % of system responses with user “barge-in”.
- The system became slower towards the end of the sessions.
- The no-feedback group waited a long time between utterances at the beginning of the test.
[Chart: barge-in rates for the Gestures, Symbols and No feedback groups, over the whole session and over the last 2/3 of the session.]
User questionnaire
- Number of volunteered comments about turn-taking in the comment section of the survey.
- Positive comments were impossible for the no-feedback group.
- “Somewhat critical” means that the user understood that there were turn-taking signals, but did not quite like them.
[Chart: total feedback comments (positive, somewhat positive, somewhat critical, negative) for the Gestures, Symbols and No feedback groups.]
User survey
- Results of pairwise comparisons in a 10-question user satisfaction evaluation.
- Questions adopted from PARADISE.
- 5-step scale (0–4).
[Table: pairwise preferences among the Gestures, Symbols and No feedback versions.]
Human-human interaction as a model for human-computer interaction
Human–human Communication
- A model for human–computer interaction?
- Why look at the human–human interface?
  – Users already know how to interact.
Why not simply copy human–human interaction?
- Computers have other capabilities and restrictions.
- Human communication patterns may not always be efficient.
Metaphors for spoken interaction
- GUI metaphor vs. human metaphor.
- The goal is not to build a fully human replica, but to build a system that humans interact with as if it were another human.
Replicate Human Interaction in Human–Computer Interaction?
- Requirements on the computer:
  – Human-oriented perception
  – Human-readable action
- e.g.: ENGAGED, ACQUIRED, Pointing (53,92,12), Fixating (47,98,37), Saying “/o’ver[200] \/there[325]”
Human-oriented perception
- Person detection and tracking
- Expression classification
- Speech/prosody recognition
- Touch sensing
Human-readable action
– Express engagement, emotions, locus of attention
– Speech/prosody generation
– Sensorimotor feedback
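The perception capabilities above only help if their outputs are fused into a single picture of the user, as in the ENGAGED/Pointing/Fixating/Saying example on the earlier slide. A minimal sketch of such a fused state record follows; the field names are illustrative assumptions.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

@dataclass
class PerceptionState:
    """Fused view of the user, filled in by the separate recognizers."""
    engagement: str = "IDLE"                             # e.g. IDLE / ENGAGED
    pointing_at: Optional[Tuple[int, int, int]] = None   # pointing target (x, y, z)
    fixating: Optional[Tuple[int, int, int]] = None      # gaze target
    saying: Optional[str] = None                         # words with prosodic annotation

state = PerceptionState()
state.engagement = "ENGAGED"                # person detection and tracking
state.pointing_at = (53, 92, 12)            # gesture recognition
state.fixating = (47, 98, 37)               # gaze tracking
state.saying = "/o'ver[200] \\/there[325]"  # speech and prosody recognition
print(state)
```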