multimodal interaction for next generation networks
play

Multimodal Interaction for Next Generation Networks Jrgen Sienel - PowerPoint PPT Presentation

Multimodal Interaction for Next Generation Networks Jrgen Sienel Alcatel Research & Innovation Stuttgart, Germany W3C MM Workshop Sophia Antipolis July 19, 2004 Alcatel e-Business Networking www.alcatel.com/enterprise Outline


  1. Multimodal Interaction for Next Generation Networks Jürgen Sienel Alcatel Research & Innovation Stuttgart, Germany W3C MM Workshop Sophia Antipolis July 19, 2004 Alcatel e-Business Networking www.alcatel.com/enterprise

  2. Outline Motivation Multimodal Applications Multimodal Architecture Approaches Standardisation Issues Conclusion 19- 7 - 2004 Page 2 Alcatel R&I

  3. Converged Functionality Access Information Services through Communication Networks to deliver next generation services, across the domains of enterprise, fixed and mobile, across disparate devices IT Data Converged Functionality Voice Private Public Domains 19- 7 - 2004 Page 3 Alcatel R&I

  4. Motivation Mobile Environment 19- 7 - 2004 Page 4 Alcatel R&I

  5. Human – Machine Communication Customizable (operator, user), adaptive to user profile, preference and terminal capability Multimodal User Interaction Voice/ Graphic Mobile Fixed Graphic Voice Voice/ Graphic Public Car Home Areas 19- 7 - 2004 Page 5 Alcatel R&I

  6. Multimodal Interaction Reasons Human perception allows the parallel processing of multiple input channels Higher „Bandwidth“ of communication (Non-verbal) Concentration on strength of each modality Selection of most appropriate modality depending on environment, e.g. noisy context, e.g. driving in a car complexity of task, e.g. directory assistance device capability , e.g. small displays preferences and disabilities of the user, e.g. visually impaired 19- 7 - 2004 Page 6 Alcatel R&I

  7. Multimodal Applications Operators Visions Services / Features Enablers Application Area Telephone Services Voice-activated dialing, Call Handling Automatic Speech Recognition Information Services Voice Portals, Wireless Web, Telematics Text-to-Speech Messaging Handling of Voice mail, email and UM, IM Web Interfaces Operator Services Voice deputy, Directory Assistance Enterprise Applications Call/Contact center Multimodal Interaction Mobile Commerce Multi Modal Event Notification, Mobile transactions User Identification Security Services Speaker verification, Biometrics 19- 7 - 2004 Page 7 Alcatel R&I

  8. Multimodal Application Instant Messaging ▼ Adaptation to terminal capability and user preference ▼ Flexible combination of visual and acoustical interaction ▼ Customization 19- 7 - 2004 Page 8 Alcatel R&I

  9. Approaches Multimodal Browser Microphone Keyboard Mouse Pen Display Loudspeaker User Interface API Handwriting Recognition Application HTTP Web- API Server Speech Recognition Browser API Speech Generation TTS Client Device 19- 7 - 2004 Page 9 Alcatel R&I

  10. Multimodal Browser Some pros and cons Pros: All functionality in one device Just one handler for a document Easy synchronisation methods of graphics and voice Direct interpretation and handling of sensors Cons: Limited resources on mobile devices Dependent on the device Multilinguality may be missing Interaction Management has no deeper application knowledge, can only interpret the document Transfer of application data (e.g. grammars) might be more expensive than transfer of speech 19- 7 - 2004 Page 10 Alcatel R&I

  11. Multimodal Architectures Server based Approach Application Server Server Application Terminal Terminal XML page Browser/ Result Multi-modal Graphics signalling Interpreter Speech output SD Speech input SE recognition- grammar SE: Speech coding or DSR results SD: Speech decoding or TTS Distributed Speech Recognition: Speech Server Backend processing will be in the Speech (Recognition, TTS) Server Speech Server Speech Server 19- 7 - 2004 Page 11 Alcatel R&I

  12. Server based Approach Some pros and cons Pros: Exact knowledge of the application Handling of meta dialog Storage of voice records for security reasons (banking application) Easy support of multilingual applications Cons: How to get detailed knowledge about the class of device Direct interpretation and handling of sensor data in terminal Harder to synchronise (delays) No sharing of ASR and TTS resources 19- 7 - 2004 Page 12 Alcatel R&I

  13. Approaches Distributed Architecture CLIENT SIDE SERVER SIDE Media Server ASR Front-end TTS HWR NGN MuMo Backend Application Internet Server Web Browser Dialogmanager Resource W3C HTTP Manager MM Mark- Context / up Presence MuMo Proxy Applications Interaction Server 19- 7 - 2004 Page 13 Alcatel R&I

  14. Multimodal Browser Some pros and cons Pros: On demand functionality (use of local functionality where possible) Storage of voice records for security reasons (banking application) Could support of multilingual applications Better interpretation and handling of sensors and device capabilities Optimised network traffic May support multiple devices Cons: Complex Synchronisation Interaction Management has no deeper application knowledge, can only interpret the document Higher standardisation effort needed Architecture may be not transparent for application developer 19- 7 - 2004 Page 14 Alcatel R&I

  15. Requirements for Standardisation Multimodal Framework and Components System and Environment Definition Result proposition (EMMA) Support of Distributed Processing (DOM) Interface to Media Processing Modules (SpeechSc) Improved device descriptions and presence (DI, OMA) WebService Interface for component binding Interface on parallel devices Definition of modality independent dialogs and content 19- 7 - 2004 Page 15 Alcatel R&I

  16. Conclusion Next Generation Networks will provide converged IT and communication access to a set of existing and new services and application Independence from the end-user device is must Multimodal Interfaces support the usability of such services and devices A network centric architecture offering On-Demand capabilities can support the multi device access Standardisation has to be continued, more interaction between the organisations might be needed to fulfil the common vision 19- 7 - 2004 Page 16 Alcatel R&I

  17. www.alcatel.com 19- 7 - 2004 Page 17 Alcatel R&I

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend