Overview: Multimodal Architecture and Interfaces Deborah Dahl W3C - - PowerPoint PPT Presentation

overview multimodal architecture and interfaces
SMART_READER_LITE
LIVE PREVIEW

Overview: Multimodal Architecture and Interfaces Deborah Dahl W3C - - PowerPoint PPT Presentation

Overview: Multimodal Architecture and Interfaces Deborah Dahl W3C Workshop on Multimodal Architecture and Interfaces 16-17 November 2007 W3C/Keio, Fujisawa, Japan Conversational Technologies MMI Architecture A loosely-coupled, event-based


slide-1
SLIDE 1

Conversational Technologies

Overview: Multimodal Architecture and Interfaces

Deborah Dahl W3C Workshop on Multimodal Architecture and Interfaces 16-17 November 2007 W3C/Keio, Fujisawa, Japan

slide-2
SLIDE 2

Conversational Technologies

MMI Architecture

A loosely-coupled, event-based architecture for integrating multiple modalities into applications

  • Modality components can be local or distributed
  • All communication is event-based
  • Based on a set of standard life-cycle events
  • Components can also expose other events as required
  • Encapsulation protects component data
  • Encapsulation enhances extensibility to new modalities
  • Represents user inputs in a standard way (EMMA)
slide-3
SLIDE 3

Conversational Technologies

Constituents

  • Defines five basic components of an MMI system

– Runtime Framework or Browser: initializes application and runs markup – Interaction Manager: coordinates modality components and provides application flow – Modality Components: provide modality capabilities such as speech, pen, keyboard, mouse – Data Model: handles shared data (part of the Interaction Manager) – DCCI (Delivery Context Client Interfaces): device properties and user preferences

  • Most recent Working Draft, 11 December, 2006

http://www.w3.org/TR/mmi-arch/

slide-4
SLIDE 4

Conversational Technologies

MMI Architecture Principles

  • Runtime Framework communicates with Modality

Components through asynchronous events

  • Modality Components don’t communicate directly with

each other but indirectly through the Runtime Framework

  • Components must implement basic life cycle events,

may expose others

  • Modality components can be nested (e.g. a Voice Dialog

component like a VoiceXML <form>)

  • Components need not be markup-based
  • EMMA communicates users’ inputs to IM
slide-5
SLIDE 5

Conversational Technologies

Instantiated MMI Architecture

Runtime Framework Interaction Manager (SCXML, including Data Model) Interaction Manager (SCXML, including Data Model) DCCI DCCI Ink (InkML) Ink (InkML) Speech Interaction (VoiceXML + SSML + SRGS+SISR) Speech Interaction (VoiceXML + SSML + SRGS+SISR) Haptic Haptic Keyboard Keyboard Pointing Pointing

Life cycle events + EMMA Life cycle events + EMMA Life cycle events + EMMA Life cycle events + EMMA Life cycle events + EMMA

slide-6
SLIDE 6

Conversational Technologies

Life Cycle Events

Check status of MC Modality Runtime Framework StatusRequest Deactivate context Modality Runtime Framework ClearContext Resume processing Modality Runtime Framework Resume Acknowledge Resume Runtime Framework Modality ResumeResponse Send data values either either Data Acknowledge Prepare Runtime Framework Modality PauseResponse Acknowledge Cancel Runtime Framework Modality CancelResponse Stop processing Modality Runtime Framework Cancel Suspend processing Modality Runtime Framework Pause Acknowledge Start Runtime Framework Modality StartResponse Finished running Runtime Framework Modality Done Report status Runtime Framework Modality StatusResponse Run markup Modality Runtime Framework Start Acknowledge Prepare Runtime Framework Modality PrepareResponse Pre-load markup Modality Runtime Framework Prepare Send new context id Modality Runtime Framework NewContextResponse Request new context Runtime Framework Modality NewContextRequest

Purpose To From Event

slide-7
SLIDE 7

Conversational Technologies

Summary

  • MMI Architecture provides a general, clean

interface to a wide range of modality components

  • EMMA provides a standard and general way of

representing user inputs

  • Very easy to integrate new modalities
  • Loose coupling and lack of access to internal

modality data improves security