Interchangeable Modalities W3C Workshop on MultiModal Interaction - - PowerPoint PPT Presentation

interchangeable modalities
SMART_READER_LITE
LIVE PREVIEW

Interchangeable Modalities W3C Workshop on MultiModal Interaction - - PowerPoint PPT Presentation

Interchangeable Modalities W3C Workshop on MultiModal Interaction 22-23 July 2013, New York Background: iSpeech is a Text-to-Speech & Speech Recognition Company Enterprise Enterprise Mobile, Auto, Home, & Fast growing list of


slide-1
SLIDE 1

Interchangeable Modalities

W3C Workshop on MultiModal Interaction

22-23 July 2013, New York

slide-2
SLIDE 2

Background: iSpeech is a Text-to-Speech & Speech Recognition Company

Developer s

15,000+ devs in 12 months, 2x growth of

Enterprise

Fast growing list of Mobile, Auto, Home and Publishing Customers

Consumers

30+ million app downloads

Developer Experience

25,000+ developers

Enterprise

Mobile, Auto, Home, & Publishing Customers

Consumer Experience

30+ million app downloads

slide-3
SLIDE 3

Credibility: Developer Ecosystem

> 25K developers registered > 2 billion API calls serviced > 99.9% uptime .

Mobile Devs Mobile OEM/OS Auto Home Publishing

slide-4
SLIDE 4

Speech: New Frontiers

2 Mobile/Nav 1 Entertainment 3 eLearning 4 Telephony

Growth Following New Use Cases

Breakdown of Developers by Segment & Activity

Developers API Usage

slide-5
SLIDE 5

Challenges of Speech Technology

  • Many technologists have never ‘experienced’

working directly with speech technologies

  • Uncharted Technology Waters: Audio DSP, NLP,

Domain/Grammar/Lexicon, Multimodal UI

  • Speech mirrors humans; more like ‘wetware’

than ‘software’?

  • Life-cycle of continuous adaptations and QA
slide-6
SLIDE 6

Consideration: Speech Technology Value Chain

ASR NLP TTS

slide-7
SLIDE 7

Standards & HTLM5

  • 25,000 Developers X 10 ways to package

web services (APIs and SDKs) And that’s just

Cloud, dozens more embedded engines to account

  • HTML5 adoption – audio playback of TTS
  • k, audio recording (ASR) not widely used
  • Example: impact of mature standards on

use of Speech Technology: VoiceXML, SSML, SRGS, MRCP

slide-8
SLIDE 8

Talkz™ Case Study

  • Talkz - successor to Drivesafe.ly
  • Interchangeable Multimodal App
  • Available today through iTunes
slide-9
SLIDE 9
slide-10
SLIDE 10
slide-11
SLIDE 11
slide-12
SLIDE 12
slide-13
SLIDE 13
slide-14
SLIDE 14
slide-15
SLIDE 15

Section 508 1998 2000 Windows Narrator VoiceOver Mac OS X 2005 2006 YouTube Captions 1st Accesible Smart- phone 2009 2013 Global Public Inclusive Infrastructure (GPII)

Multi-Modal’s Silver Lining : Universal Accessibility

“The gap between usability and accessibility is narrowing and with it the digital divide between disabled and non-disabled people.” - Robin Christopherson, AbilityNet

slide-16
SLIDE 16

Conclusions for Multi-Modal Developers

  • Plan to Partner & Partner to Plan
  • Multimodal is a CENTRAL UI pillar,

not an after thought

slide-17
SLIDE 17

Craig Campbell Chief Evangelist ccampbell@iSpeech.org

www.iSpeech.org/developers