Building a Voice Assistant for Enterprise Manju Vijayakumar Lead - - PowerPoint PPT Presentation

building a voice assistant for enterprise
SMART_READER_LITE
LIVE PREVIEW

Building a Voice Assistant for Enterprise Manju Vijayakumar Lead - - PowerPoint PPT Presentation

Building a Voice Assistant for Enterprise Manju Vijayakumar Lead Software Engineer, Salesforce @vmanju QConSF, Nov 2018 Agenda Why Voice? Demo of Einstein Voice Assistant Conversational AI Ecosystem Natural Language


slide-1
SLIDE 1

Building a Voice Assistant for Enterprise

@vmanju QConSF, Nov 2018

Manju Vijayakumar Lead Software Engineer, Salesforce

slide-2
SLIDE 2

Agenda

  • Why Voice?
  • Demo of Einstein Voice Assistant
  • Conversational AI

○ Ecosystem ○ Natural Language Understanding (NLU)

  • Challenges
  • Future

○ Considerations ○ What’s next for NLP and AI

slide-3
SLIDE 3

Voice Recognition - A Story in 3 pictures

Source on Twitter

slide-4
SLIDE 4
slide-5
SLIDE 5
slide-6
SLIDE 6

From programmatic to natural interactions

Computing is Evolving

Point & Click Command Line Touch Voice

slide-7
SLIDE 7

Deliver an intelligent assistant that leverages Voice and NLU capabilities to understand, and support users in accomplishing their goals

slide-8
SLIDE 8

EINSTEIN VOICE DEMO

Pilot

slide-9
SLIDE 9

Meet Amy, a busy salesperson

Amy needs to update Salesforce

slide-10
SLIDE 10
slide-11
SLIDE 11

How did Voice Assistant help Amy?

Accuracy & timeliness of data capture Visible to the team Unstructured data -> Structured data

  • Productive
  • No system expertise
slide-12
SLIDE 12

Building Blocks of Voice Assistant

slide-13
SLIDE 13

ASR Automatic Speech Recognition

slide-14
SLIDE 14

ASR Automatic Speech Recognition NLU Natural Language Understanding

slide-15
SLIDE 15

ASR Automatic Speech Recognition NLU Natural Language Understanding CRM Integration

slide-16
SLIDE 16

Conversational AI Ecosystem

slide-17
SLIDE 17

Conversational AI Ecosystem

Einstein Platform

Automatic Speech Recognition Models Natural Language Understanding Models Salesforce CRM Metadata

slide-18
SLIDE 18

Conversational AI Ecosystem

Einstein Platform Conversational API

Named Entity Recognition

Entity Resolution Text Classification Context Management Automatic Speech Recognition Models Natural Language Understanding Models Salesforce CRM Metadata Slot Filling

slide-19
SLIDE 19

Conversational AI Ecosystem

Einstein Platform Conversational API

Named Entity Recognition

Entity Resolution Text Classification Context Management Automatic Speech Recognition Models Natural Language Understanding Models Salesforce CRM Metadata Slot Filling

Einstein Voice Assistant Einstein Voice Bots Smart Speakers* Voice Navigation*

slide-20
SLIDE 20

Conversational AI Service

Conversational API

Named Entity Recognition

Entity Resolution Text Classification Context Management Slot Filling

slide-21
SLIDE 21

Conversational AI Service

Conversational API

Named Entity Recognition

Entity Resolution Text Classification Context Management Slot Filling

slide-22
SLIDE 22

Named Entity Recognition (NER)

The committee of 30 government and university scientists and engineers, led by McCleese, was asked to recommend to the space agency by the end of this month a rationale and strategy for precursor flights and the sample-return missions.

The ‘O’ committee ‘O’

  • f

‘O’ ... ‘O’ McCleese ‘PER’ the ‘DATE’ end ‘DATE’

  • f

‘DATE’ this ‘DATE’ month ‘DATE’ *CoNLL format

slide-23
SLIDE 23

Named Entity Recognition (NER)

The committee of 30 government and university scientists and engineers, led by McCleese, was asked to recommend to the space agency by the end of this month a rationale and strategy for precursor flights and the sample-return missions.

NER7 model recognizes 7 entities: Person, Organization, Location, Date, Time, Money, Percentage

The ‘O’ committee ‘O’

  • f

‘O’ ... ‘O’ McCleese ‘PER’ the ‘DATE’ end ‘DATE’

  • f

‘DATE’ this ‘DATE’ month ‘DATE’ *CoNLL format

slide-24
SLIDE 24

What are the entities in the text ?

PERSON ORGANIZATION DATE MONEY Follow up call with Chris in two weeks DATE (two weeks is normalized to 2018/07/15)

slide-25
SLIDE 25

Conversational AI Service

Conversational API

Named Entity Recognition

Entity Resolution Text Classification Context Management Slot Filling

slide-26
SLIDE 26

Entity Resolution - Is this entity in my CRM ?

Salesforce CRM DB Records matched for ‘Acme’ Send records to user to disambiguate

slide-27
SLIDE 27

Conversational AI Service

Conversational API

Named Entity Recognition

Entity Resolution Text Classification Context Management Slot Filling

slide-28
SLIDE 28

Context Management - What data do we have so far ?

{ "context": { "Organization": { "id": "001XXXX", "name": "Acme Corp" }, }, ... } Do we have

  • rganization

in the context?

slide-29
SLIDE 29

Conversational AI Service

Conversational API

Named Entity Recognition

Entity Resolution Text Classification Context Management Slot Filling

slide-30
SLIDE 30

{ "probabilities": [ { "label": "CREATE", "probability": 0.9904295 }, { "label": "UPDATE", "probability": 0.009345241 }, ... ] }

Text Classification - What are the intents ?

Acme Corp’s timeline for purchasing Marketing software is set for July 1st and may purchase up to $250K of product Follow up call with Chris in two weeks

Language API Intent model

Prediction request JSON

slide-31
SLIDE 31

Conversational AI Service

Conversational API

Named Entity Recognition

Entity Resolution Text Classification State Management Slot Filling

slide-32
SLIDE 32

Slot Filling - What are the slots for each action item ?

Fill in the date and money slots for Update action Fill in the date slot and person slot for Create Task action. Here, date is normalized: In 2 weeks => 10/7/18

slide-33
SLIDE 33

Challenges

slide-34
SLIDE 34

Data challenges

Heterogenous database How do you make it work for every customer schema ?

AccountID Name Phone AccountID Name Phone Bank Account

  • Customers can define custom schemas
  • Schemas are not consistent
slide-35
SLIDE 35

Data challenges

Inconsistent data Which Acme Corp. did you mean ?

  • Lots of duplicates
  • Identify the most relevant ‘Acme’
  • Affects user experience
slide-36
SLIDE 36

Automatic Speech Recognition is not perfect

DOMAIN SPECIFIC JARGON AUDIO ENVIRONMENT ACCENTS & LINGUISTIC PROFILES

slide-37
SLIDE 37

Named Entity Recognition is not perfect

Named Entity Recognition is easy for humans but hard for machines

slide-38
SLIDE 38

Named Entity Recognition is not perfect

Today, JP Morgan and I spoke about... ..the san juan center is led by a team of scientists.. ..Man joy and I met today at Starbucks to discuss..

Cannot identify san juan as a location due to case sensitivity “Manju” misspelled as “Man Joy”. Misspelled pronouns are hard to catch Is JP Morgan a company or a person ?

slide-39
SLIDE 39

Future Considerations

slide-40
SLIDE 40

Optimized Models

  • Configurable
  • Normalized

Feedback

  • Capture Feedback
  • Retrain Models

Voice

  • Guided user experience
  • Multi channel
slide-41
SLIDE 41

Deep learning

What’s next for NLP and AI?

?

Architecture engineering for single tasks Machine learning with feature engineering

slide-42
SLIDE 42

Deep learning

What’s next for NLP and AI?

Single multitask model

Architecture engineering for single tasks Machine learning with feature engineering

To learn more: decaNLP.com

slide-43
SLIDE 43

Voice Recognition - A Story in 3 pictures

Source on Twitter

slide-44
SLIDE 44

Voice Recognition - The Complete story

Source on Twitter

slide-45
SLIDE 45
slide-46
SLIDE 46

Key Takeaways

Language understanding is AI Complete. Focus on solving customer pain points in your domain. Voice will become the new User Interface.

slide-47
SLIDE 47
slide-48
SLIDE 48

Resources

Einstein.ai - published papers, research etc. Einstein.ai/careers - We are hiring! @vmanju