Wrapping Up Ling575 Spoken Dialog Systems June 5, 2013 Roadmap - - PowerPoint PPT Presentation

wrapping up
SMART_READER_LITE
LIVE PREVIEW

Wrapping Up Ling575 Spoken Dialog Systems June 5, 2013 Roadmap - - PowerPoint PPT Presentation

Wrapping Up Ling575 Spoken Dialog Systems June 5, 2013 Roadmap Overview Distinctive factors in dialog: Human-human Human-computer Dialog components & dialog management Specialized topics: Detailed


slide-1
SLIDE 1

Wrapping Up

Ling575 Spoken Dialog Systems June 5, 2013

slide-2
SLIDE 2

Roadmap

— Overview

— Distinctive factors in dialog:

— Human-human — Human-computer

— Dialog components & dialog management — Specialized topics:

— Detailed analysis of:

— Distinctive factors — Techniques and applications

— Discussion:

— Trends, techniques, interrelations

slide-3
SLIDE 3

Characteristics of Dialog

— Human-human:

— Multi-party interaction:

— Flexible turn-taking, mixed initiative

— Speech acts:

— Actions via speech, levels of interpretation

— Implicature:

— Grice’s maxims

— Cooperativity & closure:

— Grounding and levels of display

— Corrections, repairs, and confirmations

slide-4
SLIDE 4

Characteristics of Dialog

— Human-computer – most deployed systems

— Multi-party interaction:

slide-5
SLIDE 5

Characteristics of Dialog

— Human-computer – most deployed systems

— Multi-party interaction:

— Rigid silence-based turn-taking, system or “mixed” initiative

— Speech acts:

slide-6
SLIDE 6

Characteristics of Dialog

— Human-computer – most deployed systems

— Multi-party interaction:

— Rigid silence-based turn-taking, system or “mixed” initiative

— Speech acts:

— Actions via speech: dialog acts, NLU

— Implicature:

slide-7
SLIDE 7

Characteristics of Dialog

— Human-computer – most deployed systems

— Multi-party interaction:

— Rigid silence-based turn-taking, system or “mixed” initiative

— Speech acts:

— Actions via speech: dialog acts, NLU

— Implicature:

— Um… depends on dialog management, NLU

— Grounding:

slide-8
SLIDE 8

Characteristics of Dialog

— Human-computer – most deployed systems

— Multi-party interaction:

— Rigid silence-based turn-taking, system or “mixed” initiative

— Speech acts:

— Actions via speech: dialog acts, NLU

— Implicature:

— Um… depends on dialog management, NLU

— Grounding:

— Confirmation: implicit/explicit: learned? — Corrections, repairs: problematic

— Why?

slide-9
SLIDE 9

Characteristics of Dialog

— Human-computer – most deployed systems

— Multi-party interaction:

— Rigid silence-based turn-taking, system or “mixed” initiative

— Speech acts:

— Actions via speech: dialog acts, NLU

— Implicature:

— Um… depends on dialog management, NLU

— Grounding:

— Confirmation: implicit/explicit: learned? — Corrections, repairs: problematic

— Constrained by complexity, processing, speed, etc

slide-10
SLIDE 10

Dialog System Components

— HMM-based ASR models — NLU: call-routing, semantic grammars — Dialog acts and recognition — Dialog management:

— Finite-state — Frame-based

— VoiceXML

— Information state — Statistical dialog management

— Lots of examples!

slide-11
SLIDE 11

Topics

— In-depth discussions:

— Computational approaches to make human-computer

interaction more like human-human interaction — Many issues raised in characterizing dialog:

— Multi-party

slide-12
SLIDE 12

Topics

— In-depth discussions:

— Computational approaches to make human-computer

interaction more like human-human interaction — Many issues raised in characterizing dialog:

— Multi-party: multi-party interaction, turn-taking, initiative — Grounding

slide-13
SLIDE 13

Topics

— In-depth discussions:

— Computational approaches to make human-computer

interaction more like human-human interaction — Many issues raised in characterizing dialog:

— Multi-party: multi-party interaction, turn-taking, initiative — Grounding: Miscommunication & repair, incremental processing — Interpretation:

slide-14
SLIDE 14

Topics

— In-depth discussions:

— Computational approaches to make human-computer

interaction more like human-human interaction — Many issues raised in characterizing dialog:

— Multi-party: multi-party interaction, turn-taking, initiative — Grounding: Miscommunication & repair, incremental processing — Interpretation: Reference, affect, subjectivity, personification,

information structure, prosody

— Multi-modality

— Applications and issues:

— Tutoring, machine translation, information-seeking — Non-native speech

slide-15
SLIDE 15

Interconnections

Sentiment Reference Persona Turn- taking Apps: MT Multi- party Prosody Tutoring Non- native Multi- modality Miscomm unication Info. Struct Increment Affect Initiative

slide-16
SLIDE 16

Interconnections

Sentiment Reference Persona Turn- taking Apps: MT Multi- party Prosody Tutoring Non- native Multi- modality Miscomm unication Info. Struct Increment Affect Initiative

slide-17
SLIDE 17

Techniques & Sources of Information

— Range of techniques:

slide-18
SLIDE 18

Techniques & Sources of Information

— Range of techniques:

— Deep processing, shallow processing, manual rules

— Machine learning:

slide-19
SLIDE 19

Techniques & Sources of Information

— Range of techniques:

— Deep processing, shallow processing, manual rules

— Machine learning:

— Anything from decision trees to POMDPs

— Information sources:

slide-20
SLIDE 20

Techniques & Sources of Information

— Range of techniques:

— Deep processing, shallow processing, manual rules

— Machine learning:

— Anything from decision trees to POMDPs

— Information sources:

— Acoustic, lexical, prosodic, timing, syntactic,

semantic, pragmatic, etc Multimodal: gaze, gesture, etc

— Integration

slide-21
SLIDE 21

Techniques & Sources of Information

— Range of techniques:

— Deep processing, shallow processing, manual rules

— Machine learning:

— Anything from decision trees to POMDPs

— Information sources:

— Acoustic, lexical, prosodic, timing, syntactic, semantic,

pragmatic, etc Multimodal: gaze, gesture, etc

— Integration: Complex and varied

— Huge feature vectors, tandem models, blackboards, learned

— Substantial strides, but huge remaining challenges

slide-22
SLIDE 22

Questions?

— Favorite topic? — Most surprising result? — Most obvious result? — Most surprising gap?