Wrapping Up
Ling575 Spoken Dialog Systems June 5, 2013
Wrapping Up Ling575 Spoken Dialog Systems June 5, 2013 Roadmap - - PowerPoint PPT Presentation
Wrapping Up Ling575 Spoken Dialog Systems June 5, 2013 Roadmap Overview Distinctive factors in dialog: Human-human Human-computer Dialog components & dialog management Specialized topics: Detailed
Ling575 Spoken Dialog Systems June 5, 2013
Human-human Human-computer
Detailed analysis of:
Distinctive factors Techniques and applications
Trends, techniques, interrelations
Flexible turn-taking, mixed initiative
Actions via speech, levels of interpretation
Grice’s maxims
Grounding and levels of display
Corrections, repairs, and confirmations
Rigid silence-based turn-taking, system or “mixed” initiative
Rigid silence-based turn-taking, system or “mixed” initiative
Actions via speech: dialog acts, NLU
Rigid silence-based turn-taking, system or “mixed” initiative
Actions via speech: dialog acts, NLU
Um… depends on dialog management, NLU
Rigid silence-based turn-taking, system or “mixed” initiative
Actions via speech: dialog acts, NLU
Um… depends on dialog management, NLU
Confirmation: implicit/explicit: learned? Corrections, repairs: problematic
Rigid silence-based turn-taking, system or “mixed” initiative
Actions via speech: dialog acts, NLU
Um… depends on dialog management, NLU
Confirmation: implicit/explicit: learned? Corrections, repairs: problematic
Finite-state Frame-based
VoiceXML
Information state Statistical dialog management
interaction more like human-human interaction Many issues raised in characterizing dialog:
Multi-party
interaction more like human-human interaction Many issues raised in characterizing dialog:
Multi-party: multi-party interaction, turn-taking, initiative Grounding
interaction more like human-human interaction Many issues raised in characterizing dialog:
Multi-party: multi-party interaction, turn-taking, initiative Grounding: Miscommunication & repair, incremental processing Interpretation:
interaction more like human-human interaction Many issues raised in characterizing dialog:
Multi-party: multi-party interaction, turn-taking, initiative Grounding: Miscommunication & repair, incremental processing Interpretation: Reference, affect, subjectivity, personification,
information structure, prosody
Multi-modality
Tutoring, machine translation, information-seeking Non-native speech
Sentiment Reference Persona Turn- taking Apps: MT Multi- party Prosody Tutoring Non- native Multi- modality Miscomm unication Info. Struct Increment Affect Initiative
Sentiment Reference Persona Turn- taking Apps: MT Multi- party Prosody Tutoring Non- native Multi- modality Miscomm unication Info. Struct Increment Affect Initiative
semantic, pragmatic, etc Multimodal: gaze, gesture, etc
Deep processing, shallow processing, manual rules
Anything from decision trees to POMDPs
Acoustic, lexical, prosodic, timing, syntactic, semantic,
pragmatic, etc Multimodal: gaze, gesture, etc
Integration: Complex and varied
Huge feature vectors, tandem models, blackboards, learned