SLIDE 1
Dave Toney, Korin Richmond Toney, Korin Richmond Dave David - - PowerPoint PPT Presentation
Dave Toney, Korin Richmond Toney, Korin Richmond Dave David - - PowerPoint PPT Presentation
Dave Toney, Korin Richmond Toney, Korin Richmond Dave David Feinberg David Feinberg Edinburgh University Edinburgh University St Andrews University St Andrews University travel/traffic info directions weather restaurant theatre/concerts
SLIDE 2
SLIDE 3
gender gender age age accent accent emotion emotion (location) (location) improved recognition improved recognition consumer preferences consumer preferences interface complexity interface complexity disposable income disposable income safety safety
SLIDE 4
S: S: Welcome to Welcome to the Glasgow automated tourist information line. the Glasgow automated tourist information line. How can I help you? How can I help you? U: U: Are there any museums nearby? Are there any museums nearby? S: S: The Gallery of Modern Art is closest to you. Admission is free. The Gallery of Modern Art is closest to you. Admission is free. U: U: Are there any plays on tonight? Are there any plays on tonight? S S: : Yes, Macbeth is showing at 6pm in Botanic Park. Yes, Macbeth is showing at 6pm in Botanic Park. You should probably get a taxi or use the underground. You should probably get a taxi or use the underground. U: U: OK, what s the weather like this evening? OK, what s the weather like this evening? S S: : Initially fine, with rain expected later. Initially fine, with rain expected later. U: U: Oh, right. Anything else on? Oh, right. Anything else on?
SLIDE 5
pitch, jitter shimmer harmonics-to-noise ratio
SLIDE 6
microphone mobile
SLIDE 7
ACOUSTI C FEATURE EXTRACTOR NEURAL NETWORK
SPEECH DATABASE
SLIDE 8
ACOUSTI C FEATURE EXTRACTOR NEURAL NETWORK
.wav gender, age
SLIDE 9
CTIMIT: CTIMIT: 3303 recordings, 621 speakers 3303 recordings, 621 speakers 429 male, 192 female 429 male, 192 female age 21 age 21 -
- 55
55 Acoustic Feature Extractor: Acoustic Feature Extractor: Praat Praat Neural Network: Neural Network: Netlab (Matlab) Netlab (Matlab)
SLIDE 10
Gender Gender -
- 94.4% accuracy
94.4% accuracy Age Age -
- no useful learning, possible reasons:
no useful learning, possible reasons: little variation little variation background noise background noise not enough data not enough data
SLIDE 11
Gender Gender -
- 94.4% accuracy
94.4% accuracy Age Age -
- no useful learning, possible reasons:
no useful learning, possible reasons: little variation little variation background noise background noise not enough data not enough data
MACROPHONE (land-line)
SLIDE 12
Conversational interfaces Conversational interfaces Personalizing interaction very worthwhile Personalizing interaction very worthwhile Acoustic cues to user profile Acoustic cues to user profile Integrated profiling module Integrated profiling module
SLIDE 13