www.amiproject.org
German Research Center for Artificial Intelligence GmbH
Automatic Disfluency Automatic Disfluency Detection in Multi-party - - PowerPoint PPT Presentation
German Research Center for Artificial Intelligence GmbH Automatic Disfluency Automatic Disfluency Detection in Multi-party Detection in Multi-party www.amiproject.org Conversations Conversations Feast, 30th September 2009 , 30th September
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 2
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 3
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 4
German Research Center for Artificial Intelligence GmbH
disfluent speech
Sebastian Germesin September 09 5
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 6
German Research Center for Artificial Intelligence GmbH
“The cat uh the dog sneaks around the corner.”
Sebastian Germesin September 09 7
German Research Center for Artificial Intelligence GmbH
“The cat uh the dog sneaks around the corner.”
Sebastian Germesin September 09 8
German Research Center for Artificial Intelligence GmbH
“The cat uh the dog sneaks around the corner.”
Sebastian Germesin September 09 9
German Research Center for Artificial Intelligence GmbH
“The cat uh the dog sneaks around the corner.”
complex
Sebastian Germesin September 09 10
German Research Center for Artificial Intelligence GmbH
“The d dog sneaks around the corner.”
simple
Sebastian Germesin September 09 11
German Research Center for Artificial Intelligence GmbH
Simple disfluencies
Sebastian Germesin September 09 12
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 13
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 14
German Research Center for Artificial Intelligence GmbH
: “The cat the cat plays “
this structure is very common in natural language
: “The dog the cat and the bird play”
: “The dog the cat and“
: “The plays cat”
Sebastian Germesin September 09 15
German Research Center for Artificial Intelligence GmbH
→Yes!
→ Use modules for subsets of disfluencies → Use different feature-sets for each module (depending on the disfluency types) → Find “optimal” classifier for each module
Sebastian Germesin September 09 16
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 17
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 18
German Research Center for Artificial Intelligence GmbH
1.Use greedy hill-climbing
– Use weight for errors to improve Precision!
2.Reduce classifier library
– Take 10% results in maximal performance loss
Sebastian Germesin September 09 19
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 20
German Research Center for Artificial Intelligence GmbH
1.Use greedy hill-climbing
– Use weight for errors to improve Precision!
2.Reduce classifier library
– Take 10% results in maximal performance loss
Sebastian Germesin September 09 21
German Research Center for Artificial Intelligence GmbH
Best: J48 "-L -U -M 2 -A"
Sebastian Germesin September 09 22
German Research Center for Artificial Intelligence GmbH
93.5 % 94.5 % 12 m. 94.7 % 95.1 % 6 m. 33 m. new 0.11 94.8 % 95.3 % 6 m. 22 m. 0.42 90.5 % 92.9 % 6 m. 22 m.
83.3 % 88.6 % 12 m. 0.00 85.7 % 90.3 % 6 m.
RT-factor
Accuracy Eval. data Train. data System
Sebastian Germesin September 09 23
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 24
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 25
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 26
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 27
German Research Center for Artificial Intelligence GmbH
Sebastian Germesin September 09 28
German Research Center for Artificial Intelligence GmbH
Used technologies WEKA toolkit for machine learning Maximum Entropy classifier from Stanford NLP group CRF Tagger from http://crftagger.sourceforge.net/ Features for machine learning:
Lexical:
words, lexical parallelism, (POS-Tags)
Prosodic: duration, pauses, pitch, energy Dynamic:
disfluency types of surrounding words
Speaker:
age, role in meeting, native language