automatic disfluency automatic disfluency detection in
play

Automatic Disfluency Automatic Disfluency Detection in Multi-party - PowerPoint PPT Presentation

German Research Center for Artificial Intelligence GmbH Automatic Disfluency Automatic Disfluency Detection in Multi-party Detection in Multi-party www.amiproject.org Conversations Conversations Feast, 30th September 2009 , 30th September


  1. German Research Center for Artificial Intelligence GmbH Automatic Disfluency Automatic Disfluency Detection in Multi-party Detection in Multi-party www.amiproject.org Conversations Conversations Feast, 30th September 2009 , 30th September 2009 Feast Sebastian Germesin Germesin Sebastian

  2. Outline Outline • Motivation • Theoretical Background www.amiproject.org • Data (AMI Corpus) • Disfluency Detection System • Hybrid Classification Approach • Self-arranging Modules • Experimental Results • Conclusions & Outlook German Research Center for Artificial Intelligence GmbH 2 Sebastian Germesin September 09

  3. Motivation Motivation Example Example www.amiproject.org German Research Center for Artificial Intelligence GmbH 3 Sebastian Germesin September 09

  4. Motivation Motivation • Have to detect (and clean) disfluencies in the transcribed speech www.amiproject.org • Readability • Transcription • Extractive Summarization • Post-Processing • NLP-systems’ performance drop when faced with disfluent speech • Human detector? • Too expensive! • Too slow! ⇒ Automatic Detection System! German Research Center for Artificial Intelligence GmbH 4 Sebastian Germesin September 09

  5. Theoretical Background Theoretical Background Definition Definition www.amiproject.org “Disfluencies are syntactical and grammatical [speech] errors that occur in spoken but not in written language.” [Besser, 2006] German Research Center for Artificial Intelligence GmbH 5 Sebastian Germesin September 09

  6. Theoretical Background Theoretical Background Terminology Terminology www.amiproject.org “The cat uh the dog sneaks around the corner.” German Research Center for Artificial Intelligence GmbH 6 Sebastian Germesin September 09

  7. Theoretical Background Theoretical Background Terminology Terminology www.amiproject.org “The cat uh the dog sneaks around the corner.” Reparandum German Research Center for Artificial Intelligence GmbH 7 Sebastian Germesin September 09

  8. Theoretical Background Theoretical Background Terminology Terminology www.amiproject.org Interregnum “The cat uh the dog sneaks around the corner.” Reparandum German Research Center for Artificial Intelligence GmbH 8 Sebastian Germesin September 09

  9. Theoretical Background Theoretical Background Terminology Terminology www.amiproject.org Interregnum complex “The cat uh the dog sneaks around the corner.” Reparandum Reparans German Research Center for Artificial Intelligence GmbH 9 Sebastian Germesin September 09

  10. Theoretical Background Theoretical Background Terminology Terminology www.amiproject.org simple “The d dog sneaks around the corner.” Reparandum German Research Center for Artificial Intelligence GmbH 10 Sebastian Germesin September 09

  11. Theoretical Background Theoretical Background All Types All Types www.amiproject.org Simple disfluencies German Research Center for Artificial Intelligence GmbH 11 Sebastian Germesin September 09

  12. Data Data quantitative quantitative • AMI meeting corpus www.amiproject.org • 135 meetings (~ 100 hours speech) • 4 participants • task: design a remote control • freely interaction • Many annotations, e.g.: • Transcribed speech • Dialogue acts • Gestures • ... German Research Center for Artificial Intelligence GmbH 12 Sebastian Germesin September 09

  13. Data Data quantitative quantitative • 45 meeting enriched with disfluency www.amiproject.org annotation • 31,000 Disfluencies • 15.8% erroneous words • 41.5% disfluent Dialogue Acts • 80% (33) for training • 20% (12) for evaluation German Research Center for Artificial Intelligence GmbH 13 Sebastian Germesin September 09

  14. Data Data qualitative qualitative • Discovered a heterogeneity towards the www.amiproject.org strictness of different disfluency types 1. Some disfluencies have strict structure • ex.: Repetition : “The cat the cat plays “ 2. Some other disfluencies have also strict structure but this structure is very common in natural language • ex.: Replacement : “The dog the cat plays“ • ex.: Fluent : “The dog the cat and the bird play” 3. Some other disfluencies have no obvious structure • ex.: Disruptions : “The dog the cat and“ • ex.: Order : “The plays cat” German Research Center for Artificial Intelligence GmbH 14 Sebastian Germesin September 09

  15. Automatic System Automatic System Design Question Design Question www.amiproject.org • Can we leverage the heterogeneity of disfluencies for their detection? → Yes! → Use modules for subsets of disfluencies → Use different feature-sets for each module (depending on the disfluency types) → Find “optimal” classifier for each module German Research Center for Artificial Intelligence GmbH 15 Sebastian Germesin September 09

  16. Automatic System Automatic System Hybrid Modules Hybrid Modules • SHS: www.amiproject.org • Stuttering, Hesitation, Slip-of-the-Tongue • REP: • Repetition • DNE: • Discourse Marker, Explicit Editing Term • DEL: • Deletion • REV: • Insertion, Replacement, Restart, Other German Research Center for Artificial Intelligence GmbH 16 Sebastian Germesin September 09

  17. How to combine the modules? combine the modules? How to www.amiproject.org German Research Center for Artificial Intelligence GmbH 17 Sebastian Germesin September 09

  18. Training Process Training Process Self-arranging Modules Self-arranging Modules • Immense search space www.amiproject.org • #( modules ) * #( classifier ) * placeInSystem • Solution(s): • Old system: • Choosen manually • Current system: • Automatically trained 1.Use greedy hill-climbing – Use weight for errors to improve Precision! 2.Reduce classifier library – Take 10% results in maximal performance loss of 2.3% (depending on the module) German Research Center for Artificial Intelligence GmbH 18 Sebastian Germesin September 09

  19. GroDi GroDi Greedy Hill-Climbing Greedy Hill-Climbing www.amiproject.org German Research Center for Artificial Intelligence GmbH 19 Sebastian Germesin September 09

  20. Training Process Training Process Self-arranging Modules Self-arranging Modules • Immense search space www.amiproject.org • #( modules ) * #( classifier ) * placeInSystem • Solution(s): • Old system: • Choosen manually • Current system: • Automatically trained 1.Use greedy hill-climbing – Use weight for errors to improve Precision! 2.Reduce classifier library – Take 10% results in maximal performance loss of 2.3% (depending on the module) German Research Center for Artificial Intelligence GmbH 20 Sebastian Germesin September 09

  21. GroDi GroDi Performance-Curve of J48 Performance-Curve of J48 www.amiproject.org Best : J48 "-L -U -M 2 -A" German Research Center for Artificial Intelligence GmbH 21 Sebastian Germesin September 09

  22. Experimental Results Experimental Results www.amiproject.org Train. Eval. System Accuracy avg. F1 RT-factor data data 6 m. 90.3 % 85.7 % baseline -- 0.00 12 m. 88.6 % 83.3 % old 22 m. 6 m. 92.9 % 90.5 % 0.42 22 m. 6 m. 95.3 % 94.8 % 6 m. 95.1 % 94.7 % new 0.11 33 m. 12 m. 94.5 % 93.5 % German Research Center for Artificial Intelligence GmbH 22 Sebastian Germesin September 09

  23. Conclusions Conclusions • Aims: • Development of a system that automatically www.amiproject.org detects a broad set of disfluencies • Fully automatic learning process • Robust and Fast • Achievements: • Stand-alone tool for detection of disfluencies: GroDi - Get rid of Disfluencies • Self-arranging modules • Detection rate: 95% Accuracy • Real-time factor of 0.11 German Research Center for Artificial Intelligence GmbH 23 Sebastian Germesin September 09

  24. Outlook Outlook www.amiproject.org • Develop module(s) for the detection of Mistake, Order, Omission • Embed other learning approaches, e.g.: • Conditional Random Fields • HMMs • Use other corpus like, e.g., Switchboard German Research Center for Artificial Intelligence GmbH 24 Sebastian Germesin September 09

  25. Thank you! Thank you! www.amiproject.org German Research Center for Artificial Intelligence GmbH 25 Sebastian Germesin September 09

  26. Demo? Demo? www.amiproject.org German Research Center for Artificial Intelligence GmbH 26 Sebastian Germesin September 09

  27. GroDi GroDi Diff. Module Arrangements Diff. Module Arrangements www.amiproject.org German Research Center for Artificial Intelligence GmbH 27 Sebastian Germesin September 09

  28. GroDi GroDi www.amiproject.org  Used technologies  WEKA toolkit for machine learning  Maximum Entropy classifier from Stanford NLP group  CRF Tagger from http://crftagger.sourceforge.net/  Features for machine learning:  Lexical: words, lexical parallelism, (POS-Tags)  Prosodic: duration, pauses, pitch, energy  Dynamic: disfluency types of surrounding words  Speaker: age, role in meeting, native language German Research Center for Artificial Intelligence GmbH 28 Sebastian Germesin September 09

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend