using natural language processing and machine learning to
play

Using Natural Language Processing and Machine Learning to Assist - PowerPoint PPT Presentation

Using Natural Language Processing and Machine Learning to Assist First-Level Customer Support for Contract Management Master thesis Final presentation Michael Legenc Advisor: Daniel Braun Munich, 08.01.2018 Software Engineering


  1. Using Natural Language Processing and Machine Learning to Assist First-Level Customer Support for Contract Management Master thesis – Final presentation Michael Legenc Advisor: Daniel Braun Munich, 08.01.2018 Software Engineering betrieblicher Informationssysteme (sebis) Fakultät für Informatik Technische Universität München wwwmatthes.in.tum.de

  2. Introduction § Scalability issues of email customer supports. § Acceleration by automation and assistance features. Hurdle: Free text . Machine Learning and Natural Language Processing. 2 Master thesis – Final presentation – Michael Legenc

  3. {} API: Classification, Automation, JSON Information Extraction Assistance Contract team High priority Cancellation Demo Cancel contract 3 Master thesis – Final presentation – Michael Legenc

  4. Demo 4 Master thesis – Final presentation – Michael Legenc

  5. Classification: Machine Learning New email Training set Preprocessing, Vectorization Old, labeled emails Supervised machine learning Predicted classification Master thesis – Final presentation – Michael Legenc 5

  6. Training set creation § Dataset: § 18 Mio. unlabeled Emails. § Filtering § Retrievable by an API. § Caching § Command Line Tool: Demo. 6 Master thesis – Final presentation – Michael Legenc

  7. Preprocessing, Vectorization § Vectorization: Tf-idf 7 Master thesis – Final presentation – Michael Legenc

  8. Classification: Evaluation § Final configuration: § Stochastic gradient descent § Tf-idf thresholds: max. 0.4, min. 0.001 8 Master thesis – Final presentation – Michael Legenc

  9. Information extraction Implemented types: § Person § Date § Time § Postcode, City § Money § Vendor § Order-, account- and contractnumber § 9 Master thesis – Final presentation – Michael Legenc

  10. Information extraction Ø Non-ML: Regex, keyword lists and rule-based. 10 Master thesis – Final presentation – Michael Legenc

  11. Information extraction – Machine Learning § Not much support. § Learns from the sentence context. Ø Unknown or misspelled words are recognized by their context. § Training set: § Needs massive input. Public data is not modifiable. § Creation supported by Command Line Tool and special file format. 11 Master thesis – Final presentation – Michael Legenc

  12. Evaluation § Used training set: Created by non-ml approach. § 1000 emails. 3361 entities. § Test set: Manually created. Stanford NER: Non-ML: 12 Master thesis – Final presentation – Michael Legenc

  13. Future work § Transparent training. § Topic segmentation. § Better understanding of coherences. § Automation and assistance features. 13 Master thesis – Final presentation – Michael Legenc

  14. Conclusion Email customer support benefits from automation and assistance: § Time, thus cost saving. § Increased employee and customer satisfaction. § Free text accessibility by ML and NLP . § Selection of algorithms, parameters and preprocessors depends § on data set and concrete application: Interchangeable toolset approach. § Evaluation-based selection. § 14 Master thesis – Final presentation – Michael Legenc

  15. Thank you 15 Master thesis – Final presentation – Michael Legenc

  16. 16 Master thesis – Final presentation – Michael Legenc

  17. 17 Master thesis – Final presentation – Michael Legenc

  18. 18 Master thesis – Final presentation – Michael Legenc

  19. Current customer support team organization at check24´s gas and electricity department Second-level support .. Long-time employees. Requesting if and only if the problem is too complex. Very exceptional. First-level support .. .. Trying to solve all kinds of problems. Incoming email gas@check24.de or strom@check24.de Master thesis – Final presentation – Michael Legenc 19

  20. Current email preprocessing Storage MySQL Server Long-time storing of all incoming and outgoing emails with the determined folder and customer mappings. Customer detection PHP-Server Mapping mails to customers and simple folders like gas/electricity (unseen) inbox/spam. Spam detection Microsoft exchange server Incoming email gas@check24.de or strom@check24.de Master thesis – Final presentation – Michael Legenc 20

  21. Software used by the customer support Webmail: Superficial mail inspection. Customer detection PHP-Server Mapping mails to customers and simple folders like gas/electricity (unseen) inbox/spam. Spam detection Microsoft exchange server Button (e.g. in the red box) leads to a more sophisticated solution. Master thesis – Final presentation – Michael Legenc 21

  22. Software used by the customer support Proprietary software: View mail with further information and processing features. Customer detection PHP-Server Mapping mails to customers and simple folders like gas/electricity (unseen) inbox/spam. Spam detection Microsoft exchange server Button (e.g. in the red box) leads to a more sophisticated solution. Master thesis – Final presentation – Michael Legenc 22

  23. Contribution Toolset: Classification, Information Extraction § Command Line Tool: § Training set creation § Inspection § Optimization § Evaluation § API: Enables integration § 23 Master thesis – Final presentation – Michael Legenc

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend