A corpus based NLP chain for a web based Assisting Conversational - PowerPoint PPT Presentation

A corpus ‐ based NLP ‐ chain for a web ‐ based Assisting Conversational Agent Mao Xuetao, Jean ‐ Paul Sansonnet, François Bouchet LIMSI ‐ CNRS

Outline � Problem Can we use the chatbot architectures as a base � Assisting agents for the analysis and resolution of natural � Assisting agents for web applications and services language assisting requests in web applications � The genealogy of the DIVA toolkit and services? � A typical chatbot architecture � Advantages and drawbacks of the chatbot approach ― Yes, provided we improve drastically their precision and genericity. � Methodology � Methodology: a corpus ‐ based NLP ‐ chain � The linguistic domain of assisting questions Because the linguistic domain of the Function of � Methodology for the corpus collection Assistance is precise and concise, we can rely on � Excerpt from the sub ‐ corpus ‘Marco’ a corpus-based approach to exhibit the inherent � Assistance is a linguistic genre generic phenomena. � Implementation � DIVA NLP ‐ chain � DIVA semantic keys From the collected corpus we can extract: � DIVA formalization phase: ℜ ‐ rules ― A set of generic formalization rules; � DIVA topic files ― A set of generic semantic classes; � DIVA interpretation phase: ℑ ‐ rules ― A set of generic interpretation rules/classes. Conclusion Xuetao, Sansonnet, Bouchet ― LIMSI ‐ CNRS 2

Assisting agents « An Assisting Agent is a software tool with the capacity to resolve help requests, issuing from novice users, about the static structure and the dynamic functioning of software components or services » Project InterViews – February 1999 Following Patti Maes MIT, 1994 person with poor knowledge about the component (novice) User Request help demand in natural language (speech/text) computer application, web service, ambient appliance Component Agent rational, assistant, conversational, (can be embodied) symbolic model of the structure and the functioning Mediator Xuetao, Sansonnet, Bouchet ― LIMSI ‐ CNRS 3

Assisting agents for web applications & services Keys issues: How can we improve • The precision of the Function of Assistance? • The genericity Xuetao, Sansonnet, Bouchet ― LIMSI ‐ CNRS 4

The genealogy of the DIVA toolkit CHS ECA Chatbots DOM Contextual Help Embodied Natural Language Web Applications Systems Conversational Agents Processing and services Assistance Personification Dialogue Ubiquity ACA Webbots Assisting Ludo ‐ social Conversational Agents conversations Problem: How can we DIVA improve the precision and the D OM ‐ Integrated Virtual Agents genericity of the NLP ‐ chain in a web environment? Xuetao, Sansonnet, Bouchet ― LIMSI ‐ CNRS 5

A typical webbot architecture Single pass, rule based, filtering process no no no Filtering by “User natural Filtering by the Evasive Recall of a the generic language utterance ” specific layer List preceding topic layer yes yes yes yes Specific answer linked to the Generic Recall of a Evasive character of the agent answer previous topic answer or to the task of the application Dialogical Common ‐ sense Minimalistic dialogue Task handling: handling session handling Robustness precise but not generic Generic Customized Xuetao, Sansonnet, Bouchet ― LIMSI ‐ CNRS 6

ALICE’s AIML: a simple bot rule AIML is the format used in Wallace’s ALICE chatbot who won several times the Loebner prize. Here is a simple AIML rule (called an atomic category): <category> <pattern>WHAT IS A CIRCLE</pattern> <template> <set_it>a circle</set_it> is the set of points equidistant from a common point called the center. </template> </category> The above rule does the following: 1. Matches a user input like this one: “ Can you tell me what is a circle please?" 2. Sets the internal register "IT" to the value of "a circle" [minimalistic model of the session] 3. Sends the user the answer: " A circle is the set of points equidistant from a common point called the center." Xuetao, Sansonnet, Bouchet ― LIMSI ‐ CNRS 7

A typical finalized dialogue system Symbolic Customized User Application model of the Generic Modeling Lexicon application Reasoning tools about the Natural structure and the “User natural Formal Multimodal Language functioning of the symbolic language utterance” Request Semantic Reaction models of the application Analyzer and the users (tasks and plans handling) “I think that you speak too loud” ACTION: lower sound level “You speak too loud!” JUDGE user [TOOMUCH(system.sound.level)] SAY: “I speak lower now” “You make too much noise, my dear” “The sound level is very very high” USER: update preferences “You make my ears ache” Etc. Xuetao, Sansonnet, Bouchet ― LIMSI ‐ CNRS 8

Evaluating the linguistic capabilities of chatbots ― Wollermann, C. (2004). Evaluierung der linguistischen Fähigkeiten von Chatbots. Magister report, Rheinische ‐ Friedrich ‐ Wilhelms Universität Bonn. ― Wollermann, C. (2006). Proceedings of the Young Researchers' Roundtable on Spoken Dialogue Systems, 75 ‐ 76. Pittsburgh, PA, Sept 2006. “To what extent are chatbot systems able to analyze the users input on the semantic and pragmatic level?” � Evaluation methodology � Four main chatbots: ALICE, EllaZ, Elbot, ULTRA ‐ HAL ‐ ASSISTANT. � A collection of linguistic phenomena where evaluated qualitatively in the chatbot answers to users questions: ― Semantic: Semantic relations, Quantifiers, Anaphora. ― Pragmatic: Grice’s maxims. � Results � Semantic relations: ∅ but for EllaZ which relies on WordNet � Quantifiers: partly handled, in the four chatbots � Anaphora: ∅ � Grice’s maxims: ∅ (unaccountable in chatbots) BOTTOM LINE: A deeper semantic/pragmatic analysis is required for finalized/task ‐ oriented dialogue. QUESTION: Can we improve on the chatbot approach? Jean ‐ Paul Sansonnet ‐‐ LIMSI ‐ CNRS 9

Advantages and drawbacks of the chatbot approach � Advantages: easy, light, precise � They are easy to develop: no large semantic analyzer, no complex reasoning tools; � They are light to deploy in a web ‐ based environment � client architectures can be envisioned; � They provide robust natural language reactions (Evasive list effect – ELIZA effect); � They are tailored and well ‐ suited for the field of ludo ‐ social chat; � When associated with a given application, they can be customized to be extremely precise. � Drawbacks: lack of genericity � Minimalistic/ultra ‐ customized model of the application; � Minimalistic model of the dialogue session and of the users; � No semantic analyzer � lack of precision in the requests (grammar, speech acts, …); � No formal requests � class reactions are directly linked to specific linguistics patterns; � No generic reasoning tools, especially when the function of assistance is concerned. � Need recoding quite everything for each new application, � No reusability, no capitalization. Xuetao, Sansonnet, Bouchet ― LIMSI ‐ CNRS 10

Methodology: a corpus ‐ based NLP ‐ chain Collection of a corpus of natural language requests in assisting situations Base the genericity of the NLP-chain on Lexical Semantic Pragmatic phenomena occurring within the corpus classes keys classes Natural language Pragmatic handling Multimodal Formal syntactico-semantic “User utterance” heuristics Request Reaction analyzer [Chatbot layers] Form [Chatbot layers] Mixed NLP-chain: � Dialogue systems: Intermediate formal form � Chat bots: rule-layers for each phase “If I want to buy such a < QUEST IF THEUSER TOWANT TOOBTAIN such a Scenic, what can I do?” $THECAR WHAT TOCAN THEUSER TODO > < HOW TOOBTAIN $THECAR> Xuetao, Sansonnet, Bouchet ― LIMSI ‐ CNRS 11

The linguistic domain of assisting questions Key hypothesis: The quite restricted linguistic domain concerned makes it tractable: 1. to characterize the distributionality of the linguistic domain, 2. to build a robust semantic analyzer covering the users natural language requests. Text ‐ based Natural language as a ‘syntactical system’: approaches F. de Saussure [1905], N. Chomsky [1955] ‐ Syntax: formal grammars ‐ Lexical semantics Conversation Oral Dialogue: analysis J. L. Austin, J. Searle [1962, 1969] ‐ Speech Acts, ‐ Pragmatics. ‐ Chatbots Artificial Human ‐ Machine Dialogue Systems ‐ Dialogue systems T. Winograd, J. Allen [1972, 1994 ‐ ] Dialogical session Assisting Assisting Q&A systems Requests agents Information Processing Retrieval Systems Pairs of Q/A – no dialogue session Xuetao, Sansonnet, Bouchet ― LIMSI ‐ CNRS 12

A corpus based NLP chain for a web based Assisting Conversational - PowerPoint PPT Presentation

A corpus based NLP chain for a web based Assisting Conversational Agent Mao Xuetao, Jean Paul Sansonnet, Franois Bouchet LIMSI CNRS Outline Problem Can we use the chatbot architectures as a base Assisting agents for

SI485i : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

Corpus Stylistics: Speech, Writing and Thought Presentation in a Corpus of English Writing

The need for Corpus Statistics: Corpus analysis and the identification of linguistically relevant

NLP: Two pictures Wordnet and Word Sense Problem NLP Disambiguation Semantics NLP Trinity

Recurrent Neural Networks Graham Neubig Site https://phontron.com/class/nn4nlp2017/ NLP and

Dental Assisting Joel A. Silva, RDA, CDA Current Dental Assisting Program Dental Assistant Front

Generic Generic and Subjective and Subjective Assisting Assisting Conversational Conversational

Web Services Web Services Towards Web Services Towards Web Services Towards Web Services A

TrustedOut Corpus Intelligence Corpus Intelligence Makes Intelligence Trustworthy. Florent Solt,

MACAQ : A Multi Annotated Corpus to study how we adapt Answers to various Questions Anne

Ontologies for NLP NLP for Ontologies FOIS 2014 - LogOnto Workshop on Logics and Ontologies for

Study to Select Value Chain and Analyze Selected Value Chain Presentation on Value Chain

Study to Select Value Chain and Analyze Selected Value Chain Presentation on Value Chain

Study to Select Value Chain and Analyze Selected Value Chain Presentation on Value Chain Analysis

Study to Select Value Chain and Analyze Selected Value Chain Presentation on Value Chain

Cyber Tutor Cyber Tutor Alexei Sourin , Leon Ho Chiau Wai School of Computer Engineering,

Chatbots as active members of our society Proseminar Data Mining Luca Dombetzki Fakult fr

AI Dialog Google API Oleg Kov k Assignment Design conversational user interface for a

Machine Learning Survival Kit for Future Death of New Jobs Only 5.5 million jobs are created

Cross-connection Control for Wastewater Treatment Plants Department of Health Mission We work

The influence of an air gap on the response of an explosive to spigot impact . Bolaji Adesokan

SWIM Tool for operational improvement Air Space User Perspective Topics Current

Best Practices for HPA/HPW Construction Presented by: Rob Starr, Residential Training Manager,