

SLIDE 1
AI NTPU

  • Dr. Chih-Chien Wang
  • Dr. Min-Yuh Day
  • Mr. Wei-Jin Gao
  • Mr. Yen-Cheng Chiu
  • Ms. Chun-Lian Wu

National Taipei University / Tamkang University, Taipei, Taiwan

SLIDE 2

Overview

  • Retrieval-based Method: Solr search engine + similarity
  • Generative Model: short text generation + emotion classification model
  • Generative Model + General Purpose Response

SLIDE 3

Retrieval Based

Search for responses in the corpus.

SLIDE 4

[Pipeline diagram: Corpus → Pre-processing (text analysis, stop-word removal) → Index building → Index; New post → Σ score of reciprocal of term frequency → Cosine similarity analysis → Ranking → Results]

Overview of Retrieval-based Method

  • We used Solr to index the corpus.
  • Before indexing, we performed word segmentation, text analysis, and stop-word removal.
  • Then we built the Solr index (a sketch of this step follows).
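A minimal sketch of this indexing step, assuming a local Solr core named stc_corpus with post and comment fields; jieba and the tiny stop-word list are stand-ins for whatever segmenter and list the team actually used.

```python
import jieba
import pysolr

STOP_WORDS = {"的", "了", "是"}  # placeholder stop-word list

def preprocess(text: str) -> str:
    """Word segmentation + stop-word removal, joined back into a string."""
    tokens = [t for t in jieba.cut(text) if t.strip() and t not in STOP_WORDS]
    return " ".join(tokens)

solr = pysolr.Solr("http://localhost:8983/solr/stc_corpus", timeout=10)

def index_corpus(pairs):
    """Index (post, comment) pairs; the segmented post text is what gets searched."""
    docs = [
        {"id": str(i), "post": preprocess(post), "comment": comment}
        for i, (post, comment) in enumerate(pairs)
    ]
    solr.add(docs)
    solr.commit()
```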

SLIDE 5

[Pipeline diagram, as on SLIDE 4]

Retrieval-based Method: Searching with the new post

  • When a new post is provided, we search the Solr index to fetch potential candidate comments.
  • We use all terms (words) from the new post, one by one, to query Solr.
  • If a term appears in the post of a post-comment pair, we fetch the "comment" (rather than the post) as a potential candidate response.
  • We keep the first 500 search results (see the sketch below).
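A sketch of this fetching step: query Solr once per term and keep the first 500 comments overall. The core URL, field names, and preprocess() follow the indexing sketch above; single-valued fields are assumed.

```python
def fetch_candidates(new_post: str, limit: int = 500):
    terms = preprocess(new_post).split()
    candidates = []
    for term in terms:  # one query per term, as described on the slide
        for doc in solr.search(f'post:"{term}"', rows=limit):
            candidates.append(doc["comment"])  # keep the comment, not the post
            if len(candidates) >= limit:
                return candidates
    return candidates
```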

SLIDE 6

[Pipeline diagram, as on SLIDE 4]

Ranking the Results

  • We calculated the accumulated inverse term frequency.
  • We computed the cosine similarity between the new post and each candidate comment.
  • We multiplied the accumulated inverse term frequency by the cosine similarity to get the relevance score.
  • The candidate comment that matched the assigned emotion and had the highest relevance score was treated as the generated comment (a sketch follows).
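A sketch of this ranking step under stated assumptions: "accumulated inverse term frequency" is read here as the sum of 1/tf over post terms that occur in the candidate, the cosine-similarity vectors are raw term counts, and candidate_emotion() is a hypothetical hook into the emotion classifier.

```python
import math
from collections import Counter

def cosine_similarity(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * \
           math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def relevance(post_terms, candidate_terms) -> float:
    post_vec, cand_vec = Counter(post_terms), Counter(candidate_terms)
    # accumulated inverse term frequency (assumed reading: sum of 1/tf)
    inv_tf = sum(1.0 / cand_vec[t] for t in post_vec if cand_vec[t] > 0)
    return inv_tf * cosine_similarity(post_vec, cand_vec)

def best_comment(post_terms, candidates, assigned_emotion, candidate_emotion):
    # keep only candidates matching the assigned emotion, then rank
    matching = [c for c in candidates if candidate_emotion(c) == assigned_emotion]
    return max(matching,
               key=lambda c: relevance(post_terms, preprocess(c).split()),
               default=None)
```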

SLIDE 7

Evaluations | Retrieval-based Method

Evaluation Results

Run     Method      Label 0   Label 1   Label 2   Total   Overall score   Average score
RUN 1   Retrieval   716       200       84        1000    368             0.368
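These figures are consistent with a weighting of one point per Label 1 response and two points per Label 2 response (Label 0 scoring zero); this reading is inferred from the numbers rather than stated on the slide:

    overall score = 200 × 1 + 84 × 2 = 368
    average score = 368 / 1000 = 0.368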

SLIDE 8

Only three teams submitted runs for the retrieval-based method.

SLIDE 9

Weaknesses of our retrieval method

  • We used only the terms in the new post to search for results.
  • We should also have searched the corpus with terms of similar meaning; we did not apply semantic analysis before searching (a sketch of such an expansion step follows this list).
  • We did not consider the noise in the emotion classification. We realized the precision issue of the emotion categories only after receiving the evaluation results.
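Not a step the team implemented; a sketch of what the missing semantic expansion could look like, using pre-trained gensim word vectors as a stand-in (the vector file name is hypothetical):

```python
from gensim.models import KeyedVectors

wv = KeyedVectors.load("word_vectors.kv")  # hypothetical pre-trained vectors

def expand_terms(terms, topn=3):
    """Add semantically similar terms to the query before searching Solr."""
    expanded = list(terms)
    for t in terms:
        if t in wv:
            expanded += [w for w, _ in wv.most_similar(t, topn=topn)]
    return expanded
```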

SLIDE 10

Evaluations | Retrieval-based Method

Evaluation Results

Run     Method      Label 0   Label 1   Label 2   Total   Overall score   Average score
RUN 1   Retrieval   716       200       84        1000    368             0.368

Only 30% (84/284) of the responses had the correct emotion.

We realized the precision issue of the emotion categories only after receiving the evaluation results.

According to the organizers, the accuracy of emotion classification was 62% in their NLPCC paper; the actual accuracy may be lower than that.

SLIDE 11

Generative Approach

Automatically generate responses to questions

SLIDE 12

Generative Approach

  • Generative model: short response generation
  • Emotion classification model

SLIDE 13

Generative Models

Automatically generating responses in short text conversation: Seq2Seq may be a good idea.

We employed an attention-based sequence-to-sequence (Seq2Seq) network model for the generation-based approach.

SLIDE 14

Generative Models | Generation-based Method

Generate Short Responses to the Dialogue

Seq2Seq with an attention mechanism, using Long Short-Term Memory (LSTM) networks as encoder and decoder (a minimal sketch follows).
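A minimal tf.keras sketch of this architecture. The vocabulary size, embedding width, and hidden size are illustrative stand-ins, not the team's reported values, and tf.keras.layers.Attention (dot-product, Luong-style) stands in for whatever attention variant was used.

```python
import tensorflow as tf
from tensorflow.keras import layers

VOCAB, EMB, HIDDEN = 40_000, 128, 256  # illustrative sizes, not the team's

# Encoder: embeds the post and keeps per-step states for attention.
enc_in = layers.Input(shape=(None,), name="post_tokens")
enc_emb = layers.Embedding(VOCAB, EMB, mask_zero=True)(enc_in)
enc_out, state_h, state_c = layers.LSTM(
    HIDDEN, return_sequences=True, return_state=True)(enc_emb)

# Decoder: an LSTM initialized with the encoder's final states.
dec_in = layers.Input(shape=(None,), name="response_tokens")
dec_emb = layers.Embedding(VOCAB, EMB, mask_zero=True)(dec_in)
dec_out = layers.LSTM(HIDDEN, return_sequences=True)(
    dec_emb, initial_state=[state_h, state_c])

# Dot-product attention over encoder states, concatenated with decoder states.
context = layers.Attention()([dec_out, enc_out])
concat = layers.Concatenate()([dec_out, context])
logits = layers.Dense(VOCAB, activation="softmax")(concat)

model = tf.keras.Model([enc_in, dec_in], logits)
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
```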

SLIDE 15

Before training the model, we performed word segmentation, text analysis, and stop-word removal.

[Generative-approach pipeline diagram: Corpus → Pre-processing (text analysis, stop-word removal) → Generation model training → Well-trained Model (LSTM); New post → Candidate results → Cosine similarity analysis → Ranking → Filter (GPR) → Results; Emotion classifier model (MLP/GRU/LSTM/BiGRU/BiLSTM) trained via label indexing, one-hot encoding, and training]

Data Pre-processing

SLIDE 16

Then, we trained an attention-based sequence-to-sequence (Seq2Seq) network model, which uses Long Short-Term Memory (LSTM) networks as encoder and decoder, on the provided corpus.

[Generative-approach pipeline diagram, as on SLIDE 15]

Generative Model

SLIDE 17

We compared MLP, GRU, LSTM, BiGRU, and BiLSTM for developing the emotion classification model.

[Generative-approach pipeline diagram, as on SLIDE 15]

We performed pre-processing, label indexing, one-hot encoding, and training to build the emotion classification model (a sketch follows).

Emotion Classification
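A sketch of this pipeline under stated assumptions: the BiGRU settings (batch size 256, dropout 0.5, 15 epochs) come from the table on SLIDE 18, while the vocabulary size, sequence length, and number of emotion classes are illustrative.

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

VOCAB, NUM_CLASSES = 40_000, 6  # both assumed, not reported on the slides

def encode_labels(labels, classes):
    index = {c: i for i, c in enumerate(classes)}            # label indexing
    ids = np.array([index[l] for l in labels])
    return tf.keras.utils.to_categorical(ids, len(classes))  # one-hot encoding

# BiGRU classifier, the best-performing variant per SLIDE 18.
model = tf.keras.Sequential([
    layers.Embedding(VOCAB, 128),
    layers.Bidirectional(layers.GRU(128)),
    layers.Dropout(0.5),
    layers.Dense(NUM_CLASSES, activation="softmax"),
])
model.compile(optimizer="adam", loss="categorical_crossentropy",
              metrics=["accuracy"])
# model.fit(x_train, y_train, batch_size=256, epochs=15)
```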

SLIDE 18

Deep learning approaches for the emotion classification model

  • MLP, GRU, LSTM, BiGRU, and BiLSTM

Evaluation Results

DL model   Batch size   Dropout   Epochs   Accuracy   Loss
BiGRU      256          0.5       15       0.880      0.333
BiLSTM     256          0.4       10       0.879      0.335
LSTM       256          0.1       20       0.879      0.335
GRU        256          0.4       20       0.872      0.356
MLP        256          0.4       30       0.843      0.451

Evaluation of all deep learning approaches

SLIDE 19

[Confusion matrix for emotion classification; best method: BiGRU]

SLIDE 20

We computed the cosine similarity between the new post and the generated candidate comments. The candidate comment with the highest cosine similarity to the post was treated as the generated comment (see the sketch below).

[Generative-approach pipeline diagram, as on SLIDE 15]

Similarity
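A sketch of this selection step; TF-IDF vectors via scikit-learn are an assumption, since the slides say only "cosine similarity".

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def select_response(post: str, candidates: list[str]) -> str:
    """Return the generated candidate most similar to the new post."""
    vec = TfidfVectorizer(analyzer="word")
    matrix = vec.fit_transform([post] + candidates)
    sims = cosine_similarity(matrix[0:1], matrix[1:]).ravel()
    return candidates[int(sims.argmax())]
```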

SLIDE 21

Emotion classification

Model    Label 0   Label 1   Label 2   Total   Overall score   Average score
MLP      873       85        42        1000    169             0.169
GRU      855       69        76        1000    221             0.221
BiGRU    860       72        68        1000    208             0.208
LSTM     864       65        71        1000    207             0.207
BiLSTM   857       84        59        1000    202             0.202

Self-Evaluation Performance

Use MLP to automatically generate responses

SLIDE 22

Use MLP to automatically generate responses

Emotion classification

Model    Label 0   Label 1   Label 2   Total   Overall score   Average score
MLP      873       85        42        1000    169             0.169
GRU      855       69        76        1000    221             0.221
BiGRU    860       72        68        1000    208             0.208
LSTM     864       65        71        1000    207             0.207
BiLSTM   857       84        59        1000    202             0.202

Self-Evaluation Performance

The emotion precision rate was only around 50%.
SLIDE 23

General Purpose Response

Generate responses when we do not know how to answer the questions

SLIDE 24

We used General Purpose Responses (GPR) to improve the performance of the generative-based responses. About 1,500 general purpose responses were created. A generated comment is replaced by a GPR at the filter stage if the new post and the generated comment receive a low relevance score, computed by cosine similarity (below about 30%); a sketch follows.

[Generative-approach pipeline diagram, as on SLIDE 15]

General Purpose Responses
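A sketch of the filter stage described above: if the generated comment's cosine similarity to the post falls below roughly 0.3, fall back to a general purpose response. pick_gpr() and the emotion-keyed pool are hypothetical details; the slides only say that low-scoring comments are replaced by a GPR.

```python
import random

THRESHOLD = 0.3  # "about 30%" per the slide

def pick_gpr(gpr_pool, assigned_emotion):
    """Choose among the ~1,500 hand-built GPRs, preferring the assigned emotion."""
    pool = gpr_pool.get(assigned_emotion) or sum(gpr_pool.values(), [])
    return random.choice(pool)

def filter_response(post, generated, similarity_fn, gpr_pool, emotion):
    if similarity_fn(post, generated) < THRESHOLD:
        return pick_gpr(gpr_pool, emotion)
    return generated
```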

SLIDE 25

Use MLP plus GPR to automatically generate responses

Emotion classification

Model    Label 0   Label 1   Label 2   Total   Overall score   Average score
MLP      808       124       68        1000    260             0.260
GRU      756       77        167       1000    411             0.411
BiGRU    727       111       162       1000    435             0.435
LSTM     749       89        162       1000    413             0.413
BiLSTM   753       75        172       1000    419             0.419

MLP + General Purpose Responses

SLIDE 26

Automatically generated responses: with vs. without GPR

Emotion classification

Model    With GPR (avg. score)   Without GPR (avg. score)   Difference
MLP      0.260                   0.169                      +0.091
GRU      0.411                   0.221                      +0.190
BiGRU    0.435                   0.208                      +0.227
LSTM     0.413                   0.207                      +0.206
BiLSTM   0.419                   0.202                      +0.217

With or Without GPR

SLIDE 27

Overview of the Generative-based Method

[Generative-approach pipeline diagram, as on SLIDE 15]

SLIDE 28

Conclusion

Comparison between methods

  • The retrieval-based model performs better than the generative model.
  • However, different deep learning approaches for the emotion classification model yield different degrees of improvement.
  • Furthermore, using EGPR brings the generative model's performance closer to that of the retrieval-based model.

Evaluation of the emotion classification model

  • BiGRU > BiLSTM > LSTM > GRU > MLP
SLIDE 29

Future work

1. Conversation model
  • Use SeqGAN as the deep learning network for the generative model.
  • Try adding a topic layer between the encoder and decoder of the Seq2Seq architecture.

2. EGPR
  • Cover more general conditions to expand the EGPR dataset.

3. Emotion classification model
  • Use Bidirectional Encoder Representations from Transformers (BERT) to improve the performance of the emotion classification model.