Neural Networks for Negation Cue Detection in Chinese Hangfeng He 1 - PowerPoint PPT Presentation

Neural Networks for Negation Cue Detection in Chinese
Hangfeng He (1), Federico Fancellu (2) and Bonnie Webber (2)
(1) School of Electronics Engineering and Computer Science, Peking University
(2) ILCC, School of Informatics, University of Edinburgh
hangfenghe@pku.edu.cn, f.fancellu@sms.ed.ac.uk, bonnie@inf.ed.ac.uk

Outline
- Introduction
- Model
- Experiments
- Error Analysis
- Conclusion

The task
- Negation Cue Detection
  - Recognize the tokens (words, multi-word units or morphemes) that inherently express negation
  - A prerequisite for detecting negation scope
- An Example
  所有住客均表示不会追究酒店的这次管理失职。
  (All of the guests said that they would not pursue the hotel over this management failure.)
  Negation cue “不 (not)”: indicates that the clause is negative

Goal
- Previous Work
  - [Zou et al. 2015]
    - Sequential classifier
    - Lexical features (word n-grams)
    - Syntactic features (PoS n-grams)
    - Morphemic features (whether a character has appeared in training data as part of a cue)
    - Chinese-to-English word alignment

This work
- Question:
  - Can we detect negation cues without highly-engineered features?

Challenges
- Homographs (e.g. “非常 (very)” -> “非 (not)”).
- False negation cues (e.g. “非要 (can’t help)” -> “非 (not)”).
- High combinatory power of negation affixes (e.g. “够 (sufficient)” -> “不够 (insufficient)”).

Model
- Sequence Tagging
  - Given a sentence $ch = ch_1 \dots ch_n$, where $n$ is the number of characters. (We do not do word segmentation; the input is a sequence of characters.)
  - We represent each character $ch_i \in ch$ as a $d$-dimensional character embedding.
  - The goal of automatic cue detection is to predict a vector $s \in \{O, I\}^n$ such that $s_i = I$ if $ch_i$ is part of a cue and $s_i = O$ otherwise.
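As an illustration of the O/I labelling described above, here is a minimal Python sketch. The function name `tag_characters` and the representation of cues as (start, end) character offsets are assumptions made for this example; the slides do not specify a data format.

```python
from typing import List, Tuple


def tag_characters(sentence: str, cue_spans: List[Tuple[int, int]]) -> List[str]:
    """Return one label per character: "I" inside a cue span, "O" elsewhere.

    cue_spans holds hypothetical (start, end) character offsets, end exclusive.
    """
    labels = ["O"] * len(sentence)
    for start, end in cue_spans:
        for i in range(start, end):
            labels[i] = "I"
    return labels


# Example sentence from the slides; the cue is the single character 不.
sentence = "所有住客均表示不会追究酒店的这次管理失职。"
cue_start = sentence.index("不")
labels = tag_characters(sentence, [(cue_start, cue_start + 1)])
```

Because the input is never segmented, a multi-character cue is simply a run of consecutive "I" labels.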

Character-Based BiLSTM Neural Network

Transition Probability
- The predictions made for individual characters are independent of each other
- A new joint model
  - Add a 4-parameter transition matrix so that each prediction depends on the previous label $s_{i-1}$.
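The slide does not say how the transition scores are combined with the per-character scores at prediction time. One common option, assumed here, is Viterbi decoding over the two labels with a 2x2 (four-parameter) transition matrix. In the sketch below, `emissions` stands in for the per-character scores a BiLSTM would produce; the function name and all numbers are invented for illustration.

```python
from typing import List, Sequence

LABELS = ("O", "I")


def viterbi(emissions: Sequence[Sequence[float]],
            transitions: Sequence[Sequence[float]]) -> List[str]:
    """Find the highest-scoring label sequence.

    emissions[t][j]   : score of label j at position t (hypothetical BiLSTM output)
    transitions[i][j] : score of moving from label i to label j (4 parameters)
    """
    if not emissions:
        return []
    n = len(LABELS)
    score = list(emissions[0])
    backpointers = []
    for t in range(1, len(emissions)):
        new_score, pointers = [], []
        for j in range(n):
            # Best previous label for ending in label j at position t.
            best_i = max(range(n), key=lambda i: score[i] + transitions[i][j])
            new_score.append(score[best_i] + transitions[best_i][j] + emissions[t][j])
            pointers.append(best_i)
        score = new_score
        backpointers.append(pointers)
    # Trace the best path back from the final position.
    best = max(range(n), key=lambda j: score[j])
    path = [best]
    for pointers in reversed(backpointers):
        best = pointers[best]
        path.append(best)
    path.reverse()
    return [LABELS[j] for j in path]


emissions = [[2.0, 0.0], [0.0, 1.0], [0.4, 0.5], [2.0, 0.0]]
no_dependency = [[0.0, 0.0], [0.0, 0.0]]
penalise_i_to_i = [[0.0, 0.0], [0.0, -5.0]]
independent_path = viterbi(emissions, no_dependency)
joint_path = viterbi(emissions, penalise_i_to_i)
```

With an all-zero matrix the result is the per-character argmax; with a penalty on I-to-I transitions the weakly scored third character flips from I to O, which is the kind of dependency on the previous label the joint model introduces.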

Experiments
- Data
  - Chinese Negation and Speculation (CNeSp) corpus [Zou et al., 2015]
  - CNeSp is divided into three sub-corpora: product reviews (product), financial articles (financial) and scientific literature (scientific).
  - [Zou et al. 2015] used 10-fold cross-validation; we instead use a fixed 70%/15%/15% split of these sub-corpora so that there is a fixed development set for error analysis.
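A fixed 70%/15%/15% split can be produced in many ways; the slides do not say whether the sentences were shuffled or taken in corpus order. The sketch below assumes a seeded shuffle and reads the three parts as training, development and test sets. The function name `fixed_split` and the seed are hypothetical.

```python
import random
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def fixed_split(items: Sequence[T], seed: int = 0) -> Tuple[List[T], List[T], List[T]]:
    """Shuffle with a fixed seed, then cut into 70% / 15% / remainder."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    # Integer arithmetic avoids floating-point rounding at the boundaries.
    n_train = len(shuffled) * 70 // 100
    n_dev = len(shuffled) * 15 // 100
    train = shuffled[:n_train]
    dev = shuffled[n_train:n_train + n_dev]
    test = shuffled[n_train + n_dev:]
    return train, dev, test


train, dev, test = fixed_split(range(100))
```

Because the seed is fixed, the same development set is returned on every run, which is what makes a stable error analysis possible.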

Baselines
- Negation cues in training data
  - Such as “不 (not)”, “非 (not)”...
- An Example
  - Ground truth: …, 受经济不景气影响 ,… (…, influenced by the economic depression, …)
  - Baseline-Char: …, 受经济不景气影响 ,…
  - Baseline-Word: …, 受经济不景气影响 ,… (segment first)
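The slides describe the baselines only briefly. One plausible reading, sketched below, is a lexicon lookup over cues seen in training: Baseline-Char matches cue strings directly in the character sequence, while Baseline-Word segments first and matches whole words. The function names, the tiny lexicon and the segmentation shown are assumptions for illustration.

```python
from typing import Iterable, List


def baseline_char(sentence: str, cue_lexicon: Iterable[str]) -> List[str]:
    """Label every occurrence of a known cue string, ignoring word boundaries."""
    labels = ["O"] * len(sentence)
    for cue in cue_lexicon:
        if not cue:
            continue
        start = sentence.find(cue)
        while start != -1:
            for i in range(start, start + len(cue)):
                labels[i] = "I"
            start = sentence.find(cue, start + 1)
    return labels


def baseline_word(words: List[str], cue_lexicon: Iterable[str]) -> List[str]:
    """Label a word's characters only if the whole word is a known cue."""
    lexicon = set(cue_lexicon)
    labels: List[str] = []
    for word in words:
        tag = "I" if word in lexicon else "O"
        labels.extend([tag] * len(word))
    return labels


lexicon = {"不", "非"}  # hypothetical cues collected from training data
char_labels = baseline_char("受经济不景气影响", lexicon)
# Hypothetical segmentation; a real segmenter may split differently.
word_labels = baseline_word(["受", "经济", "不景气", "影响"], lexicon)
homograph_labels = baseline_char("非常好", lexicon)
```

The last call shows why such a lookup is fragile: the character 非 inside 非常 (very) is labelled as a cue even though it is not one there.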

Results
[Bar chart: precision, recall and F1 on the financial and product sub-corpora for Zou et al. (2015), Baseline-Char and BiLSTM+Transition]

Results
[Bar chart: precision, recall and F1 on the scientific sub-corpus for Zou et al. (2015), Baseline-Char and BiLSTM+Transition]
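The charts report precision, recall and F1, but the slides do not state the matching criterion. The sketch below assumes exact-match scoring over cue spans (maximal runs of "I" labels); the function names are hypothetical and the toy labels are not results from the paper.

```python
from typing import List, Set, Tuple


def cue_spans(labels: List[str]) -> Set[Tuple[int, int]]:
    """Collect maximal runs of "I" as (start, end) offsets, end exclusive."""
    found, start = set(), None
    for i, label in enumerate(labels):
        if label == "I" and start is None:
            start = i
        elif label != "I" and start is not None:
            found.add((start, i))
            start = None
    if start is not None:
        found.add((start, len(labels)))
    return found


def precision_recall_f1(gold: List[str], predicted: List[str]) -> Tuple[float, float, float]:
    """Score predicted cue spans against gold spans by exact match."""
    gold_spans, pred_spans = cue_spans(gold), cue_spans(predicted)
    correct = len(gold_spans & pred_spans)
    precision = correct / len(pred_spans) if pred_spans else 0.0
    recall = correct / len(gold_spans) if gold_spans else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


gold = list("OOOIIIOO")
under_predicted = list("OOOIOOOO")  # only part of the gold cue is found
over_predicted = list("OIOIIIOO")   # gold cue found, plus one spurious cue
```

Under exact matching, a partially detected cue counts as both a false positive and a false negative, so under-prediction lowers precision as well as recall.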

Financial Articles
- Error
  - Most of the errors are under-prediction errors.
- An Example
  …, 受经济不景气影响 ,… (…, influenced by the economic depression, …)

Financial Articles
- Method
  - We first use the NLPIR toolkit to segment the sentence; if a detected cue is part of a word, the whole word is considered the cue.
- Improvement
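The post-processing step above can be sketched as follows. The segmentation is passed in as a plain list of words so that no particular segmenter API is assumed; the function name `expand_to_words` and the example segmentation are hypothetical.

```python
from typing import List


def expand_to_words(words: List[str], labels: List[str]) -> List[str]:
    """If any character of a word is labelled "I", label the whole word "I"."""
    if sum(len(word) for word in words) != len(labels):
        raise ValueError("segmentation and labels cover different lengths")
    expanded: List[str] = []
    position = 0
    for word in words:
        chunk = labels[position:position + len(word)]
        tag = "I" if "I" in chunk else "O"
        expanded.extend([tag] * len(word))
        position += len(word)
    return expanded


# Hypothetical segmentation of 受经济不景气影响; only 不 was predicted as a cue.
words = ["受", "经济", "不景气", "影响"]
predicted = list("OOOIOOOO")
expanded = expand_to_words(words, predicted)
```

Here the single-character prediction 不 grows to cover the whole word 不景气, which addresses under-prediction when the annotated cue is the full word. The same rule would also widen a correct single-character cue that happens to sit inside a longer word, so it depends on the segmenter's choices.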

Product Reviews
- Error
  - Our models predict more negation cues than the gold annotation contains. These errors concern the most frequent negation cues, such as “不 (not)” and “没 (not)”.
- An Example
  房间设施一般，网速不仅慢还经常断网。
  (The room facilities are average, and the network is not only slow but also often disconnects.)

Conclusions
- We confirm that character-based neural networks can perform on par with or better than previous highly-engineered sequence classifiers.
- Future Work
  - Given the positive results obtained for Chinese, future work should test the method on other languages as well.

Thank you! Any questions?
