Learning to Recognize Discontiguous Entities
Aldrian Obaja Muis and Wei Lu
Singapore University of Technology and Design
aldrian muis@sutd.edu.sg / luwei@sutd.edu.sg

Outline: Introduction, Our Model, Experiments, Ambiguity, Conclusion, Appendix
Introduction
Previous Works in Entity Recognition

Assuming non-overlapping and contiguous entities:
- Mostly using the BIO/BILOU tagset

Allowing overlaps/nesting, but still assuming contiguous entities:
1. Tag n-grams instead of words (Byrne 2007) [1]
2. Tag in multiple layers (Alex, Haddow, and Grover 2007) [2]
3. Treat it as a parsing task (Finkel and Manning 2009) [3]
4. Use a mention hypergraph (Lu and Roth 2015) [4]

How about discontiguous entities?

[1] Kate Byrne (2007). “Nested Named Entity Recognition in Historical Archive Text”. In: IEEE ICSC 2007. IEEE Computer Society, pp. 589–596.
[2] Beatrice Alex, Barry Haddow, and Claire Grover (2007). “Recognising Nested Named Entities in Biomedical Text”. In: BioNLP Workshop 2007, pp. 65–72.
[3] Jenny Rose Finkel and Christopher D. Manning (2009). “Nested named entity recognition”. In: Proc. of EMNLP 2009. Vol. 1, pp. 141–150.
[4] Wei Lu and Dan Roth (2015). “Joint Mention Extraction and Classification with Mention Hypergraphs”. In: Proc. of EMNLP 2015, pp. 857–867.
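As an illustrative sketch (not code from any of the cited works), the standard BIO tagset marks each token as Beginning, Inside, or Outside an entity. This works only for contiguous, non-overlapping spans: there is no tag that links two separated spans into one entity.

```python
# Illustrative sketch of BIO encoding for contiguous, non-overlapping entities.
# Spans are (start, end) token-index pairs with an exclusive end.
def bio_encode(tokens, spans):
    tags = ["O"] * len(tokens)
    for start, end in spans:
        tags[start] = "B"          # first token of the entity
        for i in range(start + 1, end):
            tags[i] = "I"          # continuation tokens
    return tags

tokens = ["EGD", "showed", "hiatal", "hernia"]
print(bio_encode(tokens, [(2, 4)]))  # ['O', 'O', 'B', 'I']
```

A discontiguous entity such as "laceration ... esophagus" cannot be expressed here: tagging both spans with B/I would decode as two separate entities.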
Discontiguous Entity Recognition

Definition: a task to recognize entities in text, where the entities can be discontiguous (and possibly overlapping with each other).

Examples from SemEval 2014 Task 7 (Analysis of Clinical Text):

“EGD showed hiatal hernia and vertical laceration in distal esophagus with blood in stomach and overlying lac.”
1. hiatal hernia
2. laceration ... esophagus
3. blood in stomach
4. stomach ... lac

“Infarctions either water shed or embolic”
1. Infarctions
2. Infarctions ... water shed
3. Infarctions ... embolic
Previous Approaches

In SemEval 2014 Task 7, only two teams could handle discontiguous and overlapping entities:
1. Pathak et al. (2014) [5]: standard NER using the BIO tagset, pipelined with an SVM to combine the spans
2. Zhang et al. (2014) [6] (best team): an extended BIO tagset coupled with heuristics [7]
   - B, I for contiguous tokens
   - BD, ID for discontiguous tokens
   - BH, IH for overlapping tokens

[5] Parth Pathak et al. (2014). “ezDI: A Hybrid CRF and SVM based Model for Detecting and Encoding Disorder Mentions in Clinical Notes”. In: SemEval 2014.
[6] Yaoyun Zhang et al. (2014). “UTH CCB: A report for SemEval 2014 – Task 7 Analysis of Clinical Text”. In: SemEval 2014.
[7] Buzhou Tang et al. (2013). “Recognizing and Encoding Disorder Concepts in Clinical Text using Machine Learning and Vector Space”. In: ShARe/CLEF Eval. Lab.
Encoding in Model of Zhang et al.

[[[Infarctions]1]2]3 either [water shed]1 or [embolic]2
BH                   O      BD    ID      O  BD

1. Infarctions ... water shed
2. Infarctions ... embolic
3. Infarctions

This is the canonical encoding of this particular set of entities.

(Example taken from the full sentence: “... protocol to evaluate for any infarctions, either water shed or embolic, ...”)
Decoding in Model of Zhang et al.

[[Infarctions]1]2 either [water shed]1 or [embolic]2
BH                O      BD    ID      O  BD

1. Infarctions ... water shed
2. Infarctions ... embolic
3. Infarctions (?)

Ambiguous! The same tag sequence is consistent with more than one set of entities.
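The ambiguity can be made concrete with a small sketch (not the authors' decoder). From the tag sequence alone we can recover the shared component (BH) and the discontiguous components (BD/ID), but not whether the shared token also stands on its own as an entity:

```python
# Sketch: the extended-tagset sequence from the slide, and why decoding it is
# ambiguous. BH marks a token shared by several entities; BD/ID mark the spans
# of discontiguous entities.
tokens = ["Infarctions", "either", "water", "shed", "or", "embolic"]
tags   = ["BH",          "O",      "BD",    "ID",   "O",  "BD"]

def components(tokens, tags):
    """Collect contiguous components by coarse type: H = shared, D = discontiguous."""
    comps = {"H": [], "D": []}
    for i, tag in enumerate(tags):
        if tag == "O":
            continue
        kind = "H" if tag.endswith("H") else "D"
        if tag.startswith("B"):
            comps[kind].append([tokens[i]])   # start a new component
        else:
            comps[kind][-1].append(tokens[i])  # I-tag continues the previous one
    return comps

c = components(tokens, tags)
# c["H"] == [["Infarctions"]]; c["D"] == [["water", "shed"], ["embolic"]]

# Both of these distinct entity sets are consistent with the same tags:
reading_1 = [("Infarctions", "water shed"), ("Infarctions", "embolic")]
reading_2 = [("Infarctions",), ("Infarctions", "water shed"), ("Infarctions", "embolic")]
```

The tags identify the components, but pairing them into entities (and deciding whether "Infarctions" is also an entity by itself) requires heuristics outside the tagset.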
Number of Entity Combinations

In a sentence with n words, there are:
1. 2^n - 1 possible discontiguous entities
2. 2^(2^n - 1) possible combinations of discontiguous entities*
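These counts can be checked by brute force for small n (illustrative sketch; the count of combinations here includes the empty combination):

```python
# Brute-force check of the combination counts on the slide.
from itertools import combinations

def brute_entities(n):
    """Every non-empty subset of the n token positions is a possible
    (possibly discontiguous) entity: 2**n - 1 of them."""
    return [c for k in range(1, n + 1) for c in combinations(range(n), k)]

n = 4
entities = brute_entities(n)
assert len(entities) == 2**n - 1           # 15 possible entities for n = 4
# Any subset of those entities is a possible combination of entities:
assert 2**len(entities) == 2**(2**n - 1)   # 2^15 combinations
```

The doubly exponential number of combinations is what makes a compact encoding (rather than enumeration) necessary.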
Discontiguous Entity Recognition

1. How to efficiently model these discontiguous (and possibly overlapping) entities?
2. How to compare the ambiguity between models for discontiguous entities?
Contributions

In this paper, we contribute:
1. A new hypergraph-based model to better handle discontiguous entities
2. A simple theoretical framework to compare ambiguity between models
Our Hypergraph-based Model

[Figure: the full hypergraph over the sentence “Infarctions either water shed or embolic”, with A-, E-, T-, B-, O-, and X-nodes at each word position.]
[Figure: the subgraph encoding the three entities of the example sentence: Infarctions; Infarctions ... water shed; Infarctions ... embolic.]
Our Hypergraph-based Model

Key ideas:
1. Build a hypergraph that can encode any entity combination.
2. For any sentence annotated with entities, there is a unique subgraph that represents it (the canonical encoding).
3. Each entity is represented as a path in the entity-encoded hypergraph, where the B-nodes indicate which tokens are part of the entity.
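Key idea 3 can be sketched in a simplified form (illustrative only: the actual model uses typed nodes such as B0/B1 and O1, and shares nodes between overlapping entities):

```python
# Simplified sketch of key idea 3: an entity is a left-to-right path whose
# B-nodes mark the member tokens and whose O-nodes skip everything else.
def entity_path(n_tokens, member_indices):
    members = set(member_indices)
    return ["B" if i in members else "O" for i in range(n_tokens)]

# "Infarctions ... water shed" over the 6-token example sentence
# "Infarctions either water shed or embolic" (member tokens 0, 2, 3):
print(entity_path(6, {0, 2, 3}))  # ['B', 'O', 'B', 'B', 'O', 'O']
```

Because each entity is just a path, any subset of the 2^n - 1 possible discontiguous entities corresponds to some set of paths, so the hypergraph can encode any entity combination.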
[Figure: step-by-step construction of the entity-encoded hypergraph for “Infarctions either water shed or embolic”: a B0-node marks “Infarctions”; O1-nodes skip “either” and “or”; B1-nodes mark “water”, “shed”, and “embolic”; the result encodes the three entity paths (Infarctions; Infarctions ... water shed; Infarctions ... embolic).]
Our Hypergraph-based Model

Training and predicting:
1. Training: maximize the conditional log-likelihood of the training data.
2. Predicting: use Viterbi to find the highest-scoring subgraph.
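For intuition, here is a standard first-order Viterbi over a tag lattice, the linear-chain analogue of the search the model performs over its hypergraph (a sketch with toy scores; the paper's model scores hyperedges with learned features, not these tables):

```python
# Minimal first-order Viterbi sketch: find the highest-scoring tag sequence
# under per-position emission scores and pairwise transition scores.
def viterbi(n, tags, emit, trans):
    """emit[i][t]: score of tag t at position i; trans[(s, t)]: transition score."""
    best = {t: emit[0][t] for t in tags}
    back = []
    for i in range(1, n):
        back.append({})
        new_best = {}
        for t in tags:
            prev = max(tags, key=lambda s: best[s] + trans[(s, t)])
            back[-1][t] = prev
            new_best[t] = best[prev] + trans[(prev, t)] + emit[i][t]
        best = new_best
    last = max(tags, key=lambda t: best[t])
    path = [last]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    return list(reversed(path))

# Toy example: 3 positions, two tags, a penalty for B-B transitions.
emit = [{"B": 2, "O": 0}, {"B": 0, "O": 1}, {"B": 1, "O": 0}]
trans = {("B", "B"): -1, ("B", "O"): 0, ("O", "B"): 0, ("O", "O"): 0}
print(viterbi(3, ["B", "O"], emit, trans))  # ['B', 'O', 'B']
```

The same max-plus dynamic program generalizes to the hypergraph: each node keeps the best score over its incoming hyperedges instead of over single predecessors.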
Experiments
Experimental Setup

- Dataset taken from SemEval 2014 Task 7, keeping sentences that contain discontiguous entities.
- Two setups for the training set: “Discontiguous” (smaller) and “Original” (larger).
- Models optimized for F1-score on the dev set by varying λ.
- Features follow Tang et al. (2013): words, POS tags, Brown clusters, semantic categories, ...
Results Using the Smaller Training Set

Model     Precision   Recall   F1-score
Li-Enh        54.70    41.20      47.00
Li-All        15.20    44.90      22.70
Sh-Enh        76.90    40.10      52.70
Sh-All        76.00    40.50      52.80
(all scores in %)
Results Using the Larger Training Set

Model     Precision   Recall   F1-score
Li-Enh        64.10    46.50      53.90
Li-All        52.80    49.40      51.10
Sh-Enh        73.90    49.10      59.00
Sh-All        73.40    49.50      59.10
(all scores in %)
Ambiguity
Ambiguity

One encoding can have multiple interpretations (sets of entities).

Hypergraph example: apparent [atrial [pacemaker]2 artifact]1 without [capture]2
[Figure: the hypergraph encoding of this sentence.]
- Interpretation A: (1) atrial pacemaker artifact; (2) pacemaker ... capture
- Interpretation B: (1) pacemaker artifact; (2) atrial pacemaker ... capture

Tagset example: Infarctions either water shed or embolic, tagged BH O BD ID O BD
- Interpretation A: (1) infarctions ... water shed; (2) infarctions ... embolic
- Interpretation B: (1) infarctions; (2) infarctions ... water shed; (3) infarctions ... embolic
Ambiguity

The models need further processing after prediction to generate one set of entities. We compare two heuristics:
1. All: return the union of all possible interpretations.
2. Enough: return one possible interpretation.
Ambiguity

Definition: the ambiguity level A(M) of a model M is the average number of interpretations of each canonical encoding in the model.
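The definition can be checked by brute force on a tiny case (my own illustrative construction, not from the paper): for plain BIO over contiguous, non-overlapping entities, every canonical encoding has exactly one interpretation, so A(BIO) = 1.

```python
# Brute-force ambiguity level of plain BIO over sentences of n tokens.
from itertools import combinations

def bio_encode(n, spans):
    tags = ["O"] * n
    for s, e in spans:
        tags[s] = "B"
        for i in range(s + 1, e):
            tags[i] = "I"
    return tuple(tags)

def entity_sets(n):
    """All sets of pairwise non-overlapping contiguous spans over n tokens."""
    spans = [(s, e) for s in range(n) for e in range(s + 1, n + 1)]
    result = []
    for k in range(len(spans) + 1):
        for combo in combinations(spans, k):
            if all(a[1] <= b[0] or b[1] <= a[0] for a, b in combinations(combo, 2)):
                result.append(combo)
    return result

n = 3
readings = {}
for es in entity_sets(n):
    readings.setdefault(bio_encode(n, es), []).append(es)

# Average number of interpretations per encoding:
ambiguity = sum(map(len, readings.values())) / len(readings)
print(ambiguity)  # 1.0 -- BIO is unambiguous
```

The interesting models in this paper (the extended tagset and the hypergraph) have A(M) > 1, which is what the heuristics above have to resolve.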
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Counting Number of Encodings
How many canonical encodings do the models have? For the baseline model:
There are 7 possible tags per word (B, I, BD, ID, BH, IH, O) The model can output any combination of those: 7n Not all are canonical, so: MLi(n) < 7n < 23n
For our hypergraph-based model:
Number of canonical encoding = number of subgraphs Q: How to calculate the number of subgraphs?
24 / 37
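The bound on the baseline's encoding count follows from 7 < 8 = 2^3, so 7^n < 2^(3n) for every n. A quick numeric check (a sketch added here, not part of the original analysis):

```python
# Sanity check of the baseline bound MLi(n) < 7**n < 2**(3*n):
# with 7 tags per word there are 7**n possible tag sequences,
# and 7**n < 8**n = 2**(3*n) since 7 < 2**3.
for n in range(1, 21):
    assert 7 ** n < 2 ** (3 * n)

print(7 ** 4, 2 ** (3 * 4))  # 2401 4096
```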
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Counting Number of Encodings
A: Use dynamic programming on combination of nodes 25 / 37
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Counting Number of Encodings
A: Use dynamic programming on combination of nodes
- Fig. 1: Simplified graph to illustrate
subgraph counting
25 / 37
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Counting Number of Encodings
A: Use dynamic programming on combination of nodes
- Fig. 1: Simplified graph to illustrate
subgraph counting
- Fig. 2: State transitions
25 / 37
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Counting Number of Encodings
A: Use dynamic programming on combination of nodes
- Fig. 1: Simplified graph to illustrate
subgraph counting
- Fig. 2: State transitions
f11(n) = 2 ∗ f11(n − 1) + f01(n − 1) (1) 25 / 37
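Recurrence (1) can be evaluated with simple memoization. The sketch below is illustrative only: the slides give just this one transition, so the f01 transition and both base cases are hypothetical placeholders, not the paper's actual DP.

```python
from functools import lru_cache

# Illustrative DP in the style of recurrence (1): f11(n) = 2*f11(n-1) + f01(n-1).
# Only the f11 recurrence itself comes from the slides; the f01 transition
# and the base cases below are hypothetical placeholders.

@lru_cache(maxsize=None)
def f01(n: int) -> int:
    if n == 0:
        return 1  # assumed base case
    return f01(n - 1)  # placeholder transition (not given in the slides)

@lru_cache(maxsize=None)
def f11(n: int) -> int:
    if n == 0:
        return 1  # assumed base case
    return 2 * f11(n - 1) + f01(n - 1)  # recurrence (1)

print([f11(n) for n in range(6)])  # [1, 3, 7, 15, 31, 63]
```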
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Counting Number of Encodings

How many canonical encodings do the models have?
For the baseline: MLi(n) < 7^n < 2^(3n)
For our hypergraph-based model: number of canonical encodings = number of subgraphs; after more calculations: MSh(n) > C · 2^(10n)
So our model is less ambiguous than the baseline model

26 / 37
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Empirical Ambiguity

         Discontiguous                  Original
         Prec Err        Rec Err       Prec Err        Rec Err
Li-all   63.66%          0.00%*        23.81%          0.00%*
         (3,478/5,463)   (0/1,985)     (3,484/14,632)  (0/11,147)
Sh-all   1.73%           0.00%*        0.35%           0.00%*
         (35/2,020)      (0/1,985)     (39/11,186)     (0/11,147)
Li-enh   2.74%           3.82%         0.52%           0.90%
         (54/1,969)      (76/1,991)    (58/11,123)     (101/11,166)
Sh-enh   1.21%           1.46%         0.25%           0.38%
         (24/1,986)      (29/1,991)    (28/11,152)     (42/11,166)

Table 1: Precision and recall errors (%) of each model on the "Discontiguous" and "Original" training data when given the gold output structures. Lower numbers are better.

27 / 37
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Conclusion
28 / 37
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Conclusion

The hypergraph-based model we proposed is better at recognizing discontiguous and overlapping spans than a strong baseline
Our theoretical analysis (by counting encodings) shows that our model is less ambiguous in representing discontiguous entities, which matches the results of the ambiguity experiments

29 / 37
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Future Work

Explore applications of discontiguous span recognition to other tasks
Explore further extensions of this model, similar to semi-Markov CRF
Explore other training procedures (SSVM, max-margin)

30 / 37
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Thank You
Code available at: http://statnlp.org/research/ie/
Aldrian Obaja Muis and Wei Lu
Singapore University of Technology and Design
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Ambiguity in Our Model

- Fig.: Hypergraph encoding built over the sentence (nodes labeled A, E, T, B0, O1, B1, X)

apparent [atrial [pacemaker]2 artifact]1 without [capture]2

The same encoding covers multiple entity readings:
atrial pacemaker artifact
pacemaker artifact
pacemaker . . . capture
atrial pacemaker . . . capture

31 / 37
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Double Counting in Naive DP
32 / 37
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Counting Number of Encodings

n   MLi(n)     MSh(n)    N(n)
1   2          2         2^1  = 2
2   8          8         2^3  = 8
3   46         80        2^7  = 128
4   < 2401     3584      2^15 = 32768
5   < 16807    533504    2^31 = 2147483648

Table 2: The number of possible encodings for small values of n

33 / 37
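Two numeric patterns are visible in Table 2 (observations from the table values, not claims made on the slide): the MLi(n) upper bounds at n = 4, 5 equal 7^n, and the listed N(n) values all match 2^(2^n − 1).

```python
# Patterns read off Table 2 (observations, not claims from the slide):
# the MLi(n) upper bounds at n = 4, 5 are exactly 7**n,
# and every listed N(n) equals 2**(2**n - 1).
assert 7 ** 4 == 2401 and 7 ** 5 == 16807  # the "< 2401" and "< 16807" bounds

table_N = {1: 2, 2: 8, 3: 128, 4: 32768, 5: 2147483648}
for n, N in table_N.items():
    assert N == 2 ** (2 ** n - 1)

print("Table 2 patterns verified")
```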
Introduction Our Model Experiments Ambiguity Conclusion Appendix
Ambiguity Level

Definition: The relative ambiguity Ar(M1, M2) between models M1 and M2 is the ratio of the logs of the numbers of canonical encodings:

Ar(M1, M2) = lim_{n→∞} log( Σ_{i=1}^{n} M_{M2}(i) ) / log( Σ_{i=1}^{n} M_{M1}(i) )

where MM(i) is the number of encodings in model M for a sequence of length i.

With MSh(n) > C · 2^(10n) and MLi(n) < 2^(3n), this results in:

Ar(Li, Sh) ≥ lim_{n→∞} (log C + 10n log 2) / (3n log 2) = 10/3 > 1

34 / 37