A Hybrid Neural Model for Type Classification of Entity Mentions

Motivation ▪ Types group entities to categories ▪ Entity types are important for various NLP tasks ▪ Question answering ▪ Relation extraction ▪ Semantic role labeling ▪ … ▪ Our task ▪ predict an entity mention’s type

Type Classification of Entity Mentions ▪ Input ▪ 𝑑 −𝑇 … 𝑑 −1 𝑥 1 … 𝑥 𝑜 𝑑 1 … 𝑑 𝑇 left context mention right context ▪ Output ▪ Type ▪ [an initiative sponsored by][Bill & Melinda Gates Foundation][to fight HIV infection] Organization

▪ Mention ▪ Bill & Melinda Gates Foundation ( Organization ) ▪ Bill, Melinda, Gates -> {Person Name} ▪ {Person Name} + Foundation -> Organization ▪ Context ▪ [The greater part of ][ Gates ][ ' population is in Marion County .] ( Location ) ▪ [ Gates ][ was a baseball player .] ( Person )

Related Work ▪ Named Entity Recognition ▪ Limited types ▪ Location, Person, Organization, Misc ▪ (e.g.) Question Answering ▪ Questions are classified into more answer types ▪ Named Entity Linking (1. Link to an entity in knowledge base 2. Query its entity type) ▪ Performance drops for uncommon entities ▪ (e.g.) Question Answering ▪ Extracted answer candidate may not appear in knowledge base ▪ NEL is a harder problem than type classification ▪ Design rich features ▪ N-gram, morphological features, gazetteers, WordNet, ReVerb patterns, POS tags, dependency parsing results, etc.

Architecture 𝑓 𝜾 𝟐 𝒚 Decision Model 1 𝒛 = ⋮ (softmax classifier) 𝑓 𝜾𝒌𝒚 𝐷 σ 𝑘=1 𝑓 𝜾 𝑫 𝒚 Context Model Mention Model Context Model

RNN-based Mention Model ▪ Learn composition patterns for entity mention ▪ {Name} + Foundation / University -> (Organization) ▪ {Body Region} + {Disease} -> (Disease) ▪ R ecurrent N eural N etworks (Elman Networks) ▪ Use a global composition matrix to compute representation recurrently ▪ A natural way to learn composition patterns 𝒘 𝟑 = 𝑢𝑏𝑜ℎ(𝑋 𝒙 𝟐 𝒙 𝟑 + 𝒄 𝒏 ) 𝒘 𝟒 = 𝑢𝑏𝑜ℎ(𝑋 𝒘 𝟑 𝒘 𝟔 = 𝑢𝑏𝑜ℎ(𝑋 𝒘 𝟓 𝒙 𝟒 + 𝒄 𝒏 ) 𝒙 𝟔 + 𝒄 𝒏 ) … Mention Representation & Melinda Gates Foundation Bill 𝒙 𝟐 𝒙 𝟑 𝒙 𝟒 𝒙 𝟓 𝒙 𝟔

MLP-based Context Model ▪ Use context to disambiguate ▪ [The greater part of ][ Gates ][ ' population is in Marion County .] ( Location ) ▪ [ Gates ][ was a baseball player .] ( Person ) ▪ M ulti L ayer P erceptrons ▪ Location-aware, jointly trained Left Context Right Context Representation Representation Hidden Layer Concatenation Layer an initiative sponsored by to fight HIV infection

Model Training ▪ Objective function 𝜇 𝜄 𝑗 log 𝒛 𝑘 𝑗 2 minimize − ෍ ෍ 𝒖 𝑘 + 𝜄 2 2 𝜄 𝑗 𝑘 regularization cross entropy loss ▪ Back-propagation algorithm ▪ Back-propagate errors of softmax classifier to other layers ▪ Optimization ▪ Mini-batched AdaGrad

Automatically Generating Training Data Wikipedia Article Context Mention Context Anchor link Wikipedia ID DBpedia Entity DBpedia rdf:type Organization

Automatically Generating Training Data ▪ DBpedia ontology ▪ 22 top-level types ▪ Wiki-22 ▪ #Train: 2 million ▪ #Dev: 0.1 million ▪ #Test: 0.28 million

Evaluation on Wiki-22 ▪ micro-F1 / macro-F1 score ▪ Baseline methods ▪ Support Vector Machine (SVM) ▪ Multinomial Naive Bayes (MNB) ▪ Sum word vectors (ADD) ▪ Use a softmax classifier ▪ *-mention ▪ Only use mention ▪ *-context ▪ Only use context ▪ *-joint ▪ Use both mention and context

Comparison with Previous Systems ▪ HYENA [Yosef et al., 2012] ▪ Support Vector Machine ▪ unigrams, bigrams, and trigrams of mentions, surrounding sentences, mention paragraphs, part-of-speech tags of context words, gazetteer dictionary ▪ FIGER [Ling and Weld, 2012] ▪ Perceptron ▪ unigrams, word shapes, part-of-speech tags, length, Brown clusters, head words, dependency structures, ReVerb patterns

Evaluation on Unseen Mentions ▪ Evaluate on unseen mentions (length > 2) ▪ Mentions which do not appear in the train set ▪ Help us deal with uncommon or unseen mentions ▪ RNN-based mention model utilizes the compositional nature of mentions

Examples: Compositionality of Mentions ▪ Query similar mention examples ▪ cosine similarity of mentions' vector representations ▪ Mentions that are of similar patterns are closer

Evaluation in Question Answering (QA) ▪ Web-based QA system [Cucerzan and Agichtein, 2005; Lin, 2007] ▪ Add Q&A type interaction feature template Answer Type Q: who is the ceo of microsoft? Classifier Person (18 types) Search Engine (Bing) √ [left context] [Satya Nadella] [right context] Person Candidates Extracted from [left context] [Xbox] [right context] Device Titles and Snippets Feature Template: {Type(Q)|Type(A)} {Person|Person} – positive weight Ranker Add {Person|Device} – negative weight Answers

Evaluation in Question Answering (QA) ▪ WebQuestions dataset [Berant et al., 2013] ▪ Manually annotated question-answer pairs ▪ Our type classifier improves the accuracy of QA systems

Conclusion and Future Work ▪ Conclusion ▪ Recurrent Neural Networks are good at learning soft patterns ▪ Compositional nature of entity mentions ▪ Generalize for Unseen or uncommon mentions ▪ Automatically generate training data instead of annotating manually ▪ Type information is important for many NLP tasks ▪ Future work ▪ Fine-grained type classification ▪ Person -> doctor, actor, etc. ▪ Utilize hierarchical taxonomy ▪ Multi-label ▪ Utilize global information (e.g., document topic) ▪ …

A Hybrid Neural Model for Type Classification of Entity Mentions - PowerPoint PPT Presentation

A Hybrid Neural Model for Type Classification of Entity Mentions Motivation Types group entities to categories Entity types are important for various NLP tasks Question answering Relation extraction Semantic role labeling

Web/CD Hybrid Model Web/CD Hybrid Model Web/CD Hybrid Model Web/CD Hybrid Model for t he Dist

Type Checking Grammar Rule Semantic Rule var-decl id : type-exp Insert (id.name, type-exp .

Model Predictive Control Model Predictive Control of Hybrid Systems of Hybrid Systems Model

Hybrid Construction Hybrid Construction Hybrid Construction Hybrid Construction 1 VP

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural

Neural-Symbolic Integration Strategies Neural-Symbolic Integration Unification Hybrid

Hybrid Automobiles Hybrid Automobiles It switches easily between fuel, batteries, or both It

EXPO REAL Hybrid Summit Your virtual exhibition EXPO REAL Hybrid Summit The Hybrid Conference

Hybrid NLP Hybrid NLP O UTLINE O UTLINE Problems of Deep and Shallow Processing

Type Reconstruction and Polymorphism 1 Type Checking and Type Reconstruction We now come to the

Neural Networks and Handwriting Recognition Background Neural Networks Neural Network Steven

Learning Neural Networks Learning Neural Networks Neural Networks can represent complex Neural

Identification of Hybrid Systems Identification of Hybrid Systems Therefore, a model must be

A Hybrid, Dynamic Logic for Hybrid-Dynamic Information Flow Brandon Bohrer and Andr e Platzer

SMD Type Raised SMD Type Solderless Type Thru-Hole Type 05/03/09 2 MiniGrid and SMT

A Hybrid Neural Model for Type Classification of Entity Mentions Li Dong Furu Wei Hong

CLL - ibrutinib Peter Hillmen peter.hillmen@nhs.net St Jamess University Hospital Leeds 10 th

UC UC SF SF Vascular Trauma @ SFGH: Practical Lessons Learned Shant M. Vartanian, MD

A New Practical Intracerebral Hematoma Volume Calculation Method and Comparison to simple ABC/2

Comprehensive Care of Delirious Patients Vanja Douglas, MD Sara & Evan Williams Foundation

10/22/16 Lay of the land Update on Acute Coronary Syndrome: Case studies with an emphasis on

Caring for Families Impacted by Substance Use Disorder Barbara Callahan Priya Gopalan May 29,

THE RAPIDLY CHANGING FACE OF SUBSTANCE ABUSE A Defining Moment for Employers Presented By

THOUGHTS ON A MARKET APPROACH TO CURBING SUBSTANCE USE DISORDER John J. Dreyzehner, MD, MPH