Constructing Knowledge Graph from Unstructured Text


SLIDE 1

Constructing Knowledge Graph from Unstructured Text

Image Source: www.ibm.com/smarterplanet/us/en/ibmwatson/

Kundan Kumar, Siddhant Manocha

SLIDE 2

MOTIVATION

Image Source: KDD 2014 Tutorial on Constructing and Mining Web-scale Knowledge Graphs, New York

SLIDE 3

Image Source: KDD 2014 Tutorial on Constructing and Mining Web-scale Knowledge Graphs, New York

MOTIVATION

SLIDE 4

Image Source: KDD 2014 Tutorial on Constructing and Mining Web-scale Knowledge Graphs, New York

MOTIVATION

SLIDE 5

PROBLEM STATEMENT

SLIDE 6

KNOWLEDGE GRAPH

http://courses.cs.washington.edu/courses/cse517/13wi/slides/cse517wi13-RelationExtraction.pdf

SLIDE 7

KNOWLEDGE GRAPH

http://courses.cs.washington.edu/courses/cse517/13wi/slides/cse517wi13-RelationExtraction.pdf

SLIDE 8

QUESTION ANSWERING

SLIDE 9

EXISTING KNOWLEDGE BASES

Image Source: KDD 2014 Tutorial on Constructing and Mining Web-scale Knowledge Graphs, New York

SLIDE 10

EXISTING KNOWLEDGE BASES

Supervised Models:

  • Learn classifiers from positive/negative examples; typical features: context words with POS tags, the dependency path between entities, and named entity tags
  • Require a large number of tagged training examples
  • Do not generalize to new relation types

Semi-Supervised Models:

  • Bootstrap algorithms: use seed examples to learn an initial set of relations (see the sketch after this list)
  • Generate positive/negative examples to learn a classifier
  • Learn more relations using this classifier

Distant Supervision:

  • Use an existing knowledge base together with unlabeled text to generate training examples
  • Learn models using this set of relations
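As a rough illustration of the bootstrap idea, here is a minimal Python sketch of a Snowball-style loop. The template representation and the helper names (make_template, apply_template) are simplified stand-ins, not the implementation from any cited paper.

```python
import re

def make_template(sentence, e1, e2):
    """Turn a sentence containing a known pair into a textual template."""
    return sentence.replace(e1, "{X}").replace(e2, "{Y}")

def apply_template(template, sentence):
    """Return (x, y) if the sentence instantiates the template, else None."""
    pattern = re.escape(template)
    pattern = pattern.replace(re.escape("{X}"), r"([A-Z]\w+)")
    pattern = pattern.replace(re.escape("{Y}"), r"([A-Z]\w+)")
    m = re.search(pattern, sentence)
    return m.groups() if m else None

def bootstrap(corpus, seed_pairs, iterations=3):
    known = set(seed_pairs)
    templates = set()
    for _ in range(iterations):
        # Step 1: learn templates from sentences containing known pairs.
        for s in corpus:
            for e1, e2 in known:
                if e1 in s and e2 in s:
                    templates.add(make_template(s, e1, e2))
        # Step 2: apply templates to the corpus to harvest new pairs.
        for s in corpus:
            for t in templates:
                pair = apply_template(t, s)
                if pair:
                    known.add(pair)
    return known

corpus = ["Paris is the capital of France.",
          "Berlin is the capital of Germany."]
print(bootstrap(corpus, {("Paris", "France")}))
# learns ("Berlin", "Germany") from the shared template
```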
SLIDE 11

OUR APPROACH

Bootstrapping Relations using Distributed Word Vector Embeddings

1) Words that occur in similar contexts lie close together in the word embedding space.
2) Word vectors are semantically consistent and capture many linguistic regularities (such as 'capital city', 'native language', and plural relations).
3) Obtain word vectors from unstructured text (using Google word2vec, GloVe, etc.).
4) Exploit the properties of this manifold to obtain binary relations between entities (see the sketch below).
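A minimal sketch of step 4, assuming gensim and a pretrained embedding file are available ("vectors.bin" and the seed pairs below are illustrative):

```python
# Propose new pairs for a relation from the average vector offset of seeds.
import numpy as np
from gensim.models import KeyedVectors

wv = KeyedVectors.load_word2vec_format("vectors.bin", binary=True)

# Average the (tail - head) offset over seed pairs of the target relation.
seeds = [("Paris", "France"), ("Berlin", "Germany"), ("Rome", "Italy")]
offset = np.mean([wv[tail] - wv[head] for head, tail in seeds], axis=0)

def propose_tail(head, topn=3):
    """For a candidate head entity, the tail should lie near head + offset."""
    return wv.similar_by_vector(wv[head] + offset, topn=topn)

print(propose_tail("Madrid"))   # expect "Spain" near the top of the list
```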

SLIDE 12

ALGORITHM

Image Source: KDD 2014 Tutorial on Constructing and Mining Web-scale Knowledge Graphs, New York

SLIDE 13

SIMILARITY METRIC

Image Source: A Survey on Relation Extraction, Nguyen Bach, Carnegie Mellon University

SLIDE 14

KERNEL BASED APPROACHES

SLIDE 15

DEPENDENCY KERNELS

1. Actual sentences
2. Dependency graph
3. Kernel computation

Kernel: K(x, y) = 3 × 1 × 1 × 1 × 2 × 1 × 3 = 18

Image Source: A Shortest Path Dependency Kernel for Relation Extraction, Bunescu and Mooney
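The shortest-path dependency kernel multiplies, position by position, the number of features the two dependency paths share. A toy computation in Python, with feature sets modeled on Bunescu and Mooney's classic example (it also yields 18, though factored over fewer path positions than the slide shows):

```python
# Toy shortest-path dependency kernel: the product, over aligned path
# positions, of how many features (word, POS, generalized class) the two
# paths share. Feature sets here are illustrative.

def sp_kernel(path_x, path_y):
    """Paths are equal-length lists of feature sets; K = 0 otherwise."""
    if len(path_x) != len(path_y):
        return 0
    k = 1
    for fx, fy in zip(path_x, path_y):
        k *= len(fx & fy)              # count of common features
    return k

x = [{"protesters", "NNS", "Noun", "Person"}, {"->"},
     {"seized", "VBD", "Verb"}, {"<-"},
     {"stations", "NNS", "Noun", "Facility"}]
y = [{"troops", "NNS", "Noun", "Person"}, {"->"},
     {"raided", "VBD", "Verb"}, {"<-"},
     {"churches", "NNS", "Noun", "Facility"}]

print(sp_kernel(x, y))   # 3 * 1 * 2 * 1 * 3 = 18
```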

SLIDE 16

PRELIMINARY RESULTS

Word Vector Embedding: Wikipedia Corpus

SLIDE 17

PRELIMINARY RESULTS (Wikipedia corpus)

Seed examples for the 'capital' relationship | Positive relations learnt | Negative relations learnt

SLIDE 18

PRELIMINARY RESULTS (Google News corpus)

Seed examples | Positive relations learned | Negative relations learned

SLIDE 19

References

1) Tomas Mikolov, Wen-tau Yih, and Geoffrey Zweig. Linguistic Regularities in Continuous Space Word Representations. In Proceedings of NAACL HLT, 2013.
2) Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. Distributed Representations of Words and Phrases and their Compositionality. In Proceedings of NIPS, 2013.
3) Eugene Agichtein and Luis Gravano. Snowball: Extracting Relations from Large Plain-Text Collections. In Proceedings of the Fifth ACM Conference on Digital Libraries, June 2000.
SLIDE 20

Questions!

SLIDE 21

CBOW MODEL

  • Input words are represented in a 1-of-V encoding
  • The linear sum of the input vectors is projected onto the projection layer
  • A hierarchical softmax layer is used to ensure that the weights in the output layer lie between 0 and 1 (0 <= p <= 1)
  • Weights are learnt using back-propagation (a minimal sketch follows below)
  • The projection matrix from the projection layer to the hidden layer gives the word vector embeddings

Image Source: Linguistic Regularities in Continuous Space Word Representations, Mikolov et al., 2013
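A minimal numpy sketch of one CBOW training step. For brevity it uses a plain softmax output rather than the hierarchical softmax named above, and all sizes are toy values:

```python
import numpy as np

V, D = 10, 4                          # toy vocabulary size, embedding dim
rng = np.random.default_rng(0)
W_in = rng.normal(scale=0.1, size=(V, D))    # input->projection (the embeddings)
W_out = rng.normal(scale=0.1, size=(D, V))   # projection->output weights

def cbow_step(context_ids, target_id, lr=0.05):
    """One gradient step: predict the centre word from averaged context vectors."""
    h = W_in[context_ids].mean(axis=0)        # linear combination of contexts
    scores = h @ W_out
    p = np.exp(scores - scores.max())
    p /= p.sum()                              # softmax, so 0 <= p <= 1
    grad = p.copy()
    grad[target_id] -= 1.0                    # dL/dscores for cross-entropy
    dh = W_out @ grad                         # backprop into the projection
    W_out -= lr * np.outer(h, grad)
    W_in[context_ids] -= lr * dh / len(context_ids)

cbow_step(context_ids=[1, 2, 4, 5], target_id=3)
# After training, the rows of W_in are the word vector embeddings.
```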

SLIDE 22

WORD VECTOR MODEL

SLIDE 23

WORD VECTOR MODEL

SLIDE 24

KERNEL BASED APPROACHES

Image Source: Kernel Methods for Relation Extraction, Zelenko et al.