lexical sense alignment using weighted bipartite b
play

Lexical sense alignment using weighted bipartite b -matching Sina - PowerPoint PPT Presentation

Lexical sense alignment using weighted bipartite b -matching Sina Ahmadi (ULD) Supervisors: John McCrae, Mihael Arcan 16th Ph.D. day on April 8, 2018 Lexical resources 2 Lexical resources Expert-made Collaboratively-curated 3 Why


  1. Lexical sense alignment using weighted bipartite b -matching Sina Ahmadi (ULD) Supervisors: John McCrae, Mihael Arcan 16th Ph.D. day on April 8, 2018

  2. Lexical resources �2

  3. Lexical resources Expert-made Collaboratively-curated �3

  4. Why combining resources? • To improve word and concept coverage • e.g., named entities, new senses • To improve domain coverage • To improve multilinguality • Creating resources for new language pairs • To combine expert-made semantic relations • e.g., Hypernymy, meronymy, etc. �4

  5. A few applications • Semantic parsing • Word-sense disambiguation and entity linking bat: or • Semantic role labeling �5

  6. Difficulty of resource alignment Spring (n) 6 senses 18 senses �6

  7. How does our resource alignment work? Wiktionary:spring a leap; a bound; a jump WordNet:spring traditionally the first of the the season of growth four seasons of the year in temperate regions the elasticity of something that can be stretched and a place where water or oil returns to its original length emerges from the ground a natural flow of ground water elasHc power or force. the property of a body of a light, self-propelled movement springing to its original form aJer upwards or forwards being compressed, stretched, etc. the Hme of growth and progress; early porHon; first stage. �7

  8. Step 1. Alignment problem as a graph V a leap; a bound; a jump U traditionally the first of the the season of growth four seasons of the year in temperate regions the elasticity of something that can be stretched and a place where water or oil returns to its original length emerges from the ground a natural flow of ground water elasHc power or force. the property of a body of a light, self-propelled movement springing to its original form aJer upwards or forwards being compressed, stretched, etc. the Hme of growth and progress; early porHon; first stage. U and V are disjoint and independent → bipartite �8

  9. Step 2. Extract similarity scores Determine how similar two senses are by training a model using textual and definitional similarity features such as: • Word length ratio * • Longest common subsequence • Jaccard measure • Word embeddings • Forward precision * McCrae, John P., and Paul Buitelaar. "Linking Datasets Using SemanHc Textual �9 Similarity." Cyberne)cs and Informa)on Technologies 18.1 (2018): 109-123.

  10. Step 2. Extract similarity scores V a leap; a bound; a jump U 0.00187 traditionally the first of the the season of growth four seasons of the year in 0.84391 temperate regions the elasticity of something that can be stretched and a place where water or oil 0.01021 returns to its original length emerges from the ground a natural flow of ground water 0.38672 elasHc power or force. the property of a body of a light, self-propelled movement 0.00021 springing to its original form aJer upwards or forwards being compressed, stretched, etc. 0.98951 the Hme of growth and progress; early porHon; first stage. Weighted bipartite graph �10

  11. Step 3. Graph matching

  12. Exhaustive matching V → Use a threshold a leap; a bound; a jump U 0.00187 traditionally the first of the the season of growth four seasons of the year in 0.84391 temperate regions the elasticity of something that can be stretched and a place where water or oil 0.01021 returns to its original length emerges from the ground a natural flow of ground water 0.38672 elasHc power or force. the property of a body of a light, self-propelled movement 0.00021 springing to its original form aJer upwards or forwards being compressed, stretched, etc. 0.98951 0.98951 • High precision, low recall the Hme of growth and progress; early porHon; first stage. • Difficult to find optimal threshold • Uniform matching �12

  13. Exact matching V → 1-to-1 links maximising overall weight a leap; a bound; a jump U 0.00187 0.00091 traditionally the first of the the season of growth four seasons of the year in 0.84391 0.00363 temperate regions the elasticity of something that can be stretched and a place where water or oil 0.01021 0.89191 0.89191 returns to its original length emerges from the ground a natural flow of ground water 0.38672 0.00362 elasHc power or force. the property of a body of a light, self-propelled movement 0.00021 0.02012 springing to its original form aJer upwards or forwards being compressed, stretched, etc. 0.98951 0.98951 0.00101 the Hme of growth and progress; • High precision, not so high recall early porHon; first stage. • Bijective mapping restriction �13

  14. Weighted bipartite b -matching (WB b M) V → maximising overall weight + node capacity [l, b] a leap; a bound; a jump U 0.00187 0.00091 traditionally the first of the the season of growth four seasons of the year in 0.84391 0.84391 0.00363 temperate regions l = 0 l = 0 the elasticity of something b = 1 b = 1 that can be stretched and a place where water or oil 0.01021 0.89191 0.89191 0.89191 returns to its original length l = 1 emerges from the ground l = 1 b = 2 b = 2 a natural flow of ground water 0.38672 0.00362 elasHc power or force. the property of a body of a light, self-propelled movement 0.00021 0.02012 springing to its original form aJer upwards or forwards being compressed, stretched, etc. 0.98951 0.98951 0.98951 0.00101 • High precision, high recall the Hme of growth and progress; early porHon; first stage. • Efficient in linking polysemous items • Still difficult to tune the parameters �14

  15. Our resource alignment mechanism: schema �15

  16. Alignment Experiments: Datasets WordNet synsets manually mapped to their corresponding concepts * * Matuschek, Michael, and Iryna Gurevych. "Dijkstra-wsa: A graph-based approach to word sense alignment." Transac)ons of the Associa)on for �16 Computa)onal Linguis)cs 1 (2013): 151-164.

  17. Experiments: WordNet-Wiktionary alignment Previous work: Our current method: �17

  18. Conclusion • Graph matching algorithms can be efficiently applied to lexical alignment problems • WB b M • includes more possible linking combinations by defining capacity • efficient in lexical alignment providing high precision and recall • difficult to find optimal parameters • highly dependent on the textual and definitional similarities �18

  19. Future directions • Exploring link prediction methods for lexical alignment • Extend our researches to multi-lingual resources • Using graph neural networks (GNNs) �19

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend