semantic categories using automatically acquired
play

Semantic Categories using Automatically Acquired Symmetric Patterns - PowerPoint PPT Presentation

Minimally Supervised Classification to Semantic Categories using Automatically Acquired Symmetric Patterns Roy Schwartz + , Roi Reichart * and Ari Rappoport + + The Hebrew University, * Technion IIT COLING 2014


  1. Minimally Supervised Classification to Semantic Categories using Automatically Acquired Symmetric Patterns Roy Schwartz + , Roi Reichart * and Ari Rappoport + + The Hebrew University, * Technion IIT COLING 2014

  2. http://www.slideshare.net/halucinex/friend-word-map Minimally Supervised Classification to Semantic 2 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  3. DS Hypothesis (Harris, 1954) – ... tokens to date, friend lists and recent ... – ... by my dear friend and companion, Fritz von ... – ... even have a friend who never fails ... – ... by my worthy friend Doctor Haygarth of ... – ... and as a friend pointed out to ... – ... partner, in-laws, relatives or friends speak a different ... – ... petition to a friend Go to the ... – ... otherwise, to a friend or family member ... – ...images from my friend Rory though - ... – ... great, and a friend as well as a colleague, who, ... – … Examples taken from the ukwac corpus (Baroni et al., 2009) Minimally Supervised Classification to Semantic 3 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  4. 0 0.5 0.76 -0.12 0.76 0 0 -0.51 . . . Minimally Supervised Classification to Semantic 4 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  5. 0 0.5 0.76 -0.12 friend Θ 0.76 0 colleague 0 -0.51 . . . Minimally Supervised Classification to Semantic 4 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  6. – ... tokens to date, friend lists and recent ... – ... by my dear friend and companion, Fritz von ... – ... even have a friend who never fails ... – ... by my worthy friend Doctor Haygarth of ... – ... and as a friend pointed out to ... – ... partner, in-laws, relatives or friends speak a different ... – ... petition to a friend Go to the ... – ... otherwise, to a friend or family member ... – ...images from my friend Rory though - ... – ... great, and a friend as well as a colleague, who, ... – … Minimally Supervised Classification to Semantic 5 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  7. – ... by my dear friend and companion, Fritz von ... – ... partner, in-laws, relatives or friends speak a different ... – ... great, and a friend as well as a colleague, who, ... – … Minimally Supervised Classification to Semantic 5 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  8. – ... by my dear friend and companion, Fritz von ... – ... partner, in-laws, relatives or friends speak a different ... – ... great, and a friend as well as a colleague, who, ... – … Minimally Supervised Classification to Semantic 5 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  9. – friend and companion – companion and friend – relatives or friends – friends or relatives – friend as well as a colleague – colleague as well as a friend Minimally Supervised Classification to Semantic 6 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  10. – friend and companion – companion and friend – relatives or friends – friends or relatives – friend as well as a colleague – colleague as well as a friend Minimally Supervised Classification to Semantic 6 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  11. Minimally Supervised Classification to Semantic 7 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  12. Symmetric Patterns Senna Brown Minimally Supervised Classification to Semantic 8 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  13. Overview • The task – Minimally supervised semantic classification • The method – Automatically acquired symmetric patterns • Results – Symmetric patterns outperform strong baselines by > 12% accuracy Minimally Supervised Classification to Semantic 9 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  14. The Task • Binary Classification of Nouns into Semantic Categories – Is “dog” an animal? – Is “couch” a tool? • Use minimal supervision Minimally Supervised Classification to Semantic 10 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  15. The Task Example • Animals Computer Chair Hammer House Couch Purse Dog Cat Whale Rat Car Mole Apple Owl Minimally Supervised Classification to Semantic 11 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  16. The Task Goal • Animals Computer Chair Hammer House Couch Purse Dog Cat Whale Rat Car Mole Apple Owl Minimally Supervised Classification to Semantic 12 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  17. Symmetric Patterns Contexts Minimally Supervised Classification to Semantic 13 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  18. Symmetric Patterns to Word Similarity • S XY  the number of times X,Y appeared in the same symmetric pattern Minimally Supervised Classification to Semantic 14 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  19. Symmetric Patterns to Word Similarity • • orange  apple 1. … apples and oranges … 2. … oranges as well as apples … … K. … neither apple nor orange … K  orange  apple = Z – Z: a normalization factor Minimally Supervised Classification to Semantic 14 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  20. Symmetric Patterns to Word Similarity • • • France  England 1. 1. … England or France … 2. 2. … from France to England … … M. … England and France … K M  France  England = Z Z – Z: a normalization factor Minimally Supervised Classification to Semantic 14 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  21. Symmetric Patterns Minimally Supervised Classification to Semantic 15 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  22. Symmetric Patterns Minimally Supervised Classification to Semantic 15 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  23. Automatically Extracted Symmetric Patterns The (Davidov and Rappoport, 2006) Algorithm • A graph-based algorithm – Input: a corpus of plain text – Output: a set of symmetric patterns Minimally Supervised Classification to Semantic 16 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  24. Automatically Extracted Symmetric Patterns The (Davidov and Rappoport, 2006) Algorithm • – – • The idea: search for patterns with interchangeable word pairs – For each pattern candidate, compute symmetry measure (M) – Select the patterns with the highest M values Minimally Supervised Classification to Semantic 16 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  25. Automatically Extracted Symmetric Patterns The (Davidov and Rappoport, 2006) algorithm • The M measure counts the proportion of pattern instances that appear in both directions (“cat and dog” + “dog and cat”) – See paper for more details • High M value  A symmetric pattern Minimally Supervised Classification to Semantic 17 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  26. Automatically Extracted Symmetric Patterns The (Davidov and Rappoport, 2006) algorithm • – • • Twenty symmetric patterns are extracted – “X and Y”, “X or Y” – “X and the Y”, “X rather than Y”, “X versus Y” Minimally Supervised Classification to Semantic 17 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  27. Word Similarity Measures S XY  Similarity Between Words X and Y • Symmetric patterns – Extract a set of symmetric patterns from plain text – S XY  the number of time X and Y participate in the same symmetric pattern Minimally Supervised Classification to Semantic 18 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  28. Word Similarity Measures S XY  Similarity Between Words X and Y • Symmetric patterns – Extract a set of symmetric patterns from plain text – S XY  the number of time X and Y participate in the same symmetric pattern • Baselines: – Senna word embeddings (Collobert et al., 2011): – S XY  cosine similarity between the word embeddings of X and Y Minimally Supervised Classification to Semantic 18 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  29. Word Similarity Measures S XY  Similarity Between Words X and Y • Symmetric patterns – Extract a set of symmetric patterns from plain text – S XY  the number of time X and Y participate in the same symmetric pattern • Baselines: – Senna word embeddings (Collobert et al., 2011): – S XY  cosine similarity between the word embeddings of X and Y – Brown Clusters (Brown et al., 1992): – S XY  1 - tree distance between X and Y clusters Minimally Supervised Classification to Semantic 18 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

  30. Word Classification • Reminder: Our Task – Minimally-supervised semantic word classification Minimally Supervised Classification to Semantic 19 Categories using Automatically Acquired Symmetric Patterns @ Schwartz et al.

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend