exploiting domain knowledge in aspect extraction
play

Exploiting Domain Knowledge in Aspect Extraction Meichun Hsu - PowerPoint PPT Presentation

Exploiting Domain Knowledge in Aspect Extraction Meichun Hsu Zhiyuan (Brett) Chen Malu Castellanos Arjun Mukherjee Riddhiman Ghosh Bing Liu Aspect Extraction Extracting aspect terms Aspect Terms This camera takes beautiful pictures but


  1. Exploiting Domain Knowledge in Aspect Extraction Meichun Hsu Zhiyuan (Brett) Chen Malu Castellanos Arjun Mukherjee Riddhiman Ghosh Bing Liu

  2. Aspect Extraction Extracting aspect terms

  3. Aspect Terms This camera takes beautiful pictures but its price is higher than $200.

  4. Aspect Terms This camera takes beautiful pictures but its price is higher than $200.

  5. Aspect Extraction Extracting aspect terms Clustering terms into categories

  6. Clustering Picture Price Photo Cost Image Money Aspect 1 Aspect 2

  7. Existing Work For extracting only Word frequency + syntactic dependency (e.g., Hu and Liu, 2004) Supervised sequence labeling/classification (e.g., Liu, Hu and Cheng 2005)

  8. Existing Work For extracting only Word frequency + syntactic dependency (e.g., Hu and Liu, 2004) Supervised sequence labeling/classification (e.g., Liu, Hu and Cheng 2005) For clustering only Grouping aspect terms (e.g., Zhai et al., 2010)

  9. Existing Work For both extracting and clustering Topic models (e.g., Mukherjee and Liu, 2012; Kim et al., 2013; Lazaridou et al., 2013; Lin and He, 2009; Lu and Zhai, 2008; Moghaddam and Ester, 2011; Sauper et al., 2011; Titov and McDonald, 2008;)

  10. Issues of Unsupervised Topic Models Many aspects/topics are not meaningful. Objective functions do not correlate well with human judgments (Chang et al., 2009).

  11. Remedy: Knowledge-based Topic Models

  12. Knowledge-based Topic Models Multiple senses DF-LDA Adverse effect Seeded Models Utilize “cannot” knowledge

  13. Knowledge-based Topic Models DF-LDA (Andrzejewski et al., 2009) Must-Link Picture Photo Cannot-Link Picture Price

  14. Knowledge-based Topic Models DF-LDA (Andrzejewski et al., 2009) Seeded models (Burns et al., 2012; Jagarlamudi et al., 2012; Lu et al., 2011; Mukherjee and Liu, 2012)

  15. Knowledge-based Topic Models Multiple senses Light

  16. Knowledge-based Topic Models Multiple senses {Light, Bright} Light {Light, Heavy}

  17. Knowledge-based Topic Models Multiple senses Light {Light, Bright, Heavy}

  18. Knowledge-based Topic Models Adverse effect of knowledge Color {Price, Cost} Price Cheap Color Price Cheap Cost Cost Pricy Pricy Cost … … Price

  19. Knowledge-based Topic Models Utilize “cannot” knowledge Amazon Amazon Price Review Expensive {Amazon,Price} Shipping Review Order Shipping Price Order Expensive Money Money Cheap Cheap

  20. Knowledge-based Topic Models Multiple senses DF-LDA Adverse effect Seeded Utilize “cannot” Models knowledge

  21. Addressing Issues Multiple senses Adverse effect Utilize “cannot” knowledge

  22. Addressing Issues Multiple senses Adding variable s Adverse effect GPU Model Utilize “cannot” E-GPU Model knowledge

  23. M-Set and C-Set Must-set: {Price, Cost, Money} Do not enforce transitivity. Cannot-set: {Price, Color, Size}

  24. Addressing First Issue Multiple senses Adding variable s Adverse effect GPU Model Utilize “cannot” E-GPU Model knowledge

  25. MDK-LDA (Chen et al., IJCAI 2013)

  26. MDK-LDA (Chen et al., IJCAI 2013) S1: {Light, Heavy, Weight} S2: {Light, Bright, Luminance}

  27. Addressing Second Issue Multiple senses Adding variable s Adverse effect GPU Model Utilize “cannot” E-GPU Model knowledge

  28. Simple Pólya Urn Model (SPU)

  29. Simple Pólya Urn Model (SPU)

  30. Simple Pólya Urn Model (SPU)

  31. Simple Pólya Urn Model (SPU)

  32. Simple Pólya Urn Model (SPU)

  33. Simple Pólya Urn Model (SPU) The richer get richer!

  34. Interpreting LDA Under SPU

  35. Interpreting LDA Under SPU price Topic 0

  36. Interpreting LDA Under SPU price price Topic 0

  37. Generalized Pólya Urn Model (GPU)

  38. Generalized Pólya Urn Model (GPU)

  39. Generalized Pólya Urn Model (GPU)

  40. Generalized Pólya Urn Model (GPU)

  41. Generalized Pólya Urn Model (GPU)

  42. Applying GPU price Topic 0

  43. Applying GPU price price money cost Topic 0

  44. Addressing Third Issue Multiple senses Adding variable s Adverse effect GPU Model Utilize “cannot” E-GPU Model knowledge

  45. Our Proposed E-GPU Model Topic 0 Topic 1 Topic 2

  46. E-GPU Model price Topic 0 Topic 1 Topic 2

  47. E-GPU Model price price money cost Topic 0 Topic 1 Topic 2

  48. E-GPU Model color {price, color} Topic 0 Topic 1 Topic 2

  49. E-GPU Model color 8 1 1 “color” “color” “color” Topic 0 Topic 1 Topic 2

  50. E-GPU Model color 8 1 1 “color” “color” “color” Topic 0 Topic 1 Topic 2

  51. E-GPU Model amazon {price, amazon} Topic 0 Topic 1 Topic 2

  52. E-GPU Model amazon 0 10 0 “amazon” “amazon” “amazon” Topic 0 Topic 1 Topic 2

  53. amazon E-GPU Model 0 10 0 “amazon” “amazon” “amazon” Topic 0 Topic 1 Topic 2

  54. amazon E-GPU Model 0 10 0 “amazon” “amazon” “amazon” Topic 0 Topic 1 Topic 2 Topic 3

  55. Addressing Issues Multiple senses Adding variable s Adverse effect GPU Model Utilize “cannot” E-GPU Model knowledge

  56. Evaluation

  57. Evaluation Four domains Knowledge Objective Evaluation Human

  58. Model Comparison LDA (Blei et al., 2003) LDA-GPU (Mimno et al., 2011) DF-LDA (Andrzejewski et al., 2009) MC-LDA

  59. Model Comparison LDA LDA-GPU DF-M DF-LDA DF-MC M-LDA MC-LDA MC-LDA

  60. Model Comparison LDA LDA-GPU Baselines DF-M DF-LDA DF-MC M-LDA MC-LDA MC-LDA

  61. Objective Evaluation Topic Coherence

  62. Objective Evaluation Topic Coherence

  63. Human Evaluation Precision @ 5

  64. Human Evaluation Precision @ 10

  65. Example Aspects

  66. Conclusions Discover meaningful aspects using knowledge

  67. Conclusions Discover meaningful aspects using knowledge Multiple senses Adverse effect Utilize “cannot” knowledge

  68. Conclusions Discover meaningful aspects using knowledge Multiple senses Adding variable s Adverse effect GPU Model Utilize “cannot” E-GPU Model knowledge

  69. Datasets: http://www.cs.uic.edu/~zchen/

  70. Datasets: http://www.cs.uic.edu/~zchen/

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend