Discovering Contextual Information from User Reviews for - - PowerPoint PPT Presentation

discovering contextual information from user reviews for
SMART_READER_LITE
LIVE PREVIEW

Discovering Contextual Information from User Reviews for - - PowerPoint PPT Presentation

Discovering Contextual Information from User Reviews for Recommendation Purposes Konstantin Bauman , Alexander Tuzhilin CIST November 12, 2016 Context Aware Recommender Systems Netflix can improve performance of its RS up to 3% when


slide-1
SLIDE 1

Discovering Contextual Information from User Reviews for Recommendation Purposes

Konstantin Bauman, Alexander Tuzhilin

CIST

November 12, 2016

slide-2
SLIDE 2

Context Aware 
 Recommender Systems

  • “Netflix can improve performance of its RS up to 3%

when taking into account such contextual information as the time of the day or location in their recommendation algorithms” R. Hastings CEO of Netflix

  • Contextual information helps to provide better

recommendations (Adomavicius et al. 2011)

  • music (Kaminskas and Ricci 2011), (Hariri et al. 2013)
  • movies (Odic et al. 2013)
  • restaurants (Li et al. 2010)
  • hotels (Aciar 2010), (Hariri et al. 2011)

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

slide-3
SLIDE 3

Examples of Contexts 
 in Different Applications

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

Application Context Types Tourist Guide Time, Location, Weather, Traffic Movies Time, Place, Company Music Time, Location, Situation, Weather, Temperature, Noise, Illumination, Emotion, Previous Experience, User Current Interest, Last songs E-commerce Time, Intent of purchase Hotels Objective of the Trip Restaurants Time, Location, Company, Occasion

slide-4
SLIDE 4

Research Idea

Question: How to identify important contextual variables in an application?

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

Our approach: Identify the contextual variables discussed in customer reviews.

slide-5
SLIDE 5

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

Method of Context Discovery

  • 1. Separate reviews into Specific and Generic
  • 2. Identify context related Phrases
  • 3. Identify context related LDA topics
  • 4. Discover contextual variables
slide-6
SLIDE 6
  • 1. Separating reviews into 


Specific and Generic

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

slide-7
SLIDE 7
  • 1. Separating reviews into 


Specific and Generic (cont.)

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

  • number of sentences
  • number of words
  • number of verbs
  • number of verbs in the past tenses
  • VRatio – the ratio of the last two measures

Specific: describes a particular visit to an establishment Generic: describes the overall impressions The approach: K-means clustering based on the following features:

slide-8
SLIDE 8
  • 2. Method of Identifying Context

Related Phrases

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

  • 1. Part of Speech tagging
  • 2. Search for POS patterns within five word collocation

window

  • 3. Calculate phrase frequencies 


in specific and generic reviews respectively

  • 4. Select the frequent phrases and determine

ratio(ni) = ps(ni) pg(ni)

  • 5. Sort phrases by this ratio in the descending order.

(ps(ni), pg(ni))

slide-9
SLIDE 9
  • 2. Identifying Context Related

Phrases

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

Examples Phrase Specific Generic Wife 5.3% 1.6% Morning 3.1% 1.4% Birthday 2.9% 0.7%

slide-10
SLIDE 10
  • 3. Method of Identifying Context Related 


LDA Topics

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

  • 1. Build LDA model on Specific reviews
  • 2. Identify topics within the reviews
  • 3. Calculate weighted frequencies 


in specific and generic reviews respectively

  • 4. Select frequent topics and determine
  • 5. Sort topics by this ratio in the descending order.

ratio(tk) = ws(tk) wg(tk)

(ws(tk), wg(tk))

slide-11
SLIDE 11
  • 4. Discovering contextual variables

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

  • 1. Examine constructed lists of phrases and topics
  • 2. Identify groups of phrases or topics having the same

meaning in an application (e.g. “colleague”, “friend”, and “daughter”)

  • 3. Determine the values of contextual variables based
  • n these phrases and topics (e.g. “parent” value

based on the phrases “father” and “mother”)

  • 4. Create their hierarchical structure.
slide-12
SLIDE 12
  • 4. Discovering contextual variables

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

slide-13
SLIDE 13

Experimental Settings

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

Application Reviews Users Businesses Restaurant 1,344,405 384,821 24,917 Hotel 96,384 65,387 1,424 Beauty & Spa 104,199 71,422 6,536

slide-14
SLIDE 14

Contextual variables in 
 Restaurants application

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

slide-15
SLIDE 15

Contextual variables in 
 Hotels application

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

slide-16
SLIDE 16

Contextual variables in 
 Beauty & Spas application

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

slide-17
SLIDE 17

Identifying Importance of 
 Contextual Variables

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

Example: Time of the Day

slide-18
SLIDE 18

Phrase-based Method Performance

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

Ordinal numbers of phrases in sorted lists Cumulative
 numbers

  • f discovered

contextual
 variables

slide-19
SLIDE 19

LDA-based Method Performance

Konstantin Bauman, Stern School of Business NYU Konstantin Bauman, Stern School of Business NYU

Cumulative
 numbers

  • f discovered

contextual
 variables Ordinal numbers of LDA topics in sorted lists

slide-20
SLIDE 20
  • Proposed a new approach to discover Contextual Information
  • Presented a method of discovering context from customer

reviews

  • Separating reviews into Specific and Generic
  • Phrase based method
  • LDA based method
  • Tested it on 3 applications: restaurant, hotel, beauty&spas
  • Extracted almost all of the contextual information
  • Produced more comprehensive sets than used previously
  • All the discovered variables are useful for rating predictions

CONCLUSION

Konstantin Bauman, Stern School of Business NYU